Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
TABLE 53C

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
4.15


22 


4228QB 




na. it \\}£X\ 


cnrunrosDTie 1 1 open reaainy name o 


q no 




436695 


tV\S Z. U J J c 


ria . I tut ou 


ESTs 


D./D 




422247 


1 11R941 
U ItWrt 


Vic 1 flfilT) 


Cy\llllt* /^-Cn-rltf*^ frvnikr 1 /llinll ntfinikr B 

suiiiic cuhibt lamiiy i ^ntgn Hmniiy a 


Q./O 


9 9 


421625 




HS.1 f <KJlHf 


to 1 5 




9 O 
£>£ 


434807 


AA364183 




hvnnthfritral nmTntn R tHHElfi 
«ypwnw*^i ptouoiii rue i icup 


^ R1 


9 9 


42B174 




ns. i £o# # o 


ESTs 


^ nn 


9 9 






( «.uu no 


M aanlane nana frnm PAf'* inAVlft etmt1v4 

n.aduuijs yyiw nuni taw^ mono, stnriiar \ 


^ 11 


9 9 


444467 


AJ150368 




ESTs 


3.81 


22 


433B32 


AA91801B 


Hs 175516 


ESTs 


6.94 


22 




.niv sutSJ 




ESTs 


a.07 


9 9 




AMR1Z11 


Vie cflCRR 
(15.00000 


chromosoma 21 open reading Jistj\B 57 


•3 em 


9 9 


432538 




Via *>7ftqc9 


sriLru-eiiiujirjixj ariogen 


OuDO 


9 4 




AJjTUOr / / 


kin 19GP9.B 

ns. l ztwJD 


rQT. 
to 1 B 




9 1 
£.1 


*t%JwvQ 


AI1R77A9 






3 ill 

Ml 


9 1 


422990 




Vie 1997KA 
ITS. l£i f 04 


QOPA1 4fr^J^"t»Al^u4 nnstnin 

Dr\OA i assoctsiec pro lg in 


o.bo 


9 ^ 






14a mftfifi 

rts. u/Di-T) 


cole 


J.UO 


9 -1 
J.I 


409267 


NM 0194*^ 




imsdmn (b@la)>)j|c9 2 




9 1 




ai n%i?i 

ALUOO 1 1 1 




CO IS 




9 4 


439012 


DtzoajD it 


Via Cd.CC 


rvuvo con fiufnoioy/HiKe z 




9 1 


433812 


AA72S02S 


Hs 971 


PFTTs VlfeuldwclmllartnTHfil't hvnnlhdH 
no i & f vroawy miiiich uj i o id i i fiypouicu 


AM 


9 1 
C.I 




AJQdlOlU 


UT* iQRQCC 
nS.I 


to is. ragtiiy similar v> match v> to 1 


4.38 


2.1 


437249 


AA4322D2 


Hs 1(11147 
rfoi iVu iff 


liy^(llcH#dl piUlcBI rUtlOti' 




9 1 




a A3 Rami 




CCT« 


3.03 


9 4 


427065 










9 1 


420624 


W23624 


Lie ITWIrtl 


ESTs 


J.Uf 


9 1 


426224 


X54017 


n&.iwuoo 


scrosin 


^ 1ft 
j, 10 


9 1 


436954 


AA740151 


Ks iaad2s 


ESTs 


■a on 


9 1 

L. 1 


444470 


AA412195 


1^7 An 
no> i \j t *vj 


ESTs 


A 97 


9 -1 


457579 




nSiOOrOl 


nrvio-iiKe suppressor 


ixdu 


9 4 


427686 


AA41H1R3 


14- 1|1£7JV1 
no, lutroy 


CO 15 




9 4 


439273 AW139099 Hs.269701 ESTs 3.83 2.1
434316 AW207552 Hs.116328 ESTs, Weakly similar to A395S4 transcrip 4.W 2.1
427015 AA397520 ESTs 4.28 2.1
421596 AW630942 Hs.106061 RD RNA-binding protein 3.30 2.1
427236 AA399959 Hs.148271 ESTs 3.07 2.1
434520 AA205273 Hs.177011 hypothetical protein 119 2.0
456051 TB5626 Hs.76239 hypothetical protein FLJ20608 an 2.0
418322 AA2B41S6 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK 6.41 2.0
451807 W52B54 hypothetical protein FLJ23293 similar to 3.52 2.0
448884 AW751955 HS52753 hypothetical protein FLJ22316 4.73 2.0
420484 W32963 Hs.98289 VRK3 for vaccinia related kinase 3 3.86 2.0
414181 AK000476 Hs.75798 hypothetical protein 3.33 2.0
424558 AF036847 H&15D490 FK506-binding protein 6 (36kD) 7.70 2.0



TABLE 54B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey 
428616 



CAT Number 
256g_1 



426599 


10110J 


426930 


1310779.1 


434317 


5995B7 W 1 


426620 


142987.1 


419350 


130B6.1 


440622 


532606 1 


421938 


B63665L.1 



BC017938 BI826643 BG715794 B6722697 B1460787 BG773459 H52B59 AI552B53 AI990773 AW6651 93 AVU34Q601 AA913805 AI337099 
BE045942 AW572790 AW515652 H15004 AA9C91 15 BI465310 BM82024 BI561 578 BI463D75 BG722527 R86003 BG623286 H15003 BI562131 
BG435272 

BE736800 BM471423 AL557221 BG763302 BF7421S6 BF991016BG200112BF920027BG5764C9 BG332214 Bte30957Al627504AW163574 
AI805171 A] 126491 AA448257 A1090641 AW1 83329 AA994873 AJ203663BE041513 AA382260 AA382261 AL554B37 BE273483 
A1609689 AA393442 AU50574 A1200886 A1221692 AA605977 AA813213 

AI209Q94AI377740AW117 J B2AWl«2WAI67409SAV«effl>1^ AA906846 
AA974625 AA884357 A1B08590 

AL042392AJ 147451 AA758821 AW450252 AA3 99310 Al 656343 A) 536668 AW515660AJ 190733 A1025812 AA723645 AA709253 AA725709 
AA398244 AA3B 2463 A 11 39637 

AJ2ie809 BC014609 BG7243B3 AJ024359 AA904573 AI138595 AA86S6B5 A1768931 BI528436 BG717350 BG719B00 AW182303 AA448161 
BI826670 BI827131 B1830254 B1824155 BI831745T19190 BI630415 

BG207562 BG192113 AA977616 AW274Q24 A1554897 A) 221 379 AA969158 AA906867 AJ873494 AIOJ5039 
AA412363 AA300575 BG77324S AA412243 AA405951 
4365BS
431986 
420431 
436501 
408415 

427745 
440115 
428524 
440933 
427829 
438735 



433792 
442122 
433760 
425956 
413372 



430254 
444467 
436605 
427015 
451807 



33C97J? 



247GB36J 

76926J 

29290J 

10131J 

4581.6 

1372622J 
34^3 

13821B4J 

930517J 

1373537J 

10316J 

36378J 



2204621J 
2684549 1 
5B49B2J 
6S7337J 
32B96J 



13102.1 

10084C0J 

1008207J 

683123J 

17758L2 



NM_004157 X1496B BG48048B BE409869 BG723898 BG476313 AU121626 BE3B6516 AI9S9297 AW172340 AA8896S8 R23436 AI01503? 
BE250558 BF590945 AW3S5993 BF983000 AA07Q235 AL556082 M7B388 AW504473 AW370139 BG913697 BE89909S BG827945 BE741233 
A101 5465 AW370169 BE29735D AA093249 
A1122A28 AA909991 AA759233 

AL591713 BF197609 AI9B5094 AW443916 A1243277 AL449530 AL449629 

AW241405 AW205071 A1671585 AI652354 AI638465 AW590359 AW662771 AW5940B7 BE502532 A1218894 BE466416 BF056295 A1247366 
AI990484 AI9T7746 AW55592S A1216456 AW182169 AA9698B4 AA723888 A101 841 9 

BI602176 BIG031 3B BI459895 BI755030 AW41 87BB AA8B3999 AA72485B AJ46031 1 AW196355 AI004313 AI651117 BE814363 AW589656 

AA443124 AA447962 B1461166 AA405629 

A1208966AA421041AA815377AM11954 

BF980396 R51074 BF9798B3 AI539370 BM128735 AA999397 AIS11039 AW593985 R41B08 
AI2O808O AA442862 AA429772 

AI1254Q4AW593312AI247364A1208217AA91O021 A1915307 

AJ 190292 AI18B225 AA416673 AA416596 AA9528B8 M972172 AA9Q6B74 

M76676 NMJQ22571 BG772522 BF516449 A15374B5 AW517245 BF762536 AA634446 AW196331 A1203035 B6722281 

AK05G32QAL522&40BI793O43AWD71691 AI433682AA855414AA702684 B1792794 H96879 RS2351 AA211126 AA442875 N25725 AA482S63 

N33446 N25222Z41110 N26507 N73447N24077 N2D492 AW275550 H9S619 AL518306 AL522041 AW959849 AL518307 M726907 A1655113 

AI309906F101B4BM451081BE257595BG721625BI82B509 BG700470F1 2568 Z45395B1 829288 AA35461 8 AA364851 AA421448T74231 

R52350 AA482415 AJ2O3021 T8B948 A1565842 

AJ024286 AA769B98 AA778661 AA868972 AA609524 

AI932330 AI190707 AJ376762 AA976847 

AAB09179 AW592321 AA75B2B2 AJ214437 AW072537 AA781937 

BIB31488 AW190479 AI472793 AA460217 AA459937 BF082576 AA393673 AA39B702 

NIVL0C6001 L11645A^205604A1207994AI187362AA709190B1462421 BG772T70 BG722772AA436S91 BG771SS5 BI553260 BF1 26025 
BF125857 BI46267Q BG724164 BI562424 BG721652 BI5S9662 BG722465 BI595415 BG717561 8G722136 BG773507 BG720572 C03867 AI016802 
AL042663 AA770436 AA435720T19365 M626698 AA759057 A120BO21 BF507B44 AI2DB05B AA412719 AA426374 AI20S775 AA977217 AA758055 
F345a5AAia0062 

BG700885 AA868017 AW341719 AA971332 A1688794 Z20462 AJ808145 AW665263 AAB84952 AA9061 36 
AW663704 AN 50366 A1216464 

A112534Q Alt25B84 A1377949 All 26470 A121 B351 AW665355 A1243952 AW663454 A124O603 At 1 87742 AA884214 AA723933 
AA857437 A19GB733 A196B93B AA992784 AA397520 AW236244 

BM479185 AL552795 AL577722 BF03B888 BM127617 BF510346 AW450B52 AAB65478 AW449519 BM127314 AI806S39 AW449522 AA993634 
AIB27626AA9047BB 



TABLE 54C

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
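
The nucleotide-position field described in the legend above can be read programmatically. The following is a minimal sketch, assuming each predicted exon is written as `start-end`, that multiple exons are separated by commas, and that both coordinates are inclusive. The function names `parse_nt_position` and `total_exon_length` are hypothetical helpers introduced for illustration and do not come from the source.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Split a 'start-end,start-end' string into (start, end) integer pairs.

    Assumption: exons are comma-separated and each exon is 'start-end'.
    """
    exons: List[Tuple[int, int]] = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma
        start_text, end_text = part.split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"exon start exceeds end: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_nt_position("196686-196823,200241-200381")
    print(exons)
    print(total_exon_length(exons))
```

Entries whose coordinates were damaged in extraction will raise `ValueError` from `int()` or from the tuple unpacking, which is a simple way to flag rows that need manual checking against the original table.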



Pkey Ref Strand Nt_position
401979 2828778 Minus 75653-75851,76977-77112
406409 9296364 Minus 141101-141255
401741 2962169 Plus 196686-196823,200241-200381
405378 9255142 Minus 125408-126800
405284 7329374 Plus 28556-28684
401692 3540172 Plus 26365-26510
403783 SDB1824 Plus 128412-128635
404271 9828129 Minus 55392-57645
401798 6730720 Plus 22331-23448
402857 9801539 Plus 13402-14133
405528 9581957 Minus 22418-22687
403328 Minus 120428-120703
401712 6682593 Minus 76418-76527,76692-76829,78737-78866,8024



TABLE 55A:

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of non-seminomatous mixed germ cell testicular cancer compared to normal adult testicular tissues
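
A row carrying the five fields in the legend above can be split into named columns. The sketch below is illustrative only: it assumes a row is written on one line with whitespace between fields, that Pkey is a six-digit number, that the Unigene number (when present) has the form `Hs.<digits>`, and that R1 is a decimal number at the end of the line. `ExpressionRow`, `ROW_PATTERN`, and `parse_row` are hypothetical names, not part of the source.

```python
import re
from typing import NamedTuple, Optional


class ExpressionRow(NamedTuple):
    pkey: str
    ex_accn: str
    unigene_id: Optional[str]  # some rows carry no Unigene number
    title: str
    r1: float


# Assumed layout: Pkey, ExAccn, optional UnigeneID, free-text title, R1.
ROW_PATTERN = re.compile(
    r"^(?P<pkey>\d{6})\s+"
    r"(?P<ex_accn>\S+)\s+"
    r"(?:(?P<unigene_id>Hs\.\d+)\s+)?"
    r"(?P<title>.+?)\s+"
    r"(?P<r1>\d+\.\d+)$"
)


def parse_row(line: str) -> Optional[ExpressionRow]:
    """Return the parsed row, or None when the line does not fit the layout."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        return None
    return ExpressionRow(
        pkey=match["pkey"],
        ex_accn=match["ex_accn"],
        unigene_id=match["unigene_id"],
        title=match["title"],
        r1=float(match["r1"]),
    )


if __name__ == "__main__":
    print(parse_row("446899 NM_005397 Hs.16426 podocalyxin-like 28.70"))
```

Returning `None` instead of raising keeps the parser usable on extraction-damaged input: rows whose Pkey or ratio was corrupted simply fail to match and can be collected for manual review.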



Pkey ExAccn UnigeneID Unigene Title R1
416GB0 AW245540 Hs.79516 brain abundant, membrane attached signal 48.80
423961 D13665 Hs.136348 periostin (OSF-2os) 43.30
412948 BE243313 HsJ34851 LIM and SH3 protein 1 42.50
428928 BE409B33 Hs.194857 cadherin 1, type 1, E-cadherin (epithel 36.80
424247 X14008 Hs.234734 lysozyme (renal amyloidosis) 29.80
414438 AI679277 Hs.76136 thioredoxin 29.40
406658 AJ920965 Hs.77951 major histocompatibility complex, class 29.25
446899 NM_005397 Hs.16426 podocalyxin-like 28.70
411573 AB029000 Hs.70823 KIAA1077 protein 28.40
432730 AW6S520 Hs.131356 ESTs 28.00
418870 AF147204 HsJ8414 chemokine (C-X-C motif), receptor 4 (fus 28.00
444381 BE387335 Hs.283713 ESTs, Weakly similar to 864054 hypotheti 27.90
446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, 26.80
447526 AL048753 Hs.303549 small inducible cytokine A2 (monocyte ch 25.00
418174 L20688 Hs.83555 Rho GDP dissociation inhibitor (GDI) bet 24.15
406B56


AW515336 


Hs.29797 


4146B2 


AI021154 


Hs.76884 


440440 


228925 


H&J188 


417? 39 


M69043 


H3.B1328 


41742B 


NM 002291 


Hs.B2124 


407862 


BE548267 


Hs.337986 


433180 


A1393742 


Hs.199067 


413787 


A13525S8 




408669 


A1493591 


Hs.7B146 


412636 


NM 004415 




414092 


Z14244 


Hs.75752 


414587 


NM_004E62 


H6.76507 


412915 


AW0S7727 


Hs.74823 


406648 


AA563730 


Hs.277477 


412247 


ARJ22375 


Hs.73793 


4461 OB 


AL03B596 


Hs,42322 


42480Q 


AL035588 


Hs.t53203 


413063 


AL035737 


Hs.75184 


415314 


N88602 


Ms 5422 


406656 


M16714 


Hs.B9B43 


426295 


AW3 57283 




406820 


A1223958 


Ks. 108124 


444S62 


AA186715 


H3.336429 


417063 


M54915 


Hs.81170 


449333 


K73444 


Ks.394 


444784 


D12455 


Hs.11951 


414420 


AA043424 


Hs.76095 


425543 


R23313 


Ha 334695 


452679 


Z42337 




420754 


W79431 


Hs 346911 


410397 


AF217517 


Hs.53042 


417640 


D30857 


Hs.82353 


415899 


X78992 


Hs 78909 


406765 


AW161678 


Hs 111334 


422105 


AI929700 


Ha.111680 


422714 


AB01S335 


Hs.1 19367 


444051 


N48373 


Hb.10247 


426996 


AW968934 


Hs.1 731 08 


445863 


R12234 


Hs.1 3396 


429614 


AI371172 


Hs. 21 1539 


410185 


BE2940G8 


Hs,737 


409038 


T97490 


Hs 50002 


410611 


AW954134 


Hi 20324 


448588 


AI970276 


Hs 156505 

4 19i l«IWUw 


421406 


AF179897 


Hs 104105 


420962 


NM_005904 


Hs. 100602 


425234 


AW152225 


Hs.165909 


452322 


BE666343 


Hs 28968 


454413 


AI653672 


Ha 4DQQ9 


415221 


W0741B 




425535 


AB 007937 


He 158287 


450000 


Al 9 52797 


Hs 1DRRS 


418151 


AA86423B 


1 UiDuJEU 


4147S9 


Al 75241 6 


I to. j f u>z.u 


412025 


AI827451 


Hs.24143 


445055 


BES12B56 


Hs 109051 


412490 


AW8D3564 




405437 


AW957744 




430542 


AI5574B6 


Hs. 11 9122 


424670 


W61215 


Hs 116651 


432409 


AA8D6538 


Hs.1 30732 


425945 


AW410669 


H 5.164280 


425995 


W67330 




4499S1 


AW265634 


Ha 133100 


435522 


N&4214 


H&.9774 


407179 


AA206465 




412623 


R2B89B 


Hk7A17D 

(la.rH 1 f V 


429976 


AA249027 




450377 


AB033091 




418509 






440869 


NM_014297 


Hs.7466 


417144 


AA382104 


Hs.81337 


451106 


BE382701 


Hs.25960 


426552 


BE297660 


HS.17032B 


433423 


BE407127 


Hs.8997 


442622 


NM 000435 


Hs,8546 


407951 


W77762 


Ha.79015 


449944 


AF290512 


Hs.58215 


423673 


BE003054 


Hs.1695 


435056 


AWO 23337 


Hs.5422 


405743 


AA91156B 


Hs.279660 


422627 


BE336857 


HS.11B7B7 


420676 


AI434760 


H3.424B 



ribosomal protein L10 


23.66 


Inhibitor Of DMA binding 3, dominant nog 


23.60 


soma domain frrYminnaTchLiJin dornsln 


9^ nn 


nuclear factor of kappa tight polyp ep lid 


22.35 


fsminln, beta 1 


21.60 


Homo sapiens cDNA FL) 10934 ffs» clone OV 


21.55 


v- j Crt>b2 avian srythroblashc leuksnia v 


21.50 


tyrosine 3-monDoxyganfiSG/tiyptDphan 5 -mo 


21.50 


plaLetst/ondothellsi] cell adhesion moJcc 


20.90 


de6moplakui (DPI, DPIl) 


20.90 


cytochrome c oxidase subimrt Vilb 


20.30 


LPS-induced TNF- alpha factor 




hShA 004541 "Hnmn carifrjc NADU rtntivrtmnanac 




major histocompatibility cornplsx, class 


19.10 


vssculsr efitjo'bcJiaJ growth factor 




A kinfisa IPRKA1 arr.hnr nmtpln ? 


174fl 


MyoD family inhibitor 


17.20 


cNflnase 34fte 1 {cattega glycoprote 


16.90 


nlvranrolfiln MfiR 


iD.au 


jriaTftr hishwrmnflMIIfw rvwinlm rJn« 


ID./ 3 


zinc finger protein 6 (CMPX1) 






ifi nn 
1D.UU 


r\.i(\jzii cluih a i wbctri 1 3 gene 


4C DC 


p?m-1 oncoQfinB; 


IO.OU 


ddrcn ornsd u |{tn 


IO.D | 






imniBcfete ©arly respcnso 3 


15.30 


IIUMMIIDl (JlVJlOlil LIUu 




tf^nRiTiPiYlhfnnR DnwlAln flnrlmnpn In Hi tr*r\ 


f ■% nn 


rihnfioflftsl nmfpln J 99 




DKP7n5R4J157 nmhaln 


14, no 








i a fin 


■diiUK, uyi |i JAJIjF|JCfJUUC 


14.0f 


endosulfinB alpha 


14.57 


KIAA0792 aena cradirci 


14 OR 


oWI'SwU ICUWyjmbcll ounBaKXI IIIOKSCU 


i4.U5 


Mnma Bsnnrnt nTOIA 1 Fl I91ftq(7fin rtma H 


44 nn 
14.UJ 


HonMssniariR rJnrw* O^fTJfl mPMA ovumnro 
j iuh nj so^iwid MVP it? estvao IIITVVn SBqUBnCB 


14 nn 

J4.UU 


hypothetical protein MGC4243 


lOiOO 


UTFrndiatfl nariv ivntprn 


11 n^ 


siti aJ 1 indudbla cvtoklnH suhfamH v A fOr 


13 GO 




U.aD 


KIAA1676 


ii 4n 
1 J.4U 




11 in 


MAD /mn^hnni sfuiTnRl rl w;iiv»rYtanl*vii r Dr 


11 nn 


1 0| TWDAJf 911 Ilia BJ IwQvAC lljrfJUUIcU 




ntiiffimflnnrfn f 1 HiftHf art cfe to ■ q\ 
yiumiauuKwi ^iinuiudiiSiiciBSD/ 


11 nn 


PNAS-123 


12J90 


annarin A1 


19 M 








1Z40 


acUn related nmtati> 9T3 rymntax cnhun 

ovum i cncu ^ji laoiii xjo IAA 1 bUUUI 1 


19 An 




11 10 


Wiskott-Aldrich svndramfi rrnlo.ti InteTac 


12.12 


SH3 Hnmatn hlmiim nlubvnln fv^lrfjlnh nr 
v^t n/ uuiimui uiiHJiii^ yiuwiiut* <uu*iiui |Ji 




Homoaanfere? cDWA* FT .1295? R cJoiwH 


11 on 




11 JW 


riboBonnal nmiprn J 1 1b 


11m 




11 on 


KlAAlS75wnlein 

fidllln nsnfor fafntTo 9*i fmffnrhnnrtrfal 


11 >3l 
11. DU 


11 AA 


i iy|Afuiauuu p Well riL 1 III 1 13 


11 An 
11.4U 


ESTs 


ii 4n 


svnnvfal saicoma fmnilnrflSnn rAnn nn 


fl 9R 


ujjiiKjaai, U91D "r> /\ UIIUJIiLEHJIIta 


11 01 
•1.41 


mefaltolhlonfiin IF ffimrJInnali 


11 m 


ribosomal protsJn 56 




K1AA12R5 nrnlpTn 


44 in 
11. ID 


A TO eunttiBBfl U IwtlWWrifnn mitnr«1<fw7r 

~ ojiiuit»*o, n udrepcr ung, rruiocnongr 


44 m 
11.10 


protein expressed h thyrokt 


11.00 


tecSn. gatectostde-Vrndng, soluble, 9 


ii!oo 


W-MYC oncogene 


10.93 


moedn 


10.91 


heat shock 70kD proteb 1A 


10^0 


Notch (Drosophlta) hornolog 3 


10.80 


an^sn ktenCfied by motiodonal antibod 


10.80 


Homo eaptens, Similar to rhotekln, clone 


10J5 


matrix mataDoprcteinasa 12 (macrophage 


10.70 


glycoprotein MSB 


10.70 


tumor protein, Ixanslaftcnelly-controlle 


10.70 


transforrnng gnowUi factor, beta-Induced 


10.65 


vav 2 oncogene 


10.60 
444060


AA340277 




42002B 


AB014680 
409604
432581 
430556 
431315 
426190 
454227 
429367 
452191 
419073 
442295 
424875

427641 

442606 

442495 

439941 

444981 

452472 

418117 

431824 

41755B 

410570 

431805 

430333 

428494 

419641 

4438M 

452852 

444821 

422200 

416003 

42542B 

452033 

414821 

429558 



AW44444B Hs.49124 

AU076465 Rs.276441 

AW967807 Hs.13797 

AW972227 H&1639B6 

AW02B302 Hs.155079 

AW953897 Hs.44743 

AB007867 Hs^78311 

AUD76408 Hs.28309 

AW372170 Hs.183918 

AI827248 Hs.224398 

R68048 Hs.9238 

AA299652 Hs.111495 

A1187945 Hs.199310 

AI270591 Hs.146116 

AW294522 Hs,149991 
AI1B4717 

A1392640 Hs.18272 

AW85539B Hs.122t0 

AW957300 Hs.294142 

A1922013 Hs.83496 
AW972B42 

AF045229 H 5,82280 
A1133096 Ks.64993 
NMJ)14053 Hs.270594 
S70114 HS.2394B9 



H3.1B4634 

Hs.118190 

Hs.134460 

H&.30B22 

Hs.11465 

Hb.347969 

Hs.7894B 

Hs.157211 

Hs.32366 

Hs77424 

Hs.207251 

Hs.3057 
406797

412347 

422392 

447197 

447832 
413593

45165B 

441224 
43B763

418113 
421748
424263

408867 

449378 

427202 

41894S 

448966 

426711 

433681 

441321 



AA233439 
BE17054B 
H73972 
AKD01972 
AA298065 
AA080B95 
X98001 
AL110251 
R53185 
MS3835 
AI391454 
X92715 
AJ025842 
AI432224 
AW970026 
NMJKJ5908 Hs-1 15945 
H36075 
AI433357 

AW374605 Hs.11007 
AA205248 

AW195351 H3.250520 
AU076QS4 Hs.7753 
Hs.342307 
Hs.154036 
Hs.81892 
Hs.99029 
H8.B3484 
KsJ5807 
NM.014718 Hs.107809 
AA974433 



Hs.73818 



AA479005 

HS0720 

AI583207 

AI272141 

AW2S0089 



447232 
449317 
436372 
422002 
432465 
417924 
446659 
41482S 
418036 
417677 
443559 
423766 
432407 
453485 
BE246762
AW372914 
AA383471 
AI004377 
AW499834
AW972301
AU077231
AJ 335361 
AA321568 
Z37976 



Hs.1757 

Hs.182265 

Hbu59892 

Hs.t73936 

Hs.89499 

Hs.86149 

Hs.343800 

H8.2003B0 

Hs.7771 

Hs.22483 

Hs,327 

Ha.132906 

K&3102B6 

Hs.111244 

Hs^7S163 

Hs.B2932 

Hs.225376 

Hb.77436 

Hb.83337 



NML816D55 Ks.82389 
AI076765 Hs.289899 



427816 
406794 
449475 
AA221036

BE620712 

HB8409 

AA159248 

A1B90243 

AI34B027 



H&300141 
Hs.13273 
Hs.33026 
Hs>iOS27 
Hs.180909 

Hs.108557 



ESTs 5.20

KIAA0015 gene product 5.16

ESTs 5.16

Homo sapiens cDNA: FLJ22765 fis, clone K 5.16

protein phosphatase 2, regulatory subuni 5.15

KIAA1435 protein 5.15

pSeafoBI 5.12

UDP-glucose dehydrogenase 5.11

Homo sapiens cDNA FLJ12797 fis, clone NT 5.11

Homo sapiens cDNA FLJ11469 fis, clone HE 5.10

hypothetical protein FLJ23516 5.10

Homo sapiens cDNA FLJ11643 fis, clone HE 5.10

ESTs 5.10

ESTs 5.10

ESTs 5.10

ESTs 5.10

amino acid transporter system A1 5.10

hypothetical protein FLJ13732 similar to 5.09

ESTs, Weakly similar to C55663 oHgcufand 5.07

linker for activation of T cells 5.06

gb:EST384937 MAGE resequences, MAGL Homo 5.06

regulator of G-protein signalling 10 5.05

ATP synthase, H+ transporting, mitochondr 5.03

FLVCR protein 5.00

TIA1 cytotoxic granule-associated RNA-bi 5.00

hypothetical protein FU20M5 5.00

Homo sapiens cDNA: FLJ21081 fis, clone C 5.00

ESTs 5.00

hypothetical protein FLJ11110 5.00

glutattiione-S-lranstease Ifce; glutathl 4.67 

cytochrome c oxidase subunlt IV 4.95 

Rao gerarrylfleraiiyltransferase, beta subu 4.94 

DKFZP586B0621 protein 4.94 

ESTs, WeaWy smter to TWSTjtUMAM TWIST 4.93 

Fc fragment of IgG, hlgnj affinity la, re 4.93 

nucleolar autemlfgen (55kD) smflar to 4.92 

zinc finger protein 74 (Oos52) 4.92 

ESTs 452 

rfbosornal protein L6 4.91 
Ubquind-cytochfomo c reductase hinge p 440 
mannosWase, beta A. lysosomal 450 
gb:yhBBbOU1 Scares placenta NblHP Homo 4.90 
ESTs 4.90 
ESTs, Weakly similar to T21697 hypothec 4.90 
gb:zq76c12.r1 Stralagene hNT neuron {937 4.90 
ESTs, Moilerayy stmlar to 138022 hypol 4.90 
csKirnernn 4,90 
hypothetical protein FU10330 490 
tumor suppressing suttraraterable candid 4.90 
WAA0101 gene product 4.90 
CCAAT/enhancer btntfing protein (C/EBP), 4.89 
SRY(sexd5torminlng region Y>box 4 4.89 
PDZ and L1M domain 1 (shin) 4.B8 
KtAAfl72B gene product 4,87 
fibroblast growth factor 4 (heparin seer 4.86 
LI cell adhesion molecule (hydrccephaius 4.82 
keratin 19 4.81 

[nterfeuHn 10 receptor, beta 4.80 
aratfiloonsto SJipoxygenase 4.60 
phosphoinosftol 3-phosphata-bindhg prot 4.80 
conserved gene amplified In osteosarcoma 4.80 
Homo sapiens cDMA FU13027 fis, done NT 4.79 
BceJI associated protein 4,75 
Homo sapiens mRNA; cDNA DKFZp762M127 rjr 4.75 
InterleoMn 10 receptor, alpha 4.73 
10A24 protein 4.73 
ESTs 471 
hypothetical protein 4.70 
iKm-metastotfc cells Z protein INM23B) 4.70 
cycKn D1 {PRAD1: parathyroid adenomatos 4.70 
ESTs 4.70 
pteckstrfn 4.70 
latent transforming growth factor beta b 4.70 
CGH18piotein 4.70 
ESTs, Moderately sim3ar to ALU8_HUMAN A 4.70 
riboscmel protein L39 4.70 
gh:zri)3f1 Zr1 Stratagem HT2 neuronal or 4.69 
hypctJietlcalpiotamPP2447 4.67 
ESTs 4.67 
peroxiredoxln 1 4.67 
ribosomai protein U 466 
hypoteebcal protein PP1057 4.6S 
427157 U51166 Hs.173824 thymine-DNA glycosylase 4.64

425410 AA310974 Hs.156828 Homo sapiens cDNA FU 10522 fis, clone NT 4.62 

407874 A1766311 Hs.289047 Homo sapiens cDNA FU 14059 fls, clone HE 4.61 

_ 417536 AA203569 Hs.191482 ESTs 4.61 

5 421818 AW332976 Hs.50098 NMJ!XJ24B9"-Homo sapiens MAOR dehydrogenas 4.61 

408491 AJ088CB3 RsJ882 ESTs 4.60 

426398 AJ249368 Hs.9B55B ESTs 4.60 

410295 M741357 nldogen (enactto} 4.60 

4/ _ 4071S8 H31679 B b*vO4a07.s1 Scares fetal liver spleen 4,60 

10 440327 R12581 Hs.191146 ESTs 4.GD 

417353 AA375752 Hs348140 HornosapJeittrnRNA; cONA DKFZp585F1822 (f 4.60 

425141 C05886 Hs.293972 ESTs 4.60 

412922 M50721 Hs.74870 H2.0 (DrosophflaJ-llke hcmeo box 1 4,59 

_ 434449 AW9534B4 Hs.3849 hypolhstical protein FU22041 similar to 4.58 

15 416987 D86957 Hs.80712 WAA0202 protein 4.57 

418583 BE387036 Hs.1 211 acW phosphatase 5, tartrate resistant 4.57 

416581 H65276 Hb.108288 ESTs 4.56 

4409B3 M20681 Hs.7594 solute carrier family 2 [fadltated gfo 4.56 

4509B6 BE618571 Hs.429 ATP synthase, H transporting. rritochoncV 4.56 

20 413663 BE2475B5 Hs,75462 BTG family, member 2 4.55 

410342 R31350 Hs.743 Fc fragment of IgE, high affinity 1, rec 4.54 

424321 W74048 Hs.1 765 lymphocyte^pecKlc protein tyrosine Mn 4.53 

453330 A126B031 Hs342389 peptldylprolyl fecmarase A {cydophiln 4.52 

417750 A1267720 Ks. 260 523 synovral sarcoma, translocated to Xchro 4.51 

25 440774 AI420611 Hs.153934 ESTs 4.51 

422693 BE300073 Hs.279860 tumor protein, translattonaHy-conh-dle 4.51 

411125 M151647 Hs.68877 cytochrome W45, alpha porypepticte 4.51 

449267 AI63864Q Hs320624 ESTs 4.51 

432605 NM_0021G4 Hs.3C65 graTuyme K (serine protease, granzyme 3; 4.50 

30 432706 NM.01323O Rs.266124 C024 antigen (smelt cell lung carcinoma 4.50 

436823 AW749865 Hs.117077 ESTs, Weafcty similar to 138052 hypothet) 4.50 

437469 AW7S3112 Hs.15514 hypothetical prolan MGC3ZE0 4.50 

413703 BE15B360 gb:PM1-KT03B3-13129WW1-hOB HT03B3 Homo 4.SD 

415526 N76535 Hs265591 ESTs, Weakly stmlar b ALUUiUMAN AlU S 4.50 

35 405623 X69392 Hs.91379 ribosomal protein L26 4.49 

456642 AW451623 Hs.109752 putsfiva c-MyorespOnsive 4.47 

406653 AA574074 Hs.77961 major histocompatibility complex, class 4.47 

408307 A1761786 Hs.2D4674 ESTs 4.46 

428297 AA236291 Hs.183583 serine (cr cysteine) proteinase inhibrto 4.46 

40 447296 AW243614 Hs.18063 Homo sapiens cDNA FU 10768 fis, clone NT 4.45 

416801 X98834 Hts.79971 sai (OrosophBaJ-IIke 2 4.45 

406B70 AA075144 gb:zm8ef06.s1 Stratagene ovarian cancer 4.45 

446291 BE397753 Hs.145Z3 interferon, gamma-indudbfe protein 30 4.44 

. _ 416297 AA167634 Hs.79172 sotute confer family 25 (ntohondrlai 4.44 

45 428773 BE256238 Hs.193163 bridging integrator 1 4.43 

427640 AFD58293 Ks.180015 I^Jopachrurna tautamerase 4.43 

437223 C15105 Hs.330716 Homo sapiens CDNAFU14368 fe, clone HE 4.43 

412265 M101325 Hs.86154 hypolhelicat protein FU12457 4.43 

442232 AI357813 Hs.337460 ESTs, Weakry similar to A47582 B-csl! gr 4.42 

50 441612 AWJ2629 Hs.1 13660 Homo sapiens cONA FU11631 fis, clone HE 441 

434858 A1568170 HS.968B6 ESTs 4.41 

40B3B0 AF123050 Hs/44532 dluMqdrJn 4.40 

411960 R77776 Hs.1B103 ESTs 4.40 

428782 X12830 Hs.193400 InterteuKh 6 receptor 4.40 

55 408360 A1806090 Hs.44344 hypothetical protein FU20534 4.40 

456629 AWB9196S histone deacetylase 3 4,40 

414416 AW409985 rte.76084 hypothefcsl protein MGC2721 4.40 

422499 AI268666 Hs.19631 ESTs, Westty similar to I38022 hypothec 4.39 

^ 414219 W20010 Hs.75823 ALU-fiised gene from chromosome 1q 4.39 

60 427779 AA906997 Hs.180780 TERA protein 438 

422340 AW296219 H&.11S325 RAB7, member RAS oncogene family-tike 1 437 

413276 Z24725 Hs.75200 mitogen Inducible 2 436 
452651 A1218918 Hs.30209 KIAAD654 protein 435 
453467 AI535997 Hs.30089 ESTs 4.35 
65 439961 8E293127 Hs.283722 GTT1 protein 4.35 
415691 AW963979 H&24723 ESTs 4.34 
435958 AW161481 Hs.1 11577 integral membrane protein 3 434 
420099 D80011 Hs.95140 K1AA0189 gene product 433 
„ J 421522 R48681 Hs.102991 hypothetical protein FU13956 433 
70 457073 AA233210 Hb.179943 ribosomal protein L11 4.31 
427337 Z46223 Hs.176663 Fc fragment c^gG, low affinity tllb, r 431 
420732 AA789133 Hs.63525 ESTs 430 
432731 R3117B Ks.237620 ftoronectto 1 4.30 
__ 437275 AW976035 Hs.292395 EST* We^y similar to A475B2 B-caBgr 4.30 
75 408784 AW971350 Hs.63386 ESTs 4.30 
441S82 AW972542 Hs.289008 Homo sapiens cDNA: FU21814 fis, clone H 430 
419926 AW900992 Hs.93796 OKFZP586D2223 protein 430 
429058 AF13B363 Hsu35254 hypothetical protein FLBB421 430 
ar . 434963 AW974957 Hs.288719 Homo sapiens cDNA FU12142 fis, clone MA 430 
50 413677 AW503116 Hs.301819 zlnciinger protein 146 4.29 
42697D BE276B91 Hs.194691 retmoic scid irduced 3 4.26 
400896 A1610447 Hs.48778 nQsan protein 4.26 
433550 AASB9061 Hs.177376 ESTs 426 
406230
435555 
448717 
437336 
416759 
447341 
410423 
403453 
428453 
444681 
416072 
450937 
447595 
418452 
430594 
425259 
431660 
439403 
409245 
437296 
406877 
419552 
406665 
452432 
448782 
407110 
422960 
432W1 
415357 
420296 
419726 
426075 
430255 
41BB99 
443035 
457415 
412220 
427509 
444633 
441384 
43195B 
422310 
439815 
417930 
418458 
4244S4 
417035 
412627 
414890 
452243 
4506B7 
444224 
451351 
407792 
439654 
406745 
409132 
41059*7 
409435 



417777 
446979 
416000 
426647 
436394 
409956 
414602 
446013 
452404 
444736 



451338 
449832 



Target Eicon 

AW105663 Ha.6947 HSPC0S9 protein 

R67419 HS.21B51 Homo sapiens cDNAFU 12900 fis. clone NT 

W52452 ribosomal protein L10 

AK0Q0976 Hs,79741 JiypoiheM protefa FU 101 16 

AF106M1 Hs.18142 arrestin. beta 2 

AW402432 Hs.63489 protefn tyrosine phosphatase, non-recept 

A18B5516 Hs.95612 ESTs 

AB01111O Hs.184367 GTPase activating protelrvtike 

AJ243937 Ha.280316 chiomosome 6 open reading frame 9 

AL1 10370 Hs.79000 growth assocteted protein 43 

R4913I Hs.26267 ATP-dapendanl Interferon response protei 

AW379130 Hs.lB953 phosphodiesterase 9A 

BE379749 Hs.B5201 C4ype (cafchim dependent, carbohydrate- 

AK000790 HS.246BB5 hypothec protein RJ207S3 

AL0492BO Hs.1 55397 Homo sapiens mRNA: cDMA DKFZp564K1 43 (fr 

BE244135 Hs.26Q238 hypothetical protein FU10642 

BE265745 ESTs, WeaWy siniiar Id ALUCJWMAN llll 

AA361 037 tHNA iscoentenylpyrophasphate tratsferas 

AA350994 Hs.20281 KIAA1700 

AA226392 Hs.179943 ribosomaJ protein L11 

AL157465 Hs.91973 hypothetical protein 

X6697S Hs.172550 |»!ypyriraia1netr^HndtngorateIn^e 

AW206003 H3.283378 Homo sapiens cDNA: FU21778 Cs, dona K 

AL050295 KIAAD758 protein 

AA018O42 Hs,252085 Piader-WilIi/ArigBlrnan5yridrorne-5 

AW890487 cedherfn 13, H-cadhain (heart) 

M93425 Hs.62 protein tyrosine phosphatase, non-recept 

AA666115 Hs.127797 Homo sapiens cOMA FU1 1381 fis. done HE 

AI199510 Hs.267912 ESTs, Weakly similar to AUJ7_HUMAM AUU S 

U5G330 Hs. 1274 bone rncrphogenerJc protein 1 

AW513691 Hs.270149 ESTs, Weakly sffnllar to 210926OA B eel 

AKD00703 Hs.323822 Homo sapiens mRNA for KTAA1551 protein, 

BE539639 Hs.173030 ESTs. Weakly similar to ALU8_HUMAN ALU S 

Z45322 HB.890S Homo sapiens clone 24889 mRNA sequence 

AKD00010 Hs.256798 hypothetical protein FU20003 

BE350058 Hs.36787 rtrornodornaTii heTrcase D^fA binding protei 

MB2505 Hs.21 61 complement component 5 receptor t (C5a I 

AF111713 Hs.286218 Junctional ad heston molecule 1 

AA447849 Hs.288660 Homo sapiens cDNA: HJ221 82 fis. clone H 

X63629 Hs.2677 cadhenn 3. type 1, P-cadherin {placenta 

AA316622 Hs.98370 cytochrome P450. subfarrily llS.potypept 

AA206079 Hs.6693 hypofoeBtri protein FU2D420 

H81136 Hs.334604 Homo sapiens mRNA for KIAA1 870 prateh, 

AA332S41 Hs.85226 lipase A, lysosomal acid, cholesterol es 

R68537 Hs.17962 ESTs 

AA192455 HsJ229BB Homo sapiens done IMAGE: 451 939, mRNAse 

BE391959 Hs74276 chloride Irrtraoeluler channel 1 

BE281095 Hs.77573 ufdhe phosphorylasB 

AA093668 Rs.28578 mrecteNInd (Drosop}Ula)-«te 

AW056261 Hsv321435 ESTs, Vfealdy sinter to ALU 1JWMAN ALUS 

AI077715 Hs.39334 jjutativeseoreledrgandlicmotogous tof 

AI720078 HS291997 ESTs.Wealdy similar to A475B2BH»llgr 

AW936356 Hs.300925 EST^WeatysimflartoAtfOIOXJinted 

AJ224538 Bs.50732 protein tag&e, AMP-acBvated, beta 2 no 

W16518 Hs^785l8 amyloid beta {A4J ptecursor-4ike proteta 

S80990 Hs-252136 ficolin ((rila^n/farriogen domamxnt 

BE2S6390 Hs.1 69718 cdponin2 

AI823763 Hs.7055 ESTs, WeaWy similar to 178885 serine/th 

A1654443 Hs.197683 ESTs 

R82342 Hs.79856 ESTs. Weakly similar to S65657 aipha-1D- 

AA243464 Hs^94101 pre~B-ce!l teuWiia transcription factor 

AA531W Hs.126705 ESTs 

AW103364 Hs.727 InhJbin, beta A {activin A, ecMi AB a 

AW53M68 Hs.76550 HbmtJ fiaptew mRNA; cDNA DKFZp564Bl264 If 

AI360167 Hs.152774 ESTs 

AW450675 Hs.212709 ESTs 

AA533491 Hs_23317 hypothetical proteni FU14661 

AA811465 H5v1 23375 ESTs 

AW005866 Hs.193969 ESTs 

AA694264 Hsj60049 ESTs 
412636 1438.1



426295 51(L1 



425995 13B046J 

429978 35194^2 

450377 12109J 

444060 6315.1 

450139 34017J 



434280 14749Q4J 
410143 MH1244_8 



BC0O3047 880794 HM_Q03405 X78138 AY007132 LOB438 AW340648 AW13I665 AI062748 AH70204 BI711078 BF350700 BI499963 AI0B7141 
AA720684 AA862331 AA605146 BM313650 A10B9749 AI359738 N69107 AW995424 AJ0B6917 A10B3995 AW340217 N99662 AI829449 A1089B39 
AI608761 A!342365 AI199076 AA90B944 A1248943 AU160053 AJ131245 A1218477 AJ077943 AAB64930 A1310394 AA872478 A1279782 W61 343 
AA565955 W46596 AA126B74 AA223241 AA491574 RB4B13 AA491520 9G055114 A1469689 BE464590 AW664539 HB7097 AL534332 C21397 
A1085941 AW028427 BG939B20 A1697089 AJ03SOOB AI125315 AI655561 AW1 50042 L20422 X57345 BM5B375 AU1 42552 B1666S01 BE888276 
ALJ1 19302 BI603754 BG705953 BI5S8754 BE296713 BGG0253BBF951911 N29226 BE9D9424AV698274 AV683116AV708195 AA127798 
Al 1 24697 D54224 F08031 AA340253 BF9233B3 BM467808 BI546644 BG7772Q0 86705941 BG468577 AU127209 AW403970 BI59763Q BI45B091 
AV689560 BI669267 BG5G6219 BI837163 B1667275 AA343750 BE7B31 12 BF671405 BF954720 H67636 H6M56 AA4B4894 81869271 BF993207 
N31547 BF94S817 BFS47918 N90630 BG980194 AA156681 BI493502 AW2731 1 8 AI473B20 AA608688 AJ 359337 AV71 2091 AAOB4101 BF592036 
N21313 BF674610H02S74 AW975323 R169D4 AA328030AA054671 R79546 8FB32310 AI249109 
BG015794BE16fl357BE15B353BE15835B BE158360 BE158352 BE156351 BE158355 
AAD75144 

AW891 955 AW604749 BE080B72 R1 5559 BE177623 AW883520 AW94S343 A12461 67 T070B2 AWa05679 W95273 M13579B W3261S AW99S418 
BE003183 AA8471 12 AW580975 AW60476Q AW385727 BE164590 BE003090 AW362791 AWG04759 AW666589 AW604756 N44337 A?378548 
BE172821 AW605768 AW999517 AW844165 BE17173BAW751633 AW910493 BE1774B4 BE177487 AA090510 AW8441 17 BE173357 AW999S78 
AH24870 BE163472 AW841B23 AW379762 AWB93297 AI2902B6 BE0B9132 AA610287 AW176676 AW607622 BE172639 AW893232 AA329629 
BE0S9008 BE178350 BE17B214 BE063291 AW820236 AW999653 BE0894B6 BE173126 BE171775 BE1 85787 AA558280 A11 74840 AW9991 12 
BE21839* BE172734BE178021 BE1 72738 BE173324 AW6G3494 AUJ35722 R38192 R6Q905 H53721 H410S2AL037917 R37795AW99B972 
M7671B9 AW044272 H50689 AA768399 AA767764 AI0878B8 H44202 BE222792 N90597 W81396 N9061 5 A1935353 BE501168 F1094S 
AWH8215 A1970480 AI627641 AW23B081 AA574090 AJ627652 A5BS1913 A1759993 N69591 N69276 BE467722 AW392780 BE172467 H92861 
A1524921 FD2989 Z3932B F02705 F01414 T8B678 A1215165 HB7220 AW374781 

BM476605 BI545004 BI834636 BG112453 BI199049 BG112758 AA149B46 H97925 AA306121 AA313204 W52451 AJ734997 AA93116B AA429766 
TABLE 55C

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position
401466 6682292 Plus 28748-29023
401192 9719502 Minus 69559-70101
402474 7547175 Minus 53526-53628,55755-55920,57530-57757
402143 8018280 Plus 113086-114800
       4760409 Plus 71716-72515
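The Nt_position column packs one or more predicted exons into a single comma-separated field. As a minimal sketch of how such a field might be read programmatically, the Python below splits it into (start, end) pairs. The function names are hypothetical, and the length calculation assumes inclusive coordinates, which the table legend does not state.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Split a field such as '53526-53628,55755-55920' into (start, end) pairs."""
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma
        start_text, end_text = part.split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"exon start {start} exceeds end {end}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming inclusive coordinates (an assumption, not stated in the legend)."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_nt_position("53526-53628,55755-55920,57530-57757")
    print(exons)
    print(total_exon_length(exons))
```

A range whose start exceeds its end raises an error rather than being silently accepted, which helps flag OCR damage in the position column.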



TABLE 56A:

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of seminomatous testicular cancer compared to normal adult testicular tissues
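Rows of this table follow the column order given in the legend, but the ExAccn and UnigeneID fields are sometimes absent. The following Python is a rough sketch of one way to split a cleaned row into its fields. The `Row` type, the `parse_row` name, and the accession pattern (one or two capital letters, an optional underscore, then five or six digits) are illustrative assumptions and not part of the source.

```python
import re
from typing import NamedTuple, Optional


class Row(NamedTuple):
    pkey: int
    exaccn: Optional[str]      # missing for some probesets
    unigene_id: Optional[str]  # missing for some probesets
    title: str
    r1: float


# Assumed layout: Pkey, optional accession, optional Hs.N identifier, title, ratio.
ROW_RE = re.compile(
    r"^(?P<pkey>\d{6})\s+"
    r"(?:(?P<acc>[A-Z]{1,2}_?\d{5,6})\s+)?"
    r"(?:(?P<ug>Hs\.\d+)\s+)?"
    r"(?P<title>.*?)\s+"
    r"(?P<r1>\d+\.\d+)$"
)


def parse_row(line: str) -> Optional[Row]:
    """Return a Row for a well-formed line, or None if the line does not match."""
    match = ROW_RE.match(line.strip())
    if match is None:
        return None
    return Row(
        pkey=int(match.group("pkey")),
        exaccn=match.group("acc"),
        unigene_id=match.group("ug"),
        title=match.group("title"),
        r1=float(match.group("r1")),
    )


if __name__ == "__main__":
    print(parse_row("414438 AI879277 Hs.76136 thioredoxin 47.30"))
    print(parse_row("427521 AW973352 ESTs 13.30"))
```

Lines that still carry OCR damage in the ratio or identifier columns simply return `None`, so they can be collected and reviewed by hand instead of being parsed incorrectly.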



Pkey ExAccn UnigeneID UnigeneTitle R1

414438 AI879277 Hs.76136 thioredoxin 47.30
424247 X14008 Hs.234734 lysozyme (renal amyloidosis) 44.80
438091 AW373062 nuclear receptor subfamily 1, group I, m 40.10
412948 BG243313 Hs.334851 LIM and SH3 protein 1 34.90
417088 M54915 Hs.81170 pim-1 oncogene 31.10
430642 AI557486 Hs.119122 ribosomal protein L13a 29.60
412915 AW087727 Hs.74823 NM_004541:Homo sapiens NADH dehydrogenas 2ai0
418174 L20688 Hs.83656 Rho GDP dissociation inhibitor (GDI) bet 28.15
406820 AI223958 Hs.108124 ribosomal protein S4, X-linked 28.13
433300 AJ034361 Hs.135150 lung type-I cell membrane-associated gly 28.10
406658 AI920965 Hs.77961 major histocompatibility complex, class 27.85
416680 AW245540 Hs.79516 brain abundant, membrane attached signal 27.70
446525 AW967069 Hs.211556 hypothetical protein MGC5487 27.20
432359 AA076045 Hs.274415 Homo sapiens cDNA FLJ10229 fis, clone HE 26.30
422578 AF239666 Hs.1545 caudal type homeo box transcription fact 25.80
429978 AA249027 ribosomal protein S6 25.40
418870 AF147204 Hs.89414 chemokine (C-X-C motif), receptor 4 (fus 24.60
440207 AI371978 Hs.128326 ESTs 24.50
425543 R23313 Hs.334895 ribosomal protein L10a 24.30
442582 BE379584 dolichyl-diphosphooligosacch 24.10
444562 AA186715 Hs.335429 RIKEN cDNA 9130422N19 gene 24.05
413063 AU035737 Hs.75184 chitinase 3-like 1 (cartilage glycoprote 24.00
409038 T97490 Hs.50002 small inducible cytokine subfamily A (Cy 22.90
420357 AA259090 Hs.257028 ESTs 22.90
406856 AW515336 Hs.29797 ribosomal protein L10 22.77
417139 M69043 Hs.81328 nuclear factor of kappa light polypept 22.75
412636 NM_004415 desmoplakin (DPI, DPII) 22.40
420678 AJ434760 Hs.4248 vav 2 oncogene 22.10
440859 NM_014297 Hs.7486 protein expressed in thyroid 21.40
446627 AI973016 Hs.15725 hypothetical protein SBBI48 21.20
410315 AI638871 Hs.17625 Homo sapiens cDNA: FLJ22524 fis, clone H 21.10
420754 W79431 Hs.346911 ribosomal protein L22 20.98
435538 AB011540 Hs.4930 low density lipoprotein receptor-related 20.90
440440 220925 Hs.7188 sema domain, immunoglobulin domain (Ig), 20.80
428490 AI971131 Hs.23889 ESTs, Weakly similar to ALU7_HUMAN ALU S 20.40
425769 U72513 Hs.159455 Human RPL13-2 pseudogene mRNA, complete 19.50
422714 AB018335 Hs.119387 KIAA0792 gene product 19.15
430253 AK001514 Hs^36844 hypothetical protein FLJ10652 19.00
413787 AI352558 tyrosine 3-monooxygenase/tryptophan 5-mo 18.50
452874 AK001061 Hs.30S(25 hypothetical protein FLJ10199 18.50
430255 AK000703 Hs.323822 Homo sapiens mRNA for KIAA1551 protein, 18.00
432606 NM_002104 Hs.3066 granzyme K (serine protease, granzyme 3; 17.90
448583 AI970276 Hs.156905 KIAA1876 17.70
444764 D12485 Hs.11951 ectonucleotide pyrophosphatase/phosphodi 17.50
428782 X12630 Hs.193400 interleukin 6 receptor 17.40
414092 Z14244 Hs.75752 cytochrome c oxidase subunit VIIb 17.20
425945 AW410669 Hs.164280 solute carrier family 25 (mitochondrial 17.15
420759 T11832 Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE 17.10
440528 BE313555 Hs.7252 KIAA1224 protein 17.05
410143 AA188169 KIAA1191 protein 17.05
421181 NM_005574 Hs.184585 LIM domain only 2 (rhombotin-like 1) 16.90
426083 AW962712 Hs.126712 ESTs, Weakly similar to AF191020 1 E2IG5 16.70
429183 AB014604 Hs.197955 KIAA0704 protein 16.70
450937 R49131 Hs.26287 ATP-dependant interferon response protei 16.60
449571 AW016812 Hs.200266 ESTs 16.50
432730 AI066520 Hs.131358 ESTs 16.20
426295 AW367283 zinc finger protein 6 (CMPX1) 16.15
439180 AQ93742 Hs.199067 v-erb-b2 avian erythroblastic leukemia v 15.90
420028 AB014680 Ha^76S carbohydrate (N-acetylglucosamine-6-O) s 15.80
421379 Y15221 Hs.103982 small inducible cytokine subfamily B (Cy 15.80
456236 AF045229 Hs.82280 regulator of G-protein signalling 10 15.70
429469 M64590 Hs.27 glycine dehydrogenase (decarboxylating; 15.50
428928 BE409838 Hs.194657 cadherin 1, type 1, E-cadherin (epitheli 14.90
452322 BE566343 Hs.28988 glutaredoxin (thioltransferase) 14.90
405656 M16714 Hs^9S43 major histocompatibility complex, class 14.85
414020 NM_002984 Hs.75703 small inducible cytokine A4 (homologous 14.71
450377 AB033091 KIAA1265 protein 14.70
425996 W67330 hypothetical protein AL110115 14.60
430332 R51750 Hs.239483 Human clone 23933 mRNA sequence 14.60
427691 AW194428 Hs.20728 ESTs 14.42
429614 AI371172 Hs.211539 hypothetical protein MGC4248 14^5
451105 BE382701 Hs.25960 N-MYC oncogene 14.21
422241 Y00062 Hs.170121 protein tyrosine phosphatase, receptor t 14.13
436860 H12751 Hs.5327 PRO1914 protein 13.90
446899 NM_005397 Hs.16426 podocalyxin-like 13.90
450000 AI952797 Hs.10888 hypothetical protein FLJ21709 13.75
403380 AF123050 Hs.44532 diubiquitin 13.70
447526 AL048753 Hs.303649 small inducible cytokine A2 (monocyte ch 13.40
427521 AW973352 ESTs 13.30
410598 AI517130 Hs.9195 Homo sapiens cDNA FLJ13698 fis, clone PL 13.25
428664 AK001666 Hs.189095 similar to SALL1 (sal (Drosophila)-like 13.23
408822 AW500715 Hs.57079 Homo sapiens cDNA FLJ13267 fis, clone OV 13.23
425289 AW139342 Hs.155530 interferon, gamma-inducible protein 16 12.70
426552 BE297660 Hs.170328 moesin 12.69
416857 AA666115 Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE 12.55
436398 H87136 Hs.5174 ribosomal protein S17 12.50
418151 AA664238 Hs.83583 actin related protein 2/3 complex, subun 12.30
453020 AL162039 Hs.31422 Homo sapiens mRNA; cDNA DKFZp434M229 (fr 12.30
410275 U85658 Hs.61795 transcription factor AP-2 gamma (activat 12.28
414587 NM_004862 Hs.76507 LPS-induced TNF-alpha factor 12.25
425875 AU077333 Hs.160453 erythrocyte membrane protein band 7.2 (s 12.25
415938 BE383607 Hs.78921 A kinase (PRKA) anchor protein 1 12.20
413384 AA490855 Hs.39429 ESTs 12.20
410185 BE294058 Hs.737 immediate early protein 12.15
407662 BE548267 Hs.337986 Homo sapiens cDNA FLJ10934 fis, clone OV 12.05
433793 AW575959 Hs.107513 ESTs, Moderately similar to KIAA1058 pro 12.00
406743 AA911565 Hs.279860 tumor protein, translationally-controlle 11.90
408989 AW361666 Hs.49500 KIAA0746 protein 11^0
430268 AK000737 Hs.237480 hypothetical protein FLJ20730 11.80
447735 AA775268 Hs.6127 Homo sapiens cDNA: FLJ23020 fis, clone L 11.80
410325 AB023154 Hs.62264 KIAA0937 protein 11.70
445817 NM_003642 Hs.13340 histone acetyltransferase 1 11.70
418299 AA279530 Hs.83968 integrin, beta 2 (antigen CD18 (p95), ly 11.61
445863 R12234 Hs.13396 Homo sapiens clone 25028 mRNA sequence 11.60
454413 AI653672 Hs.40092 PNAS-123 11.60
418460 M26315 Hs.85258 CD8 antigen, alpha polypeptide (p32) 11.40
428065 AI634046 Hs.157313 ESTs 11.40
432805 X94630 Hs.3107 CD97 antigen 11.36
422511 AU076442 Hs.117938 collagen, type XVII, alpha 1 11.35
444060 AA340277 Homo sapiens cDNA FLJ20167 fis, clone CO 11.30
452436 BE077546 Hs.31447 ESTs, Moderately similar to A46010 X-lin 11.30
405963 AA133550 Hs.250857 calcium/calmodulin-dependent protein kin 11.11
402474 NM_004079:Homo sapiens cathepsin S (CTSS 11.00
407112 AA070801 Hs.51615 ESTs, Weakly similar to ALU7_HUMAN ALU S 11.00
406786 AW161678 Hs.111334 ferritin, light polypeptide 10.95
428227 AA321649 Hs.2246 small inducible cytokine subfamily B (Cy 10.90
444656 AI277924 Hs.145199 ESTs 10.90
453856 AA804789 Hs.19447 PDZ-LIM protein mystique 10.85
440774 AI420611 Hs.153934 ESTs 10.82
408569 AI493591 Hs.78146 platelet/endothelial cell adhesion molec 10.80
431639 AK000660 Hs.266175 phosphoprotein associated with GEMs 10.80
444381 BE387335 Hs^83713 ESTs, Weakly similar to S64054 hypotheti 10.80
418509 AB026624 Hs^5539 ATP synthase, H+ transporting, mitochondr 10.70
437374 AL359571 Hs.44054 ninein (GSK3B interacting protein) 10.65
415899 X78992 Hs.78909 butyrate response factor 2 (EGF-response 10.60
450719 AJ096837 Hs.21349 ESTs, Weakly similar to RBflS_HUMAN RAS-R 10.43
424800 AL035588 Hs.153203 MyoD family inhibitor 10.40
446682 AW205632 Hs.211198 ESTs 10.40
447211 AL161961 Hs.17767 KIAA1554 protein 10.31
425234 AW152225 Hs.165909 ESTs, Weakly similar to I38022 hypotheti 10.30
422105 AI929700 Hs.111680 endosulfine alpha 10.21
417144 AA382104 Hs.81337 lectin, galactoside-binding, soluble, 9 10.20
422068 AI807519 Hs.104520 Homo sapiens cDNA FLJ13694 fis, clone PL 10.20
452651 AI216918 Hs.30209 KIAA0854 protein 10.15
418707 U97502 Hs.87497 butyrophilin, subfamily 3, member A2 10.11
426996 AW966934 Hs.173108 Homo sapiens cDNA: FLJ21897 fis, clone H 10.10
427761 AA412205 Hs.140996 ESTs 10.10
443523 AK001575 Hs.9536 hypothetical protein FLJ10713 9.90
402145 Target Exon 9.82
413666 AJ469213 Hs.71404 ESTs 9.80
446468 AB037782 Hs.15119 KIAA1361 protein 9.80
449246 AW411209 Hs.33363 hypothetical protein FLJ10883 9.80
424321 W74048 Hs.1765 lymphocyte-specific protein tyrosine kin 9.80
407179 AA206465 thymosin, beta 4, X chromosome 9.72
413497 BE177661 QtKRC1-HT059ftO2O30^01 1-h02 HT059B Homo 9.70
430068 AA464964 gfczxBOflOsI Soares ovary tumor NbHOT K 9.70
446795 AI797713 Hs.156471 ESTs 9.70
435522 M64214 Hs.9774 synovial sarcoma translocation gene on c 9.65
451864 N20370 Hs.69547 ESTs 9.65
419490 NM_006144 Hs.90708 granzyme A (granzyme 1, cytotoxic T-lymp 9.60
419504 AA974411 Hs.18672 ESTs 9.60
427711 M31659 Hs.180408 solute carrier family 25 (mitochondrial 9.60
409208 Y00093 integrin, alpha X (antigen CD11C (p150), 9.52
424950 AA802517 Hs.156974 ESTs 9.50
447534 AW953935 Hs.288655 ESTs 9.50
419223 X60111 Hs.1244 CD9 antigen (p24) 9.41
423573 BE003054 Hs.1695 matrix metalloproteinase 12 (macrophage 9.40
422950 AW890487 cadherin 13, H-cadherin (heart) 9.33
412025 AI827451 KsJ*4143 Wiskott-Aldrich syndrome protein interac 9.32
408764 AW971350 Hs.63386 ESTs 9.30
417407 AA923278 Hs.290905 ESTs, Weakly similar to protease [H.sapi 9.30
432409 AA806538 Hs.130732 KIAA1575 protein 9.30
440273 AI805392 Hs.325335 Homo sapiens cDNA: FLJ23523 fis, clone L 9.30
407110 AA018042 Hs.252085 Prader-Willi/Angelman syndrome-5 9.22
408161 AW952912 Hs.300383 hypothetical protein MGC3032 9.20
434280 BE005398 gb:CM1-BK0116-150400-189-h02 BK0116 Homo 9.20
434524 AA535931 Hs.249716 ESTs 9.20
450294 R42587 Hs.238730 hypothetical protein MGC10823 9.20
407254 AW129401 Hs.181165 eukaryotic translation elongation factor 9.10
434442 AA737415 ESTs 9.10
440268 BE270030 Hs.336959 Homo sapiens, clone IMAGE:3577185, mRNA 9.03
447519 U46258 Hs.339665 ESTs 9.00
433156 R59206 Hs.17519 Homo sapiens cDNA: FLJ22539 fis, clone H 8.98
410730 AW368860 DnaJ (Hsp40) homolog, subfamily B, membe 8.90
436823 AW749865 Hs.117077 ESTs, Weakly similar to I38022 hypotheti 8.90
442806 AW294522 Hs.149991 ESTs 8.90
433271 BE621597 Hs.14317 nucleolar protein family A, member 3 (H/ 8.89
445577 N40696 Hs.137064 cytoplasmic polyadenylation element bind 8.81
408437 AW957744 Hs.278469 lacrimal proline rich protein 8.80
420962 NM_005904 Hs.100602 MAD (mothers against decapentaplegic, Dr 8.80
431187 AW971146 Hs.293187 ESTs 8.80
421098 AI697501 Hs.192425 ESTs 8.70
424528 AW073971 Hs.238954 ESTs, Weakly similar to KIAA1204 protein 8.70
446108 AL036596 Hs.42322 A kinase (PRKA) anchor protein 2 8.70
401091 decay accelerating factor for complement 8.62
433412 AV553729 Hs.6185 CGI-44 protein; sulfide dehydrogenase C 8.60
438089 W05391 nuclear receptor subfamily 1, group I, m 8.60
431958 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta 8.60
452568 AA805534 Hs.300870 Homo sapiens mRNA; cDNA DKFZp547M072 (fr 8.59
414191 AW250089 Hs.75807 PDZ and LIM domain 1 (elfin) 8.56
411979 X85134 Hs.72984 retinoblastoma-binding protein 5 8.50
414829 AA321588 Hs.77436 pleckstrin 8.50
430162 AW450643 Hs.345348 ESTs 8.50
448412 AC19063 Hs.42532 ESTs, Moderately similar to ALU6_HUMAN A 8.50
423753 Y11312 Hs.132463 phosphoinositide-3-kinase, class 2, beta 8.45
407833 AW955632 Hs.66566 ESTs, Weakly similar to S19560 proline-r 8.43
416975 NM_004131 Hs.1051 granzyme B (granzyme 2, cytotoxic T-lymp 8.40
433208 AW002834 Hs.24095 ESTs 8.40
428970 BE276891 Hs.194691 retinoic acid induced 3 8.38
425284 AF155568 NS1-associated protein 1 8.33
437103 AA434054 Hs.80624 hypothetical protein MGC2560 8.33
408380 AW06090 Hs.44344 hypothetical protein FLJ20534 8.30
426827 AW057805 Hs.172865 methylenetetrahydrofolate dehydrogenase 8.30
453716 AA037675 Hs.152675 ESTs 8.30
418840 AI821614 Hs.185831 ESTs 8.20
434649 AA736254 Hs.165390 ESTs, Highly similar to A40350 transcrip 8.20
449656 AA002008 Hs.188633 ESTs 8.20
425535 AB007937 Hs.158287 KIAA0468 gene product 8.17
409493 AA386192 Hs.193482 Homo sapiens cDNA FLJ11903 fis, clone HE 8.13
432559 AW452848 Hs.257631 ESTs 8.10
436797 AA731491 Hs.334477 hypothetical protein MGC14879 8.10
420099 D80011 Hs.95140 KIAA0189 gene product 8.01
417640 D30857 Hs.82353 protein C receptor, endothelial (EPCR) 8.00
420337 AW295840 Hs.14555 Homo sapiens cDNA: FLJ21513 fis, clone C 8.00
434423 NM_006769 Hs.3844 LIM domain only 4 8.00
437886 BE264111 Hs.31314 retinoblastoma-binding protein 7 8.00
418522 AA605038 Hs.7149 Homo sapiens cDNA: FLJ21950 fis, clone H 7.92
433655 AL136539 Hs.3463 ribosomal protein S23 7.89
435968 AW161481 Hs.111577 integral membrane protein 3 7.89
434511 R28982 Hs.18106 ESTs 7.88
423523 AW299828 Hs.193580 ESTs 7.86
409327 L41182 Hs.53563 collagen, type IX, alpha 3 7.84
411950 R77776 Hs.18103 ESTs 7.80
434159 AW135214 Hs.191828 ESTs 7.80
447500 AJ381900 Hs.159212 ESTs 7.80
406699 W6505 Hs.182979 ribosomal protein L12 7.75
422603 BE242587 Hs.118651 hematopoietically expressed homeobox 7.66
426759 AI590401 Hs.21213 ESTs 7.66
406776 T16206 Hs.237164 ESTs, Highly similar to UDHK>iUMANUAC 7.62
422683 AW856865 gb:RC3-CT0297-29010tW)13^03 CT029? Homo 7.60
444795 AI193356 Hs.160316 ESTs 7.60
406663 U24683 immunoglobulin heavy constant mu 7.63
442821 BE391929 Hs.8762 transmembrane protein 4 7.56
412347 AW970026 Hs.73818 ubiquinol-cytochrome c reductase hinge p 7.52
407252 AA659037 Hs.163780 ESTs 7.50
414405 AI362533 KIAA0306 protein 7.50
427395 AW298741 Hs.97851 ESTs, Moderately similar to I38022 hypot 7.50
429999 AI761902 HsJ9S597 ESTs 7.50
441436 AW137772 Hs.185980 ESTs 7.50
447644 AW861622 Hs.108646 Homo sapiens cDNA FLJ14934 fis, clone PL 7.50
420943 AI718702 Hs.279930 major histocompatibility complex, class 7.46
447874 BE270640 Hs.19192 cyclin-dependent kinase 2 7.43
413420 AW410235 Hs.75348 proteasome (prosome, macropain) activato 7.42
422451 AA310753 Hs.42491 ESTs, Weakly similar to S65657 alpha-1C- 7.41
437134 AA349M4 Hs.42915 ARP2 (actin-related protein 2, yeast) ho 7.40
408912 AB011084 Hs.48924 KIAA0512 gene product; ALEX2 7.40
419839 U24577 Hs.93304 phospholipase A2, group VII (platelet-ac 7.40
431427 AK000401 Hs.252748 Homo sapiens cDNA FLJ20394 fis, clone KA 7.40
437469 AW753112 Hs.15514 hypothetical protein MGC3260 7.40
432598 AI341227 Hs.157106 ESTs 7.38
451838 
AW57S960 
AF117947 

AW5Q23B4 
AW444433 

AKW0042 
AA830856 
AW959433 
AA397769 
U67319 

AA52B140 

AW005866 

BE2711B3 

BE620712 

AA233210 
HS.29B08 
Hs.1 2461 
Hs.77367 
Ha.109051 
Hs.63042 



442492 

449971 

431773 

416401 

426501 

435080 

436876 

449523 

432666 

424201 

425277 

425246 

426728 

430299 

433735 

430556 

417535 

41B117 

417558 

424541 

447341 

407949 

442460 

42B816 

453932 

415221 

460256 

441384 

421684 

441224 

443749 

448094 

416B01 

41B259 

421633 

435937 

445612 

451653 

407136 

422693 



AA285S87 
AA447849 
BE261591 
AU076964 
R3BS2B 
H243B7 
X9B634 
AA215404 
AF121860 
AA830893 
N94126 
W16193 
T64B96 
BE300073 



Hs.292566 hypothetical protein rnJU697 7.34 

Hs.1 1 3660 Hgrna sapaens cON A FU 1 \ 631 fis, clone HE 7.30 

Hs.342307 hypothetical protein FLJ10330 7.30 

Hs^SB719 Homo sapiens cDNA FU 12142 fls. clone MA 7.30 

Ks.152940 ESTs 7.30 

ESTs 7.30 

Hs.323401 dpy-30-ltke protein 7.30 

HS-152774 ESTS 7.30 

Hs.179902 transporter-lte protein 7,20 

sBnim/glucooor8cokJ regulated Khase 7.20 

Hs.135150 lung type-l cell membrane-associated gly 7.17 

gb:og82a10.s1 NCLOGAJ\OvB Homo sapiens 7.16 

Homo sapiens mRNA far FL) 00065 protein, 7.1 2 

proteoglycan 1, secretory granule 7.10 

HS.103B32 simllartoyoaslUpf3 t varlantB 7.10 

Hs. 11 8725 selanophosphBte synthetase 2 7.10 

zkic finger protein 137 (done pHZ-30) 7.10 

Hs,97774 ESTs 7.10 

Hs.174795 POZ domain-containing guanine nucleotide 7.10 

gb;UI-HF-BR0p-ak3-f.12*Uin NIH-MGC_5 7.10 

gb:R<2-BT0548-2003CO-012-e09 BT0548 Homo 7,10 

Hs.127982 ESTs 7.09 

Hs.333893 cMyc target JP01 7.04 

Hs.136061 Homo sapiens. Similar to hypothetical pr 7.00 

Hs.180450 rtbosomal protein S24 7.00 

Hs.35254 hypothetical protein FLB6421 7.00 

Homo sapiens cDNA: FU21 122 &s» clone C 7.00 

UCHN proton 7.00 

monokine Induced by gamma Interferon 7.00 

3H3 domain binding glutamic acid-rich pr 6.97 

DKF2p5B4Jl57 protein 6-96 

Hs.326290 rrypotteScd protein FU125B1 6.96 

Hs.146233 Homo sapiens cDNA: FU2213D fis, ctone H 6,95 

H&.86617 ESTs 6.90 

Hs.1 52983 HUS1 {S. combe) checkpoint homoiog 6.90 

Hs.921 6 caspase 7, apoptoais^etated cysteine pr 6.90 

Hs/107515 ESTs,Wea^sjniiIartoTO0329hypo1hetJ 6.80 

Rs.193969 ESTs 6.B8 

Hs.155975 protein tyrosine phosphatase, receptor t 6.87 

Hs.33026 hypothetical proteh PP2447 6,85 

VBsJcls-Bssociated membrane protein 4 6.84 

Rs.179943 ribosomal protein LI 1 6.63 

Hs.14947 ESTTs 6.83 

Hs.23451B ribosomal protein 123 6.83 

Hs.288581 Homo sapiens CDNAFU1429S fis, clone PL 6.B3 

HS.26B557 pleckstrin rwmotogyJikfi domain, family 6.82 

Hs.266916 ESTs 6.90 

Hs 293616 ESTs 6, B0 

Hs*155111 hypothetical protein FU14428 6.80 

Hs£337 tsodtrate dehydrogenase 2 (MA DP), mhoc 6.90 

Hs.54443 chernofooe (DCrnotK) receptor 5 &80 

ESTs, WeaMy similar to unnamed protefci 6.79 

Hs.1742 IQ motif conMitog GTPase activating pr 6.77 

Hs.1 55478 cydlnT2 6.72 

Hb.155321 serum response factor (o-fos serum respo 6.70 

Hs.191361 hypothetical protein 6.70 

Hs.106747 serine carboicypepttdase 1 precursor prol &70 

Hs.109653 ESTs 6.70 

Hs.13797 ESTs 6.69 

Hs.191482 ESTa &B9 

Hs.83496 linker foracttvaboriDfT cells 6.67 

Hs,B2280 regulator of Gprofeln stalling 10 6.65 

Hs.1fl0559 ESTs, WeaMy similar to AS6194 thromboxa 6.62 

Ks.18142 arresfin, beta 2 6.61 

Hs.247057 ESTs, Weakly simitar to 210926QA B cell 6.60 

PRO0641 protein 6.60 

potassium Inwardy-mdrfylng channel, s &59 

ESTs, WeaWy similar to Idefline not ava 6.57 

annexlnAI 6.56 

MFH-smptted sequences wirh teuctae-ric 6.54 

rte.288660 HomosaplenscDNA:FU22182fe l doneH &51 

He.10B768 KypotheticBlprDtBinFUlOStl 6.90 

Hs.7753 calumemn 6.50 

Hs.1 43463 EST a 6.50 

ESTs,VlteaWysOTla/to138022riypoUietl 650 

sd(Dxo5opMa)-lta2 6.40 

ESTs 6,40 

Hs.106260 sorting nexin 10 6.40 

Hs.119769 ESTs 6.40 

Hs.12969 hypothetica) protein 6.40 

ESTs, Moderately slmflar to HERC2 [H.&ap 6.40 

HS.2B7420 Homo sapiens cDNA FU 11 533 fis, clone HE 6.40 

Hs.279860 tumor protein, tjaristeflonalry-coiitrallB 6.39 
Hs.62 
HS.18B662 

Ks^89008 
Ks.179943 

Hs.12708 

Hs.79172 

Hs.14601 

Hs.71913 

Hs^4769 

Hs-336G5 

Hs.250520 

Hs.263335 
Hs.155079 
Hb.184052 
Hs.324830 

Hs.98370 
Hs.46465 
Rs.5734 

Hs.6823 

Hs.32112 

Ka.199310 

Hs.1 73081 

Hs.161286 
Hs.79741 

H*J37396 
H64159 
417497 


AW402482 


Hs.82212 
Hs.19150 


413&56 


D13639 


Hs.755B6 


419556 


L729615 


Hs.91093 


429617 


X89S84 


Hs.211563 
U51T66 


Hs.173624 


446021 


BE3B9213 
413B22 


R08950 


H&272044 


412619 


T25B29 


Hs.24043 


448717 


R67419 


Hs^1851 
Hs.27555 


442358 
Hs.29798 


424723 
Hs.152337 


426931 
Hs.96967 
Hs.1 96352 


430280 


M361256 


Hs.237868 


438330 


AW450572 


Hs.257316 
BE04G554 




444794 


AI419991 


Hs. 145225 


445100 


AW1BB205 


Hs. 12311 


449659 
Hs.198699 
AA694264 


Hs.6CD49 


452404 


AW45W75 


HS^12708 


447296 


AW243614 
425097 


NMJ014247 




441607 


NM_O05O1O 


Hs.7912 


405742 


AI468091 


Hs.279860 
R31350 


Hs.743 


442333 


A1S50877 


Hs.129302 
AA479Q05 


Hs.154036 
437386 
Hs.235709 
Hs.48604 


409604 


AW444448 
418666 
Hs.90315 
Hs_236463 


428467 
Hs.1 84465 


432290 
rfe.274273 


434551 


BE387162 


Hs.2B0658 


436138 


H53323 


Hs.25717 


438915 


AA280174 


H3.2656B1 


4492T7 


AA278536 


Hs.23262 


452994 


AW962597 


Hs.31305 


416983 


BE3&703B 


Hs-1211 


437250 


BE257342 


Hs.94576 


44Q91D 


H97875 


Hs.1 17974 


406653 


AA514553 


Ha.252259 


432295 


BE091049 


Hs.343565 


400244 
BE149455 


Hs.75415 


409132 


AJ224538 


H&50732 


406746 


AA5B0395 


Ks.279860 


400395 
443229 


AI057129 


Hs.133396 


450201 


T97B38 


Hs.25722 


409636 


AA30S729 


Hb.18272 
Hs.1 11244 


444099 


DB7432 


Hs.10315 


453902 


BE5Q2341 


Hs.3402 


415189 


L34657 


HB.7B146 
406653 


AA574074 


Hs.77961 


400440 


X63957 


Hs.83870 


4J5049 


N67334 


Hs.5Q)58 


428304 


AA215702 




423180 


AF06B302 


Hs.125031 
AW762714 


Hs^174 


429412 


NM_005235 


Hs^407 


438141 


AW946871 
CD53 antigen 4.50
Homo sapiens mRNA; cDNA DKFZp564A2164 (f 4.49
cyclin D2 4.49
chitinase 1 (chitotriosidase) 4.49
CUUIyxnphoma 7 A AM
thymine-DNA glycosylase 4.48
ribosomal protein L4 4.47
ESTs, Weakly similar to ALU1_HUMAN ALU S 4.46
FK506 binding protein precursor 4.46
Homo sapiens cDNA FLJ12900 fis, clone NT 4.45
NMJ0D9B8*:Homo sapiens ribosomal protei 4.44
hypothetical protein FLJ22405 4.43
ESTs, Moderately similar to ALU4_HUMAN A 4.43
ESTs 4.42
APEX nuclease (multifunctional DNA repai 4.42
hypothetical protein FLJ20695 4.42
ESTs, Weakly similar to T12483 hypotheti 4.41
ESTs 4.40
glycogenin 2 4.40
KIAA1712 protein 4.40
protein arginine N-methyltransferase 3(h 4.40
ATPase, H+ transporting, lysosomal, non 4.40
neutrophil cytosolic factor 4 (40kD) 4.40
interleukin 7 receptor 4.40
ESTs 4.40
Sb*n41c11jt1 NCLCGAFLRDF2 Homo sapiens 4.40
ESTs 4.40
Homo sapiens clone 23570 mRNA sequence 4.40
eukaryotic translation initiation factor 4.40
ESTs 4.40
ESTs 4.40
Homo sapiens cDNA FLJ10768 fis, clone NT 4.39
PDZ domain containing guanine nucleotide 4.37
neuronal cell adhesion molecule 4.37
tumor protein, translationally-controlle 4.35
KIAA1276 protein 4.34
Fc fragment of IgE, high affinity I, rec 4.34
ESTs 4.33
tumor suppressing subtransferable candid 4.32
ESTs, Weakly similar to ALU6_HUMAN ALU S 4.32
ribosomal protein L10 4.31
Homo sapiens mRNA; cDNA DKF2p6B7BC71 1 (f 4.30
DKFZP434B168 protein 4.30
ESTs 4.30
gb:yc11c07.s1 Stratagene lung (937210) H 4.30
KIAA0007 protein 4.30
STAT induced STAT inhibitor-2 4.30
KIAA1238 protein 4.30
hypothetical protein FLJ11259 4.30
Homo sapiens cDNA FLJ10237 fis, clone HE 4.30
Homo sapiens cDNA: FLJ23454 fis, clone H 4.30
Williams-Beuren syndrome chromosome regi 4.30
KIAA1547 protein 4.30
acid phosphatase 5, tartrate resistant 4.30
hypothetical protein MGC3QB2 AJ5
ESTs 4.29
hypothetical protein FLJ23059 4.28
ribosomal protein S15a 4.28
Eos Control 4.28
beta-2-microglobulin 4.28
protein kinase, AMP-activated, beta 2 no 4.27
tumor protein, translationally-controlle 4.26
v-fos FBJ murine osteosarcoma viral onco 4.26
ESTs 4.26
ESTs 4.25
amino acid transporter system A1 4.25
hypothetical protein 4.24
solute carrier family 7 (cationic amino 4.24
ESTs 4.24
platelet/endothelial cell adhesion molec 4.22
Target Exon 4.21
major histocompatibility complex, class 4.20
nebulin 4.20
ESTs 4.20
gb3r97g10/1 MCLCGAP.GCB1 Homo sapiens 4.20
choline/ethanolaminephosphotransferase 4.20
ribosomal protein S17 4.20
POU domain, class 2, associating factor 4.20
gb:RC2^rO022^QO50(W)12J02 ET0022 Homo 4.20
AW973253 


43330B 


AA5B2718 


443559 
450850 


AA6488B6 


4537B5 


AI368236 


406854 


AAS13705 


410768 


AF038185 


419612 


AM982S7 


434203 


BE2B2Q77 


439237 


AW408158 


441374 


AA043696 


443415 


A1056523 


424338 


W78B16 


422305 


AI928242 



400233 

421959 

442622 

424795 

446231 

452933 

427681 

409061 

413B91 

414004 

417035 

410584 

417353 

423645 

430048 

431113 

434170 

434584 

435391 

446768 

448019 

451831 



Hs.301663 ESTs 

9b*3V4-DT0021-2B129947^g01 DT0021 Homo 
Hs.7634 hypothetical protein FLJ122B7 similar to
Hs.111808 ESTs
Hs.297939 cathepsin B
Hs.154993 ESTs

Hs.26578 muscleblind (Drosophila)-like
Hs.27B589 general transcription factor II, i
Hs.12002 KIAA0790 protein
Hs.15227 ESTs
Hs.1948 ribosomal protein S21
Hs.77448 aldehyde dehydrogenase 4 family member
Hs.184014 ribosomal protein L31
Hs.4953 golgi autoantigen, golgin subfamily a, 3
Hs.346B23 gb:ty77d05.x1 NCI_CGAP_Kid11 Homo sapien
Hs.79077 Homo sapiens, clone MGC290B, mRNA, comp
Hs.2345 \ 6 ribosomal protein L23
Hs.108557 hypothetical protein PP1057

v-rel avian reticuloendotheliosis viral
Hs.80205 pim-2 oncogene
Hs.272018 low molecular mass ubiquinone-binding pr

Target Exon
Hs.344096 glycoprotein (transmembrane) nmb

gteCMVBT06O148020CM21-b10 BT0601 Homo
Hs.29493 hypothetical protein FLJ20142
Hs.27B426 progestin induced protein
Hs.292812 ESTs, Weakly similar to I38022 hypotheti
Hs^26B9 ESTs
Hs.291650 ESTs

Hs.269899 ESTs, Moderately similar to ALU8_HUMAN A
Hs.151999 ESTs

Hs.2B3732 ESTs, Moderately similar to ALU1_HUMAN A
Hs.252259 ribosomal protein S3
Hs.65187 Homo sapiens clone 23700 mRNA sequence
Hs.110B13 KIAA0421 protein
Hs.283558 hypothetical protein PRO1B55
Hs.318893 ESTs, Weakly similar to A475B2 B-cell gr
Hs.7822 Homo sapiens mRNA; cDNA DKFZp564C1216 (f
Hs.133472 ESTs

Hs.49943 ESTs, Weakly similar to S65657 alpha-1C
Hs.293438 ESTs, Highly similar to AF19B488 1 trans
Eos Control 

AW751497 Hs.98370 cytochrome P450, subfamily IIS, polypept
NM_000435 Hs.8546 Notch (Drosophila) homolog 3
AW102850 Hs.153177 ribosomal protein S28
NM_002163 Hs.14453 interferon consensus sequence binding pr
AW391423 Hs.288555 Homo sapiens cDNA: FLJ22425 fis, clone H
AB01B263 Hs.180338 tumor necrosis factor receptor superfami
AI204994 Hs.7B74 Homo sapiens cDNA: FLJ21435 fis, clone C
BE271020 tumor suppressor deleted in oral cancer-

AA737033 HsJ165 ESTs, Moderately similar to 2115367A TYK
AA192455 Hs^29SB Homo sapiens clone IMAGE:451939, mRNA se
AB011112 KIAA0540 protein

AA375752 Hs.348140 Homo sapiens mRNA; cDNA DKFZp586F1B22 (f

AI215632 Hs.147487 ESTs

T65054 Hs.73605 ESTs

AK000673 Hs.274337 hypothetical protein FLJ20666

AA626509 Hs.122329 ESTs

D57341 Hs.188361 Homo sapiens cDNA FLJ12807 fis, clone NT
AA70458B Hs.58934 ESTs
AVB60305 Hs.110286 ESTs

AW947164 Hs.195641 ESTs, Moderately similar to I38022 hypot
NM_001674 Hs.460 activating transcription factor 3
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TABLE 56B 

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
438091 



CAT Number Accession 

2244B 1 AK054860A\ffi52198AV652192AV6^ 

AV646179 AWB80409 AA345002BF155189 BE06B931 X58197 AL603014 AW953629 BM263546 BE550772 AA701 084 A1681 352 AA3586B9 

AvSa^787SAI906B21 X56196 BE833835 AAS28440 BE833808 BF224205 AA70912B BE6738Q7 A19238B6 A ^Jj[^'^5^ i ^ 
AS698 AA987230 BE467708 AWB9B628 AVW98544 AK469B4 AW043642 AI28B24S A1186932 AIG35262 A11 39455 £98739 

AW1126 BF3B2770 A1268939 AI435818 BF475318 AI024767 BE174213 AA757598 AA51 3019 AA902959 A186Q794 AI334764 BF108411 
8KK105S w^mi AI337671 BFD95606 BFO95601 BF0954€8 AW890091 BF095753 AW24340DAWB98607 AWB98616 
Smm BE69946BBE174196 AW1O2023 D52715 BEG99456 D52477 

raWW«^^3m39091 BF4341BOAW579001T556B2H01811 T52522 BF945037 BF95593B 054679 D53933 R671 00 BG925552 
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425996 13B046J 

427521 513212L1 
444060 6315J 



BF99S056 R83430 Z29922 TB5791 W03942 H63289 AJ091537 BF086583 AA345570 H48870 Ha0720 T83523 BW39626 BI03770D R00353 
BFl55ie4N9e343 N79072 H01812T55581 ivj „„„ 

8E73B425 BE738323 BM126944 AW629578 AW265195 AI91673S A1394255 W573090 AI354442 AW612857 AI33955B AI919424 A1377532 
AI354441 A1308821 M772275 AW055215 AI58970S A1336532 AA806647 AVBB2125 H93575 AW071172 AW769904 AIB639B5 AW265018 
AW196655 D79562 BE042393 N75Q17 AW014741 C75509 BE748621 H92431 AW079261 AW901780 AA329482 AW9601 15 BI260621 AI767525 
R31663 BI918664 AW9B3196 CC6195 At678018 

AK056585 BG399272 AA187835 BFB21903 AV660550 AV660556 AV660502 BG564397 BE379584 BF446961 AI653056 AW973709 A1653173 
BG054997 A12B6043 B1064B79 A1658750 AI492830 AW021 142 A1472184 AWl 70066 AI0B2443 A1167921 D59940BI492088 H74l8QAWt30B86 
A1348677 AI278577 AA761 517 A1698203 AA1 15535 A1264790 AW205074 AA860452 AA554902 AIO0Q715 DB2102 BE544768 A1376090 D59939 
AW242249 AA525421 R34211 R3432B BF248064 BF241437 BF572759 BF218B32 

M77830 NM 004415 AF139Q658G6B1115 BG740377 BI712964 BG00055B AA1 28470 BI438324 K2740B BE931630 BE167165 AW370627 
AW370813 J0S211 BG6S8865 BG740734 BGS80618 BG739778 BI7658G7 BM353403 BW353248 AWt77784 AW2O5709 AW95t576 AWB48592 
8E182164 BF149266 BE94Q187 BI060445 BI060444 BF350983 BE720095 BE720C69 BE715154 BE082584 BE082S76 BE0O4047 AA857316 
BI039774 BE713B1B BE71 3548 AW170253 BE160433 BI039775 AWB86475 BM462504 BE931734 BF149264 AA340777 BF381183 BG621737 
AU127260 AW364859 BF993352 BG223489 BE819009 BF381184 BE715955 R5B704 AA852212 AW3S6566 BI090358 BF0B7707 BEB19046 
BE819005 AA377127 BE073467 BE819069 BE819048 BI036306 BG990973BUM0954BF919911 AU140155 AI951766 AU34S1BAWB04574 
BF752969 BE837009 BE925B25 BF149265 AW995615 BE814264 BKJ397B2 AU140407 8E144243 BE709B53 BF985S42 BE001923 BF93351G 
AW265328 BG436319 BE1821 66 AW365175 AW8476B8 BE818280 AW177933 BF873679 AW178000 BE082526 BF476886 BF086994 BF592276 
BE082507 BE0B2514 BE0B26O5 BF873B93 AW06BB40 AW647678 BF804153 AW365157 BE813930 BE002030 AW365153 BE184941 BF749421 
BE134920 BFB39562BE184933 BFB42254 BE698470 BE931048 BF999889 BF36B816 BE184924 BE159646BE714632 BE1B4948 BG9B6B45 
AA131128 AA099891 W39488 C0471S BF096124 BEB65341 AW7B9304AL503116BE149760BE70596? BE70S966BE705968AW848723 
AW376699 AW376B17 AW376697 BG005097 BF7511 15 BE696Q84 AWB48371 AW376782 AW848789 AW849074 AW361413 BF927725 BF09421 1 
AW997139 BE865474 BE1851 87 BE1 56621 BE715089 BE713297 BE713298 BE179915 AW799309 BF872345 BF08B576 BE705939 AW752S99 
BG0O5197 BF350086 BE715196 BE715155 BF752396 BF093B17 BF831 190 BF752409 BE006561 BG95B922 BF094833 BF094748 BF094583 
AW377699 AWE07238 BED82519 AW377700 BF349467 Al 190590 A1554403 A1392926 AU158477BI467252 AU1 59919 AI760B16 BF082516 
A1439101 AA451923 AI34Q326 A1590975 B1791 553 AI700963 AI142882 AA039975 AA946936 AA644381 BM314B84 AA702424 AI417512 
AW19Q555 AI220573 AJ304772 AI270345 AI6273B3 AA552300 AI911702 AW166807 AI346078 W95070 M149191 AA026664 AI830049 AW780435 
AKJ7B449 AI819964 AI858282 B146868B AI8B05B4 AI025932 AA026047 AA703232 AA55Q154 AA515500 AW192085 AA918281 T77861 AJ9272Qf7 
AI2052B3 BF082491 AW021347 AI56BOS6 BE939862 AA0B8866 D12062 AAQ56527 AA7B2109 W19287 W02156 AW1S0Q38 AA0227O1 T87181 
H44405 AI910434 BF082513 AI4940G9 AI270027 AIB35878 AA126330 BG681425 BE705078 R20904 BG680059 BG676647 BF764409 AA02B654 
AV745530 BI762796 BG267391 AW7987B0 BE7C6045 BE926470 AW799118 BF0B7S98 BE002273 AW879451 AI571075 BE067788 AV721320 
A1022B62 N29754 O0337B N84767 AA131077 H30146 BE71429D A1686859 Al 568892 AJ91559S AW105614 A186725B AI538577 BE926474 
BE0B7737 BG3194B6AA247685 AW79B883 AW103521 BF989173AW860B78 BE939707 BE185750BE714064BE713903BE7138BB BE713763 
BG950164 BE713B10 AW36555^ BG955489 BE005272 BF915937 AW365148 AI905927 BF9927B0 AW8S3B12 BG954443 81770853 BG679406 
BG740632 B66810B7BGB9a430 AA455100T87267 BE6962Q9 BE696210 B1089483 BE0O8273 BE872225 AW39I912 BE925515 BG677012 
BG741970 AAD264B0 BE705999 BG677157 BE00909Q BGBB1378 BE712291 BG961498 BG678984 BI040941 AA337270 AW384371 AW847442 
BIQ586S9 BE813665 W95048 W25458 AW177786 AA025851 BE931733 BF154837 BG949393 BE714441 AW99S245 BE711B01 A12B4090 
BE064323 BE719380 BE94014B BG991212 BF37571 4 8F349522 BG996267T4B793 BI013292 BE001925 AW3B5155 AW365154 AW806653 
BF763109 BE931637BE167181 BE713879 BF354008 BF67B726 H90899 AW3B5145 W3B3B2 AU9B487 

BC003O47 S80794 NM_0Q3405 X7B138 AY007132 L0B439 AW34064B AW131 665 A10B2748 AI470204 BI711078 BF350700 B1495963 AI087141 
AA720884 AABB2331 AA605146 BM313650A1089749 AI359738 NB9107 AW995424 AI08B917 An»399S AW340217 N99562 AI829449 AI0B9839 
AI608761 A1342365 Al 1 99076 AA90B944 A1248943 AU160053 AH91245 A1218477 A1077943 AAB64930 AJ310394 AAB72478 A1279782 W61343 
AA5B5955 W46596 M1 20874 AA223241 AA491574 R84B13 AA491520 BG055114 A14696B9 BH464590 AW664539 H67097 A1534332 C21397 
AJ0B5941 AW026427BG93aB20AI697089AI039CX)BAl125315At6555fi1 AWl 50042 L20422 X57345 BI458375 AU 1 42852 K666601 BE8B8276 
AU119302 BI603754BG705953B1598754BE296713BG002538BF951911 N29226BE909424 AV69B274 AV6B3116 AV708195AA127798 
A1124697 D54224 FOB031 AA340253 BF923383 BM4678QB BI546644 BG777200 BG705941 BG468577 AU127209 AW403970 BI597630 BJ45B091 
AV6895G0 BI669267 BG506219 BJB37163 BI6fi7276 AA343750 BE783112 BF671405 BF954720 H67636 H6945S AA484894 BIB69271 BF998207 
N31547BF945B17BF947918N90830BG980194AA156681 BI493502AW273118 Al473820AA608Ba8Al359337AV71209l AAD84101 BF592036 
H13301 A18B4305 AA5056B3 AW23963 AWD84401 AJ917740 R89858 AA033631 N79982 BE88527B AI63S674 AA096126 AA700018 AV707753 
AI0B2545 AU145681 AA629032 Al 421 307 AA7405B9 M150830 A1248541 AA988608M1 50478 W85437 BM310234 AA262704 T28031 A1B1 1116 
BM272753 H21979 T15405 AA938408 F04953 AI18829B AW1S2629 AA905198 BG22305B AI831016 A1766457 A1811102 AA776573 A1922133 
AA775058Al261476AA2194a9Al68B035AI872093 BE537DB4AW1 89076 D82H30A11 23121 A1583492 BE360791 R69901 W65436 BE1 55392 
BI089081 BE155394 AL12053B AW1BS100 AI359620 AI174338 N20527 W47413 AA155615 A1272249 H25293 BE61455B H69B64 AW383484 
H78021 H 1 1617 R56892 H232D4 N21530 R82499 AW383622 M774536 AL534331 W94127 W46459 AI866231 AA513231 AA1924S5 H69B44 
W85627 AW383642 AW3B3529 AA171496 AL537424 BEB148B8 BF823254 R82553 Al 809 61 7 AJ559406 

AK000357 AI571B30AA579613 AA66B790BF939495BF196886 AI9909B2BF591561 A1BB9189 AW410232 BF7397B9 Al 1 44392 BF438 721 
A1707495 A14233S9 N52503 BE855784 N94367 AI023931 AA563662 AI744624 AI221298 AJ299523 AJ299520 BI491312 A1452711 AI360730 
AA522643 AW195955 AA470473 AW392767 AI21B127 AA673430 AA90B005 AA748160 HB9523 AW020252 AWB64988 AA970424 AAB79079 
AJ5B1622 AI361357 AA46849B AW008694 AA903275 A1187724 AH87718 AA8905O5 BF092771 A1066555 BE714172 BG289028 AW080390 
BE001197BF132049 

BE860923 BG390191 AW470082 AW01 4586 AW23255 BI7H731 BGQ54894 AW780248 H31883 AW664132 AW467353 AI983152 AA617918 
BF447795 AWB357 AAB07328 AA576970 AI741153 A1755003 AM74016 A142203O AI348114 AW997055 BM271753 AP363147 BM311311 AI146640 
A1246771 AW512619 AJ35S020 BG054697 AI292234 AG15B30 A128383e C0S205 AW503423 AW272S80 N33205 AWB73021 AA070724A1753886 
AWl 92487 AI0B7151 AA658909 AJ346368 AJ335677 AA825442 AW440066 AW131357 AW513210 AI062314 AI0B5455 BE551404 AA780704 
AW00859B A17969S4 AA917471 AM00531 AA66B526 N72207 A130B482 AW440562 A1B84687 AA347280 AA063536 BF477389 AI241662 AA931543 
AA484310 AA812488 AW32216 AA665779 AI916336 AI350590 BF198106 AJ433377 AI30C63B A1672B2B AOB2741 AI351487 AW105544 AA9736Z7 
AW517914 AA715424 AAS0B464 BF3340B0 A12746W AW367201 AW572519 AW4690B8 AA382095 AI368364 A1148934 AI3571 80 AI381 1B1 
BI911347B1B71044 AA1 36325 BF084010 BF084007 AA335788 AI920878 AA809B14 BE932941 A1878261 C7530B A1148479 BE178174 W88613 

AbS33^9?AL520743 BE81 1813 N53332 N9971 6 AI581910 AA2B0655 BE710392 AV70510O AW293978 AW444566 AA281459 A1679751 AiB73695 
BG700891 BI553517 R80518 BG779771 BG534451 AA479402 AW9615BQ BF081430 A1B57B43 AJ768486 AW51211B AM79302 AWH03B4 
AWO72470Al04159BAI049699AW592fi85AA976261 BEB79747 BG1 141 19 AW183811 AA90993B BF57l€2t BF360794 BF351375 BE925699 
AL050294 BC010371 BF0B227O AL042656 BF095732 AWB12H18 BF095731 BG212397 BF67B765 BI038602 BG38B664 AW675337 BG2B9398 
BE939598 AW805570 AA527097 AW1 50540 A1693720AA743364 A1915793 N4B185 AI573107 AA043474 A1351B15 AI958490 A19107B3 R50866 
AI699181 N73808H0B1G4 

BM008551 AA3B71S2 AW953705 AIS31833 AW237429 AVTO27804 AA72903B BE5O3409 A1521935 BF739953 M70Z9B2 AA557633 AA780065 
AJ218139 AW194264 D2012O AI082715 AI9699B0 BE85768B BE32B71 1 AW953706 A1383749 A1383821 W87199 AI431759 AI796526 AI521 794 
AI796380 AWl 17545 AI749657 AI537634 N50122 . tMMtn AMnflHCC 

AW973352 BF222929 AW016853 BF059130 AI651829 BE551767 AA55B414 AI339359 BF059601 AI961 1B2 AI341422 AI20B248 AEQ6165 
AAS48736M788578 AI539081 AW029957 AA736B37 N79575 AW594357 AA480B92 

AK055070 AK055512 AK000174 BG61980B BG10B086 AI741949 A100417B BFB9193S BF378565 AWl 97163 BE8568S0 0E245124 AW67441 1 
AA490531 AWB74981 AA7407B8 AW274758 BF512523 A152127B AA548758 A1B02431 AW051662 A162B247 AI79960B AW08&103 AA23B549 
A1191529 AW273168A116B451 AW073812AW090611 AW003593AI215845AJ799616 AW47494D AA954927BM193740AW6627O4 AW090127 




WO 03/042661 



PCT/US02/36810 



413497 
430066 






1518002J 
1177709^-2 
10117^2 



4229B0 11862.2 



434230 
434442 
410730 



414405 



442495 
450139 



406819 
424677 
4389B0 
451129 
432666 
418259 



406806 



4QS794 
431155 
414150 
430915 
436716 
449625 



14749WJ 

nt73g_i 

114639J 
22448 J 



3834,5 



874209J 
112689.1 



928716.1 
34017 1 



0_0 

2S1B_37 

9t7260_1 

1495511J 

144-7 

133B53L1 



451653 MH115364J 



2167905.1 
OlO 

423456J 
QJ> 



433162 
406870 
445493 
405797 
441623 



430504 5477 w 6 



647290.1 
2B143J 



QJ> 



0_0 

1235742J 

2305332L1 

197844.1 

2472836_1 

249224J 

1418J 



AA969444 AI080438 AA552500 AW23753B AA4810BO A1246378 AA565227 AA398921 AA 207051 M72137B BF438608 AJ036295 AI8B6630 
AA904112 AJ8645Ba AW271985 AW07BB68 AA725342 BF326598 AA843572 AJ082536 AA766664 AI453279 AA435673 AI619515 AA879060 
AA234592 AAB90223 AA766824 BI258B22 M393631 AW968840 BE940639 T83865 BE762742 AW897470 WO5809 N41323 T87376 R68544 
H88711 BI0B7136 

BE177661 H06215BE144709BE144829 
AA947566 

AK074047AI144342AW014280BM145128N2B267AW20S231 AA989041 H93197AW594063 BG236296AW236606 AW081031 M765843 
BM144372 AA989341 A1B24B38 AI963970 A1637671 AW196330 BG427526 BM148789 BF893644 BF881946 

BFO90249 AW9M947 AWB90487 A1305236 DS0B45 D60S37 AA825429 W36294 AWA90410 AW08B235 BF740240 AA448709 AI350279 AA8791 1 9 
AA319510 Be02077 BE699Q15 BE702D46AW901293T99319 DB1706 BF47548B D60383 DB1751 BE699260 
BE00539B AA62B622 AA994155 
AA662701 AA633929 AA737415 

BG427950AAB26016AI903441 BG939868AW979154 AA540872 BG547134 AA457Q91 AW36S86D W88B52 R94779 AA0BB823 
BM475665 BE6449t7 AW7707B9 AW9S2971 N64B63 BM263259 AI224545 A11848G6 N691 14 AW51B902 AJ440169 AA809472 AV654440 
AA281642 AU1B5230AW337382 AI872923 AT537113 N73882 T83378 H63731 BF671764 AW897824 AI811204 AA344646 BECQ9N2 BG899664 
H91240R5054BM41701 

BM476120 BE672181 AI6976S3 AA936187 AI280B79 AW340123 A1912727 AI081775 AlQa9556 AI191349 AI871604 AI631607 AI890800 AI701917 
AW771624 AA663041 A199157B A1160G22 AA7717G3 BE089784 BBQ89788 A1222942 AW41 B516 AA3292lt AI095736 BE550294 M738345 
BE21B6B3 BE670548 BG 149505 8F0S1776 D19B21 BEGQ578G BE17B692 BE005728 8F841237 BED0564S R27634 
AW954733 AA315006 AWB56G65 

AL047S96 AA393792 A1670731 AI037957 AWB74364 AKJ38137 N62286 AI241379 BE501096 AW090696 AI927369 AI669226 AI369437 A137I075 
AW612409 AI68671 1 A1183288 AA477717 AI07B122 AA635190 AA7009B4 M781508 D81 020 BF57S223 AI356183 D79312 AI37S5SB H6U1 1 
BG2G3489 BED90666 BEOS0G64 BE090662 H26545 
All 84717 AW51 8883 AF1 21 173 

AK001838 AU135179AU134241 AV651702 AV650032AV651304 AV650101 AV651263AV66188B AV651B66AA62B554 AV651355AV651174 
AV651172 AW856145 AU1 17599 AU13538S BG2S466S AA16691 9 BG483981 AW809606 B6494194 AA62281 1 AI676156 AA687804 AI701729 
AU133725AW961387 AU1443B7AU151757AA551031 BE675412 N34769AA713483AI8S0079AI5BB91B AI361880AI20902OAA66B961 AI24C990 
AA741144 AA490899 A1200221 AW589574 W96201 BG154182 AV655159 AA328145 N3B348 AI081357 1^7671 5 AA693 346 AA742488 AI269719 
AA897483 A1886459 AU1 55B73 H04255 AW2439B6 AA557749 AI286227 R68691 R33453 AW38B097 
AA9D8472 

U09414 NM_00343B AA503545 AI022449 AA043458 AA766074 AA765 442 AAB 05052 Al 028211 AW609708 
AW502384 AA628822 AI982587 
BE072B81 AJ762181 BE072946 

AA55B585 AA565499AI360576 AW204069AA991848 AA864939 

BM310925 AA426110 BM310629 BF434286AW015091 BF475996 AW11BB67 BE8751B6 AI68B588AM53594AW590589 AI6S2425 A1B279S9 
BF056946 AI802866 A13933B0 AI476224 AW5S0639 AW136271 A1458252 AJ524726 AAB4376B M782158 AJ336058 AI097532 AW4S1563 
AA459408 AA459633 AA418444 W236Q7 BG94015G A1493445 AW054729 AI221929 A1868744 AA215405 AA766713 AA621 546 BP928317 
BE464132AJ990909AW271459AI262O61 AA215404N74332BG940151 BG952261 AA97211 5 W9631 5 AA689586 R69057 8F76S886 BE769254 
W05240 

BG119563 AW975776 BG49B478 BE245304 AW450934 BFSQ8792 AA5994B9 AA599477 A1B05268 AA0554B9 AJ 128758 AA761425 AA73185B 
A1302271 BE219432 AA6B7294 AA01 8972 AW629429 Z45675 AW961626 T30940 R34554 T84712 BG9B6B94 
A174231 1 A1025842 AA576643 
AA075144 

AV711317 A1B09936 AI80876B AI240593 A1915771 

A1432224 AW276890 A14S9346 M937014 AA553573 AI318525 AI246219 AA961591 AI270640 

BC022413 BE395396 BF75417S AA506521 BE7O6BB5BE706678 AA723159 BE153169 BE7W298E7065588Et533l2 BE70670SAW371853 
AW371849 BEtS324t BC017410 AI337912AI090244 AWO9O300 BE219837At6236B1 BE50157B BE501734A1742232 AID23964 At458424 
AA975373 AI2B5904 AI984583 AA890325 N32582 AJ3581D2 AW241694 A1038445 AI672071 A1018389 AA576391 AA977874 AW18S392 W37448 
AA612894 AJ27T548 H89551 AI699774 H89365 AA31 5805 AW57918B BC0145B4 BCM14581 AW780125 A1672414 BE328145 AW800919 8F031306 
AW172758 BE70B322 AA345675 BE875779 H2B241 H2S31 3 BF540913 BG179B88 BF110202 AA528775 W37573 BE041G44 AW366504 BM 129522 
BM129822 AI122760 BE71B200 AWB87496 AA149420 BE706307 BE53S395 BE74B765 AI373653 R75904 BF9791B5 8FB91393 B6495535 
BJ094458 BE706702 BG496559 BF24B373 HG494800 

BE219720 BF475241 AI571723 BE219848BI7B9258 A1224899 AA724864AW771487 AA480255 AW845616 AH40295 H52B00 BE21 8790 At68 1575 
AW300064 AW2B2133 H21568 AI363015A1884914 HB6948 
AW1B2329 AA613792 T05304 AW8583BS 

BG216963 BG154885 BG213710 BG204771 BG193014 BG197556 B62t7481 BG198185 BG1B3594 B1596425 AA1 15605 AI589156 BF439839 
BG18B832 AJ359615 BG190473 AI024233BF439574 AWt18065AlB72797AA610042 BG212008 A1204382 R70913 AA033534 AA781036 A1627278 
AA307285 AA034218 BG482749 AW1 62429 BI602460 AA721969 AA476516 AA476416AA903019 8F11 0884 AA307286 AA1 15471 AW964555 
AA423826 AA1 15129 AI419107 

AW088535 AJ889321 AA354221 A1337552 R42581 AW194670 AA064862 AW001 147AA864374 AA630699 AW276176 AA676615AA857965 

AI62542B AA580792 AA582038 AA581668 AA65BQ65 AAB28156 AA857160 AA9361 03 All 49335 AA936925 AA5816B4 AA9541 98 AW238461 

AI28l904AI265B12M^267AVW38ie2AAB7B535AW304286AW474334AI558415^ 

AA552513 AA564954 AWO90553 A1205612 AI245753AA954883 AA947909AI866014 

A1590243 

AW971213 AA493925 AA493567 AA676839 AA934482 
JAG 644 AA13S139 AA136026 

BG425760 BF997600 N4851B W73454 BFai6344BF997601 AA488953 
AJ43354OAA804981 AA728984 

BI91B168 AW779760 N4B674 AI376997 AA235370 BGG99146 AI913631 AI498402 AID16320 AA323193 R49021 D59344 BG986750 N45525 
BG986917T61382R49391 R45432AJ2031D7R35004F07491 R25094R35360 

BC013728 BW34032 BF0903S5 BE410706 ^391 WB0436 BF813124 R90857 R62778 H00591 H02329AA355285AKOOOB2BBC08B721 
BG7440D4 BG479141 BG823B56 BG479061 AU121103 AU138176 BG702567 BJ999840 B642277S BG700944BE280747 AUt38529 BE2B9929 
BG704110AIU123329AU122058AU138135 BG723106 BG722291 BG831041 B6705239BE740517BE207133BE252867AU139772BG714385 
BF568538 BE304393 BF971755BE2679B5 BE272012 AU1 35396 AU12453S B15516B2 BG700612 AJ815488 BG475195BE388273 BE391517 
BW14B991 BF929247 AV853435 AW2S0299 AW249189 BF093150 BF093173 AW20S32S BF057101 BF000551 AI341 108 AL162009 AK026136 
BE907957 W7323B 81711467 BG926027 AIB16428 AA837518 AI754405 BG179142 AA481485 AW023435 BE855747 A18B5101 K52163 AIO1B096 
AW90204BE677045A1523320 A1126855 H26501 AAO43680 AA976459AI03959OAI937917 AI3S1000 W94866 A1375797 AI07SBO1 A1168236 
AA599fl82 AA084368 A1342635 A1190294 N26093 A1085234 AI29B169 BF939715 A1Z231S4 H98704 BE218925 N29394 AI918735 N41520 AI147784 
A1918796 AAB54317 N22193 AA199850 AI149728 M121263 AI051074 AA5651 16 At097349 N22209 AA552917 N331 51 D52422 BF477483 
AA47659D AA5257B7 A127919B D53353AA73B063 AA55840B BI496334AA99994B AA425847 B1496335 AA9D9624 BF197591 AW023259 Al 652819 
T31424 AA0B8213T31115AI2O6650 AA97B796 AT948989 AW248762AA449265 AA2S0687 AI6B2521 AA31022723B743 AA9353B9 AW119141 
BF941067 AI470657 AI349451 AW07933B D45665 N21640 T30O71 AI446705 R60220 BE833481 R49680 E?70049 R41223 R32402 RS99B4 R701 1 1 
BM476906 BE293615 BE382443 BF155682 BE720B38 BE931983 BE720S94 BFO85B90 AA336953 9F0B1638 AA3S9915 AA3841 16 AA3S0142 




4O9019 



433009 
415995 



434372 
436812 
406799 



418905 



406B13 
418876 
447197 
4S706B 



440538 
400261 



406781 
440129 



419970 



423395 



445093 
43B962 
425097 



437365 



418806 
400244 



32320.4 



21422G6J 
2117J 



85B779J 
659779_1 
OLD 

1237115J 
517J 



Q_0 

121279J 

2176805.1 

1196J 



371165J 
975B_4 

0 0 

2B07882J 

52134_1 

13569.1 

6633_t 
22291_3 



175963J 
195763J 
23175^2 



5541J2 



245947J 
1218BJ 



418304 



10932O9.1 



B1667664 BG823235 BE31S559 BE301958 BEB911 14 BGS26267 BE253680 BG979094 AV722757 N67629 BGS97927 BI915769 BG68Q692 R62777 
BE2511 16 H56358 AW3695B8 BGG77759 BI044604 R75787 BE77O960 H69529 R699B3 AA259238 R97827 AA310379 W01 103 WB733B4 
AL564578BFO3B102 H87182 H87517 H01574T52S73 N28881 AA301397 T92375 R684Q1 AW8Q0465 A126B1728E876949 D54019 AI909769 
AW79&415 BI2223B3 A1393171 C15260 N2B959 H17129T53096 T52494 R68602 A1364765 BF687869AI81 7035 AW1 05354 BE293B20 H14206 
BI093066 C14063 AL534349 BE255883 BE254088 AA428399 AW579360 AW5793B1 N53144 D60748 D54020 BG292106 H96705 D52423 T36174 
054161 R73016 D55021 BIB572Q0 H83444 BI016954 D54163 R15563 BI818B54 BE1 52207 BID4B502 BF8B5667 BEB13212 BE165773 BF149332 
AW807045 BE3O520O 

BM480413 N289D8 H 39792 &E240B26 BE882093 BE240827 AW868637 BF739795 AA700834 AA769597 AA489668 AW968806 AW0851 96 
A109328O AI218457 AA063138 A1632958 AW5150Q5 AI570530 Z4i 724 AA74B789 A16965B4 AA062544 AA773643 AA490285 
AA761668AA573621 R09670R92814 

M9567B NM 004573 AL530754 AM39623 AW374413 BF89BB80 AW630959 BFB75526 AW402206 BF818G90 BF89306B AWS041 10 AW408049 

BG002913 AL530753 A1524C&4 AW769231 M464970 AA293723 A1095051 A1953375 AIS82938 BM146050 AW575804 AA962489 A1655426 

BM146046 T28538 BE241936 T89023 BI910963 AI416986 AI7671 1 1 AJ 422290 AW468260 9E676853 AJ656771 AJ961755 F04675 AA662826 

AA737606 BM1943B2 AA912021 AW18309B BE676B82 AI962227 AI591366 AA621765 AA293724 R51642F10194 BI909727 BF892632 AW9506CQ 

AW950138 BG258587 BF892649 A1800647 BF892710 AA353176 BF894726 AA465B38 BIO40B59 BJB37749 BE244320 H1 80S4 T74300 AW797026 

AA926790 F32814 AW761282A1933994 AA578B23 T7S372BFB99B96 9F682808 BF974969 

BQ622121 AA631424 AA9B8296 AA631373 

AW97B773 AW29B067AA810101 AW194180AA731645AJ690573 

AA90B548 

AW972B30 AM89820 AA527647 AA570362 

BM46907SAA533027 A1127512A13G8B02 AA533141 AA7D0S6D AW576028 AI610851 AI435361 BM129172 AW474544 BM 1 28899 AI81 4292 
AW502D39 AA531243 BF941B58 AW502037 AA702337 AJ41S854 AA652755 AA934364 AI300510AJ291136 AA5052S3 A1144527AI076919 
AI633534A1242473AA938561 BG055372 AA5l2B94Al67l356AA982403BF80e010AAfi63311 AAB47O56AA513301 AA359069 AA377265 
BG29120S AA402298 AA8B5766 AW8010D2 AA302290 AJ305842 AW800873 AA302492 AA478427 A1B17291 AW801 104 AW801028 AAB65744 
BF1 55979 AI374743 M478431 AI159B46 AI369757 AIB00672 BF43578B AA255451 AI937707 AW006198 A1280363 BF062434 AW801 1 1 5 
AI919181 F2B413 WD4214 AW1S2380 AWS01587 AWS01570 AAB86371 AA384251 AI302846 BE701902 M931606 H42673 R33703 AW9015S6 
AAD09B16AW901568 AW352200 AA256558 H15928BI087170 AWB00530 AA3690BB W9B562 H28652 N34644 H97650 H00956 W70039 AI142831 
AA009817 F37136 N70289 AA531347 R72374 K27488 R666Q5 
AW276131 

AA740616AA6548MAA229923 
R36075 R36167 A1366546 

BC022444 BC00413B MM_OOQ970 BC020679 X69391 D17554 BC013B63 BG779630 BG574189 B05719B6 BG494603 BG575713 AV702244 
AU1 45720 A15GB177 BE350654 BG943995 AA73091 8 AA634024 AV744673 A1611 573 A16133B1 AI6 i 3403 AI250146 AW301 759 W31 2307 
AI376488 BE13SB82 AV743357 AI371951 AJ61337B A1513570 AM5719 A1312458 A1612697 A1334453 AW2G8747 AW266318 AI343026 AI335454 
A33130G6 AI30B011 AW075176 AI34H7B2AJ334479 AW268332 A1611433A1371906A1312459 AI371647 A1370051 A1335217 AI312262 AI610971 
A1348798 AI61337B AI612698 AI34B772 A1345434 AI345283 AK12292 AA775853 AI611363 AI345286 AJ318305 BF054854 A16133B4 A1348806 
AIB02B57 At609452 AJ583301 A134925B A1345645 AI345011 A1340442 AI33541D AI310907 BF055B80 BF055774 BF055754 BF054712 AI5B3261 
AJ3096&4 BF054905 AI370092 AI348212 A13455B3 A1334554 A1312242 BF05G856 AJ61268G AI51 1382 A1610952 AJ345301 AI312235 AI610936 
AJ609836AI340650Al312472BF054840AI802fl3BAl345448A]3365()8AI31309B^ 

AI61 1948 AI5833S2 AI583284 AI583245 AI379396 AI371 935 A1340SS4 AW301840 AI8G2239 AI612025 A1611B69 AJ609852 AI345449 AJ312375 
A1312318 A1311993 AI3108958F057960 BE17B604 A1BQ26S0AJ61 2115 AIS1 1886 AJ370059 A1313286 AI&02856 AJ61 1 438 AJ31 1987 A1802821 
A1312997 BF0547H AI611889 A1376483 AI2S2445 AI611577 A1349139 BF054925 A1B02840 AI583266 A1340739 AI612D41 A13451 96 BF054633 
AJ3QB9Q9 Ai61 1B83 AI313275 BF477097 AK09466 AQ48792 AI340468 BG844254 BG941457 AW302007 BG944413 BG841475 D6B042 AA328428 
BG429596 BE3S5392 BM424161 BF663310 AV727364 BM4738B4 BG531 T78 AA307591 BF965435 
BG009500 AJ376551 AAB97445T87714 

BE564506 Z47727 BF028489 BI597545 AA1522T3 AA730753 W07604 AM06507 AA316220 AA424663 M35844 N72473 M442109 W0178B 
AA747605 AA235B94 N41438 AA382786 AI9D6S34 A1906935 BF091197 N85572 N75678 AV75809B B£719745 BE719744 BE719735 
AA63938B AA584945 AA776364 AA776365 AA865528 
AI732997 AA977633 AA86581B 
AF075009 R63109 R63C6B 

AK055270 BE348291 A11 90289 AW61 2022 AI269506 AI266578 AI26S675 AW271406 BEB79B51 BF574163 BI497126 AW903775B1917368 
AW150900 BF244813 H79201 

BC0228B1 AU150944 BG750783AW754175AW857737A19U659 AtDS0036 AA554053A1826259AA56B548 

AK057067 BF1 1 1915 AW249706 A1222199 AI987251 A181 8753 At2446B7 AW135782 BF476693 BF060935 AJ197926 AW251092 A1685636 

AI685630 BE673458 AA932894 AW205200 AA720784 BG236866 AI683152 BE677464 AW149338 A1926439 BG031285 BF915924 BG982583 

BW60025 BG991393 BE830027 BE151985 BE152208 8E152002 BEB1B337 BE7175B1 BE816354 BEB3OO30 AA377477 T51255 BF920781 

BG170B3GN900S2 

AJ207197 BF773544 AW1S6452 

AI207343 BF813684 BF928775 AA828585 

AF070570 BF439282 BF109960 AM80268 At038060 AW082339 BF516290 BE218214 AI469958 AA039955 BE644674 A1861871 AA766231 
AAB45840 W8571G AA676253 AI0B71BB AAO22908 AW953178 T33195 BF594711 AA4889&9 R55652 D81245 D80778 D81560 AW9G0933 
BF930897 BE698103 AAO4O024 BF515960 BE16S475 AA453247 AI267601 R60894 R44223 T 331 94 AA1 1 4936 W31 640 W38B29 W39109 
AA004849 H41952 W88634BF031932 BNM23354 AL041825 H29654AI908178W35754 A19D57B2M309860C04540AA340246 H84659 
BM476605 B1545004 BI834636 B6112453 BI199049 BG112759 M1 49846 H97925 AA306121 AA313204W52451 AI734997AA931168 AA429766 
W47913AAE64321 BF940241 AI0B3648 A10B941O A1347705 A1343661 AHB6232 AIB89031 N9W64 A182O039A1 459034 AV65251 2 AA622990 
BE857200 AA932996 AA740573 AI826264 AA865683 AJ 3 44550 Al 027 349 A1056087 AA442777 AA603724 AAB73347 AI056717 AI092185 AI032895 
AA535689 BF806025BFBOe061 BFB05985BF746099 BF74S097 A1309259 A1597603 BF80BO68 AIM0B53 A1129205 AI248410 H72993 AW615341 
BF805990 BFB05962 AA993B19 734373 T35604 H56242 AA648145T35907 BF808691 N94015 AV70343B BG774276 M82341 R76371 
T65754 AA229658 AA229B57 

X79449 BC017853 AL121035 BF196384 AW119044 AJ028023 AW4511 10 AI97191 1 AW015069 A1079170 AI376367 A1264113 AA829646 AA737579 
AA449679 AA740864 KMJ1011 11 U16121 AL5B7297 BG773B01 BF973874 AV687104 AA527579 AA843S25 BE706355 A10745B9 AI523475 
BEB90249 AW406263 BE074258 AV729485 BF809610BG05B619 AA677244 BE179838 AA622264 A14G010B AA740411 AM9916B AI078223 
A16B2923 BE69B559 AW375385 AA7BB739 BG984978Z40B74T17054 F096G9 AW844043U 10439 BI711870 AW245957 AU15B5S7 AA679305 
AA679316 W72510AI346029BGO59762AW251062AA132373AI925621 Al 860230 AI3401 72 AW1 92891 AI7O79O0A1094937A1O42115AI2O0901 
BE328452 AA644678 AA55t209 BE351 065 AA9707B1 M66609 AW002028 AA160825 A1422774 AW8731 14 AW073597 AW6644B3 A1218710 
AW02O550 AW1S0607 A19B4545 A1871921 A1333870 AI4528B7A1B1B335 AA39B6SS AI554424 A1274187 BE465703 AW512940AW2413B6 
AI923954 AA576649 AW16B294AA813181 AA91 2168 AKM 9738 AW514073 AA548255 A1569630 BE 71 0031 AA244182 A1341697 AA563904 
A1537990 AW51790B AW172943 239498 A1750294 AW15041 4 Al 253293 BE825720 T31860 AW150775 D20310 AA160892 AU133933 BE781 148 
AL038957 BF910979 AA352297 BG988142 AW372175BF229106 AW86B705 BE093482 BG980396 A1499917AA054452 H05484 AI82a502 
BM467331 AO140570 AL135417 BF947202 AW391926 BEB13418 BF99B473 T92021 BI02104B BM048783 AW501366 AW501342AW501549 
BE939021 BE707147 BE 160974 BE305207 W4901 1 AA947119 AA67B8D1 BE536B76 AWB97426 BS329648 BG818540 BE542344 BI919250 
BI253018 AW130996 BE074249 BE895428 BI034852 BE0B3277 BF952166 
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438141 
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413B28 
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1163608J 
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410704 1054673J 



400233 11259J 



413891 B23L1 



410554 36319J 



AA77BB49 AW946871 AW946782 AW94695S 

^1^BG!ram 1 AU124S90 BG829759 AU143705 BG701663 BG699781 AU131718 BE515064 AU121612 BG898B50 AL55B461 BM151887 
AU12875B BM263692 AU099O13 AI241350 UBB316 AW058398 AW4B9340 BF683957 A1470140 AU150993 AA633376AW1 50821 AA536142 
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BE166917 AW877S29 BE1SS928 BF351394 AVW7522 AW877528 BE166861 BE166866 BE166913 BE166919 AW877456 AW677537 BE07BB6B 
AW840571 

BOHM324 NM001Q20 BC007977 M60B54 BMO5062B BG829809 BE3B55M BCT44451 BI826914 BE440007 BI260656 BE3951 17 BE389334 
8E255792 BI194169 BI66B218 BI194376BG716213 BG714408 BE392513 AV722219 AW32a>77 BM424171 BI828267 AW958W6 BG831252 
BE392943 BE394033 BI858915 BI668334 BE62t0t9 BG706995 BE7919B5 8F9674B4 BM93636 BG761859 BM466537 BG747165 BG827488 
A1133550 BM01t511 B1227282 BG489212SG47B388 BE7277B9BI1608808GB31707 BG324692 BM470427 BKB38B9 BG831606 BG754114 
BG420536 BF308210 BE384213 BG832271 BG82B032BG481641 BF205B7S BE899041 BE271558 BI193607 BI159866 BG473786 B6397178 
B119442BBI1 17210 BG7BB326 BG759507BF975645 BF343657 BM02OS98 BGB310S2 BG829943 BG629501 BF306567 BE562511 BM050145 
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BG759B64 A1830O1O BF568381 BE90723B BI161172BI1 16773 BGB27153 BGB250BBBG335419 BG1094G4 A1929068 BE90S3S4 BE408564 
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TABLE 5BC 

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey     Ref       Strand   Nt_position
402474   7547175   Minus    53526-53628,55756-55920,57530-57767
402145   8018280   Plus     113066414800
401091   9958240   Plus     94780-94898
401466   6682292   Plus     28748-29023
401113   9986541   Minus    19419-19959
406542   7711499   Plus     117335-118473
405066   6072509   Plus     736B4-73B41,74081-74217,74610-74779,7492
401846   7712190   Minus    82775-82023,82912-33022
404854   7143420   Plus     14260-14637
405266   4156171   Minus    63337-63552
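
The exon position strings in the last column above are comma-separated lists of start-end pairs. As a minimal sketch of how such a string could be read, the following Python helper splits it into integer intervals and sums their lengths. The function names are hypothetical, and the sketch assumes inclusive coordinates with start not greater than end, which the source does not state; entries damaged in transcription (for example a start larger than its end) are rejected rather than repaired.

```python
def parse_exon_ranges(nt_position):
    """Split a 'start-end,start-end' string into (start, end) integer pairs.

    Assumption: each pair is written start-end with start <= end.
    """
    exons = []
    for chunk in nt_position.split(","):
        start_text, end_text = chunk.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            # Likely a transcription error; do not guess the intended value.
            raise ValueError("start exceeds end in range: " + chunk)
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


exons = parse_exon_ranges("53526-53628,55756-55920,57530-57767")
print(exons)
print(total_exon_length(exons))
```

Raising an error on malformed ranges keeps damaged rows visible instead of silently producing a wrong exon length.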



TABLE 57A: 703 genes upregulated in testicular cancer relative to normal body tissues

Table 57A lists about 703 genes upregulated in testicular cancer relative to normal body tissues that are likely to encode proteins amenable to modulation by small molecules,
peptides or antibodies. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this
analysis was expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA expression. The protein products of these genes often contain one or more
domains indicative of having oncogenic function, or of transducing intracellular signals, or of being modulatable by small molecules, peptides, or antibodies (e.g. pkinase, death domain,
7tm, phosphatase, or ion_transporter). Certain predicted protein domains are noted.
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Pkey : Unique Eos probeset Identifier number 

ExAccn: Exemplar accession number, GenBank accession number 

UnlGenelDi UnlGene number 

Pred.ProLDomams: Certain predicted protein domains. Abbreviations use± TM. transmembrane domain; SS, signal sequence; =Y, very likely to contai n; =M, 

^> likely to conta&i; olher protein domain abbreviations are tram PFAM (Nucleic Acids Research, 2002, 30-276-280) 

UntGeneTiffe: UniGene gene title 

R1 95lh percejitDe of tesfcular cancar AIb divided by the 50lh percentile of norma) tissues At, where the 1 0th percentile of al norma! tissue Als was subtracted 
from both lbs numerator and denominator 
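
The R1 definition above can be illustrated with a short Python sketch. This is a hedged illustration, not the procedure actually used to build the table: the percentile interpolation method (linear interpolation between sorted ranks), the function names `percentile` and `r1_ratio`, and the sample intensities are all assumptions introduced here.

```python
def percentile(values, q):
    """q-th percentile (0-100), linear interpolation between sorted ranks (assumed method)."""
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (pos - lower)


def r1_ratio(tumor_ai, normal_ai):
    """(95th pct tumor AI - 10th pct normal AI) / (50th pct normal AI - 10th pct normal AI)."""
    background = percentile(normal_ai, 10)
    numerator = percentile(tumor_ai, 95) - background
    denominator = percentile(normal_ai, 50) - background
    if denominator <= 0:
        raise ValueError("normal-tissue median must exceed the background level")
    return numerator / denominator


# Made-up average intensities, for illustration only.
normal_ai = list(range(1, 12))       # 1, 2, ..., 11
tumor_ai = list(range(0, 101, 5))    # 0, 5, ..., 100
print(r1_ratio(tumor_ai, normal_ai))
```

Subtracting the same background term from numerator and denominator means that a probeset with very low normal-tissue expression can yield a large ratio, so the guard on the denominator is a design choice of this sketch rather than something stated in the text.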

Pkey; ExAccn; UnigeneID; UnigeneTitle; Pred.Prot.Domains; R1

424687; J05070; Hs.151738; matrix metalloproteinase 9 (gelatinase B, 92kD gelatinase, 92kD type IV collagenase); matrix metalloproteinase 9 (gelatinase B; 31.23
440119; AA865455; Hs.125331; ESTs, Moderately similar to unknown [H.sapiens]; ESTs, Moderately similar to unknown [H.s; 27.37
1JC 421241; X91B17;Hs.l02866;traralwtolBS^^ 

431840; AA534906; Hs.2860; POU domain, class 5, transcription factor 1; POU domain, class 5, transcription facto; 25.03
435918; AF263538; Hs.B5232; growth differentiation factor 3; growth differentiation factor 3; 19.86
432666; AW204069; Hs.351118; ESTs, Weakly similar to unnamed protein product [H.sapiens]; ESTs, Weakly similar to unnamed protein ; 17.74
419556; U29615; Hs.91093; chitinase 1 (chitotriosidase); chitinase 1 (chitotriosidase); 17.64
452838; U65011; Hs.30743; preferentially expressed antigen in melanoma; preferentially expressed antigen in mela; 17.06
417BB6; AA214584; ; ESTs; ESTs; 15.95
412265; AA101325; Hs.86154; hypothetical protein FLJ12457; hypothetical protein FLJ12457; 15.93
425572; Aaoi107fit Hs.158307; undifferentiated embryonic cell transcription factor 1; undifferentiated embryonic cell transcri; 15.82
423905; AW579960; Hs.135150; lung type-I cell membrane-associated glycoprotein; lung type-I cell membrane-associated gly; 15.11
419741; NM_007019; Hs.93002; ubiquitin carrier protein E2-C; ubiquitin carrier protein E2-C; 15.08
4275B4; BE4102B3; Hs.179718; v-myb avian myeloblastosis viral oncogene homolog-like 2; v-myb avian myeloblastosis viral oncogen; 14.17
418696; AW9SS433; Hs.326290; hypothetical protein FLJ125B1; hypothetical protein FLJ125B1; 13.58
416B19; U77735; Hs.80205; pim-2 oncogene; pim-2 oncogene; 13.20
414034; U89277; Hs.309965; early development regulator 1 (homolog of polyhomeotic 1); early development regulator 1 (homolog o; 12.93
454077; ACC0S952; Hs.37062; insulin-like 3 (Leydig cell); insulin-like 3 (Leydig cell); 12.90
432730; AI066520; Hs.13t358; ESTs; ESTs; 12.84
446293; AT420213; Hs.149722; LIM domain transcription factor LIM-1 (hLIM-1) mRNA; LIM domain transcription factor LIM-1 (h; 12.74
423354; AB011130; Hs.127436; calcium channel, voltage-dependent, alpha 2/delta subunit 2; calcium channel, voltage-dependent, alph; 12.46
450561; AF0815t3; Hs.25f95; TGF-beta4; TGF-beta4; 12.42

1 S ^?^S^ 21 il?.!^ S ' **** to RBaB -> Hl J MA ^ RAS-RELATEO PROTEIN RAB-8B [riSBpiansJ; ESTs, Weakly slmter to RBBB HUMAN RAS-R; 12.26 

■33 431462;AW5&^72;Hs^6311:granln^^ 

431354; BE046956; Hs.251673; DNA (cytosine-5-)-methyltransferase 3 beta; DNA (cytosine-5-)-methyltransferase 3 be; 11.91
402199; ; ; Target Exon; Target Exon; 11.85
424578; AK00197$ Hs,150B9ft hypothetical protein; hypothetical protein; 11.81
416350; AF188625; Hs.1B9507; phospholipase A2, group IID; phospholipase A2, group IID; 11.67
43997$; AW60029J; Hs.6823; hypothetical protein FLJ10430; hypothetical protein FLJ10430; 11.57
410048; W76467; Hs.343874; proline oxidase homolog; proline oxidase homolog; 11.42
442573; H93366; Hs.7567; branched chain aminotransferase 1, cytosolic; branched chain aminotransferase 1, cytos; 11.42
414812; X72755; Hs.77367; monokine induced by gamma interferon; monokine induced by gamma interferon; 11.38
421917; AB028943; Hs.109445; KIAA1020 protein; KIAA1020 protein; 11.15
440006; AK000517; Hs.6844; NALP2 protein; PYRIN-Containing APAF1-like; NALP2 protein; PYRIN-Containing APAF1-li; 10.92
4146B3; S78296; Hs.76868; hypothetical protein MGC12702; hypothetical protein MGC12702; 10^1
423673; BE003054; Hs.1695; matrix metalloproteinase 12 (macrophage elastase); matrix metalloproteinase 12 (macrophage ; 10.74
433800; AI034361; Hs.135150; lung type-I cell membrane-associated glycoprotein; lung type-I cell membrane-associated gly; 10.68
429J20;AK001673;rte.ttr35^ FUlOBif; ia46
444371; BE540274; Hs.239; forkhead box M1; forkhead box M1; 10.46
441553; AA281219; Hs.12f 296; ESTs; ESTs; 10.37
426534; U58096; Hs.2051; testis specific protein, Y-linked; testis specific protein, Y-linked; 10.28
441878; AIB01B69; Hs.127982; ESTs; ESTs; 10.06
432117; AL036195; Hs.3909; protamine 1; protamine 1; 10.01
425427; AI652662; Hs.317432; branched chain aminotransferase 1, cytosolic; branched chain aminotransferase 1, cytos; 9.97
416201; AA467752; Hs.195161; ESTs; ESTs; 9.97
410929; H47233; Hs.30643; ESTs; ESTs; 9.91

t^S? factor 4 (heparW ^ fr^artstomrflrrg probm I Kaposr sarcoma oncogene); flbrcblaEt growth factor 4 (heparin seer, 9.B1 

427239; BE270447; Ks.356512; ubtqiifin carrier proWn; libfepjlfo ca^ ^ 1 

402680; ; ; Target Exon; Target Exon; 9.68
4O920* Y 0009 * Hs.172631; integrin, alpha X (antigen CD11C (p150), alpha polypeptide); integrin, alpha X (antigen CD11C (p150),; 9.46
443426; AF0S6t58; Hs.9329; chromosome 20 open reading frame 1; chromosome 20 open reading frame 1; 9.42
440207; AJ371978; Hs.128326; ESTs; ESTs; 9.41
433001; AF217513; Hs.279S05; clone HQ0310 PRO0310p1; clone HQ0310 PRO0310p1; 9.41
447534; AW953935; Hs^BB655; ESTs; ESTs; 9.33
442333; AI650B77; Hs.129302; ESTs; ESTs; 9.28

¥1 3 3 (^ , ^ 05: Ho ^ mRMA; cONA DKRp434B0425 {from ctone DKFZp434B0425J; Homo saptons mRNA; cONA DKF2p43460425 ft 9.24 

423458; AI204212; Hs.351118; ESTs; ESTs; 9.23
43195ft XB3629; Hs.2877; cadherin 3, type 1, P-cadherin (placental); cadherin 3, type 1, P-cadherin (placenta; 9.23
422938; NM_001809; Hs.1594; centromere protein A (17kD); centromere protein A (17kD); 9.21

11!??? AFOr20 "; Hs.67848; leutecyfe unmunogtobuffn-Oke receptor, subfamily B (with TM and TT1M domains], member 4; leultocyte ImrranoglobulirvllRfi receptor : 9.21 

425397;J0406^Hs.156346;topoJ^^ ' ^ ^ ' 

428664; AK001666; Hs.189095; similar to SALL1 (sal (Drosophila)-like; similar to SALL1 (sal (Drosophila)-like; 9.17
428970; BE276B91; Hs.194691; retinoic acid induced 3 (RAIG1); metabotropic glutamate family GPCR; retinoic acid induced 3 (RAIG1); metabo; 9.11
447733; AF157482; Hs.19400; MAD2 (mitotic arrest deficient, yeast, homolog)-like 2; MAD2 (mitotic arrest deficient, yeast, h; 9.11
42231ft AA316B22; Hs.98370; cytochrome P450, subfamily IIS, polypeptide 1; cytochrome P450, subfamily IIS, polypept; 9.10
449722; BE280ri74; Hs.23960; cyclin B1; cyclin B1; 8.86
441580; F13386; Hs.7888; v-erb-a avian erythroblastic leukemia viral oncogene homolog-like 4 (HER4); v-erb-a avian erythroblastic leukemia vi; 8.86
440983; M20681; Hs.7594; solute carrier family 2 (facilitated glucose transporter), member 3; solute carrier family 2 (facilitated glu; 8.86
OV 409342;ALl077u^;Hs^40D^er^lBSSoriatedRINGd 
420367; AA259090; Hs.257028; ESTs; ESTs; 8.62

415947; U04045; Hs.78934; mutS (E. coli) homolog 2 (colon cancer, nonpolyposis type 1); mutS (E. coli) homolog 2 (colon cancer, ; 8.73
418613; AA744529; Hs.86575; mitogen-activated protein kinase kinase kinase kinase 1; mitogen-activated protein kinase kinase ; 8.71
417407; AA923278; Hs.290905; ESTs, Weakly similar to protease [H.sapiens]; ESTs, Weakly similar to protease [H.sapi; 8.64
407239; AA076350; Hs.67846; leukocyte immunoglobulin-like receptor, subfamily B (with TM and ITIM domains), member 4; leukocyte immunoglobulin-like receptor, ; 8.58
436481; AA379597; Hs.5199; HSPC150 protein similar to ubiquitin-conjugating enzyme; HSPC150 protein similar to ubiquitin-con; 8.55
412140; AA219691; Hs.73625; RAB6 interacting, kinesin-like (rabkinesin6); RAB6 interacting, kinesin-like (rabkines; 8.52
438091; AW373062; Hs.351546; nuclear receptor subfamily 1, group I, member 3; nuclear receptor subfamily 1, group I, m; 8.61
424800; AL03558B; Hs.153203; MyoD family inhibitor; MyoD family inhibitor; 8.45
447188; H65423; Hs.17631; hypothetical protein DKFZp434E2135; hypothetical protein DKFZp434E2135; 8.45
430056; X97548; Hs.22B059; KRAB-associated protein 1; KRAB-associated protein 1; 8.42
417389; BE260964; Hs.82045; midkine (neurite growth-promoting factor 2); midkine (neurite growth-promoting factor; 8.40
430676; AF084665; Hs.372585; gb:Homo sapiens envelope protein RIC-3 (env) gene, complete cd; gb:Homo sapiens envelope protein RIC-3 (; 8.38
420759; T11832; Hs.127797; Homo sapiens cDNA FLJ11381 fis, clone HEMBA1000501; Homo sapiens cDNA FLJ11381 fis, clone HE; 8.38
406621; X57BG9; Hs.181125; immunoglobulin lambda locus; immunoglobulin lambda locus; 8.37
463914; NM_000507; Hs.574; fructose-1,6-bisphosphatase 1; fructose-1,6-bisphosphatase 1; 8.25
423198; MBi933;Hs.1M4;<^drvl3fonr^ 

418299; AA279530; Hs.83968; integrin, beta 2 (antigen CD18 (p95), lymphocyte function-associated antigen 1; macrophage antigen 1 (mac-1) beta subunit); integrin, beta 2 (antigen CD18 (p95), ly; 8.17
453968; AA847843; Hs.6271t; High mobility group (nonhistone chromosomal) protein 4; High mobility group (nonhistone chromoso; 8.16
453985; M44S45; Hs.281865; ESTs; ESTs; 8.14
451106; BE3B2701; Hs.25960; N-MYC oncogene; N-MYC oncogene; 8.10

420347; AL033539; Hb.97124; Human DNA sequence from done RP1-309H15 on chromosome 6p22.1-2Z3 Contains a gene similar to HDGF (hepotoma-derived Growth factor Ihlalw 

rnobOTty group protemf^^ a 1 * 

415857; AAB66115; Hs.127797; Homo sapiens cDNA FLJ11381 fis, clone HEMBA1000501; Homo sapiens cDNA FLJ11381 fis, clone HE; 8.02
425601; AW629485; Hs.148720; GSK-3 binding protein FRAT2; GSK-3 binding protein FRAT2; 7.90
421016; ^504583; Hs.101047; transcription factor 3 (E2A immunoglobulin enhancer binding factors E12/E47); transcription factor 3 (E2A immunoglobul; 7.89
432407; AA221036; ; gb:zr03f12.r1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA clone 5' similar to SW:POL_BAEVM P10272 POL POLYPROTEIN ;, mRNA sequence; gb:zr03f12.r1 Stratagene NT2 neuronal pr; 7.83
422846; BE513934; Hs.1583; neutrophil cytosolic factor 1 (47kD, chronic granulomatous disease, autosomal 1); neutrophil cytosolic factor 1 (47kD, chr; 7.80

433228; F2fl21£Hs.l4983;Ki^l491p^^^ 1 ' ' 

446528; AU076640; Hs.15243; nucleolar protein 1 (120kD); nucleolar protein 1 (120kD); 7.71
447350; AI375572; Hs.172634; v-erb-a avian erythroblastic leukemia viral oncogene homolog-like 4 (HER4); v-erb-a avian erythroblastic leukemia vi; 7.71

448775; AB025237; Hs^8; nucfe (nucSecfllfc dlpr^phate 

430253; AK001514; Hs.23S844; hypothetical protein FLJ10652; hypothetical protein FLJ10652; 7.70
444784; D12485; Hs.11951; ectonucleotide pyrophosphatase/phosphodiesterase 1 (Plasma-cell membrane glycoprotein PC-1); ectonucleotide pyrophosphatase/phosphodi; 7.82
443537; W3305; Hs.203; chdex*sto^^ W w^*****f*^p™ai, 

446291; BE397753; Hs.14B23; interferon, gamma-inducible protein 30; interferon, gamma-inducible protein 30; 7.55
410006; AW732306; Hs.57783; eukaryotic translation initiation factor 3, subunit 9 (eta, 116kD); eukaryotic translation initiation factor; 7.53
430255; AK000703; Hs.323822; Homo sapiens mRNA for KIAA1551 protein, partial cds; Homo sapiens mRNA for KIAA1551 protein,; 7.52
411975; AI91605B; Hs.144583; 3'UTR of: dead ringer (Drosophila)-like 1; 3'UTR of: dead ringer (Drosophila)-like ; 7.50
4398S4; AJ720078; Hs.291997; ESTs, Weakly similar to A47582 B-cell growth factor precursor [H.sapiens]; ESTs, Weakly similar to A47582 B-cell gr; 7.47
440773; AA352702; Hs.37747; Homo sapiens, Similar to RIKEN cDNA 2700083B06 gene, clone MGC4669, mRNA, complete cds; Homo sapiens, Similar to RIKEN cDNA 2700; 7.47

4m05;AW34952;Hs.175220;fvpofo^ 

425367; BE27118B; Hs.155975; protein tyrosine phosphatase, receptor type, C-associated protein; protein tyrosine phosphatase, receptor t; 7.47
407710; AW022727; Hs.23676; ESTs; ESTs; 7.45
445093; AI207197; Hs.374149; ESTs; ESTs; 7.41
418113; AI272141; Hs.83484; SRY (sex determining region Y)-box 4; SRY (sex determining region Y)-box 4; 7.39
417900; BE2S0127; Hs.82906; CDC20 (cell division cycle 20, S. cerevisiae, homolog); CDC20 (cell division cycle 20, S. cerevi; 7.37

fJ??? 905 JJ 5,1 11 B01; dF* ne o^frogenaw (decarboxyiaBng; glycine decarfjoxytase. gJyo^ecteavage system pmteto p) ; glyohis dehydiOBeriase (daartjoxyiatiigT ; ? 33 
422mUl1690;Hs.l572;fack^nlt3ldysplBsk(Aar3kc^ ^ v 3f ^ 9 " 

430504; H52761; Hs.44095; Homo sapiens, clone MGC:12617, mRNA, complete cds; Homo sapiens, clone MGC:12617, mRNA, com; 7.32
44898?; AI963719; Hs.195387; ESTs; ESTs; 7.28
413752; AW411479; Hs.848; FK506-binding protein 4 (59kD); FK506-binding protein 4 (59kD); 7.26

-il 3 ^ 01 H ?- 4749 "» ft* 1 * 8a P tet1R mRNA; cDNA DKF2p76lE13121 (from done OKFZp761 El 3121); paitJatcds; Homo sapiens mRNA; cDNA DKFZp76tE13121 1: 7.25 
™ m4 l^2% l?^?* 07011 NC "-C6AP-Ov23 Homo sapiens cDNA done 3T similar to TR:OtS475 015475 UNNAMED HERV-H PRfJTEIM ;, mRNA sequence; gb:W4b07j<1 
r-c- NCLCOAP_0v23 Homo sapiens; 7.24 

428977; AK001404; Hs.194698; cyclin B2; cyclin B2; 7.19

434274^AA62853^ ,^785 .ESTs, Moderately similar to ALUIJ^UMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY IH^aplensl; ESTs, Moderately 
fitmnaf to AluI^HI/MAN A; 7. 19 

446700; AW20B257; Hs. 156325; Human DNA sequence from done RP11-145L22 on chromosome 6p2?.32-2Z2 Coniafna the ge/te for myellntoygoo^rccyta rfrooproeelfl MOG 
s-f\ {part of) the gene for a novel KRAB box cor,la3nir\g C2H2 type zinc C; Human DNA s^ubhco from clone RP11-145L2; 7.16 

420524; AB010575; Hs.98547; amiloride-sensitive cation channel 3, testis; amiloride-sensitive cation channel 3, te; 7.15
439053; BE244588; Hs.6456; chaperonin containing TCP1, subunit 2 (beta); chaperonin containing TCP1, subunit 2 (b; 7.14
445076; AI205886; Hs.154131; ESTs; ESTs; 7.14
448588; AI970276; Hs.159905; KIAA1676; KIAA1676; 7.13
429486; AF155827; Hs.203963; hypothetical protein FLJ10339; hypothetical protein FLJ10339; 7.10
445362; BE614410; Hs.23044; RAD51 (S. cerevisiae) homolog (E coli RecA homolog); RAD51 (S. cerevisiae) homolog (E coli Re; 7.04
433914; AF1uai38; Hs.112160; Homo sapiens DNA helicase homolog (PIF1) mRNA, partial cds; Homo sapiens DNA helicase homolog (PIF1); 7.02
413276^ BE5f^B6; Hs.833; Interferorv^talaited pro^ 

R23859; Hs.f43375; Homo sapiens, done tMAG&3840937. mRNA. partial ods; Homo sapiens, clone IMAGE3e40937 l rnRNA^ 636 

416058; U03272; Hs.79432; fibrillin 2 (congenital contractural arachnodactyly); fibrillin 2 (congenital contractural ara; 5.92
438450; AI050B66; Hs.65853; nodal, mouse, homolog; nodal, mouse, homolog; 8.90

1!??? fiE ?? 9352: Hs - 949 I neutrophil cytosotfc factor 2 (6SkD. chronic p^utornatous disease, autosomal 2); neutrophil cytosoUc factor 2 [65k0.chr; 6.90 
444381; BE387335; Hs.283713; hypothetical protein BC014245; hypothetical protein BC014245; 6.89
447582; BE293520; Hs.18910; prostate cancer overexpressed gene 1; prostate cancer overexpressed gene 1; 6.89
424779;AL046851;Hs.153053;CD37antlgen;CD37anlia^6^ 
443907; AU076484; Hs.9963; TYRO protein tyrosine kinase binding protein; TYRO protein tyrosine kinase binding pro; 6.84
427298; AA400495; ; ESTs; ESTs; 6.82
414732; AW410976; Hs.77152; minichromosome maintenance deficient (S. cerevisiae) 7; minichromosome maintenance deficient (S.; 6.81
424959; NM_005781; Hs.153937; activated p21cdc42Hs kinase; activated p21cdc42Hs kinase; 6.81
428866; U02330; Hs.172816; neuregulin 1; neuregulin 1; 6.80
418203; X54942; Hs.6375S; CDC28 protein kinase 2; CDC28 protein kinase 2; 6.80
427521; AW973352; ; ESTs; ESTs; 6.75
430397; AI924533; Hs.105607; bicarbonate transporter related protein 1; bicarbonate transporter related protein ; 6.75
427719; AI393122; Hs.134726; ESTs; ESTs; 6.74
43945$ BE264974; Hs.6566; thyroid hormone receptor interactor 13; thyroid hormone receptor interactor 13; 6.72
4S334B; BE272318; Hs.859S; hypothetical protein FLJ12438; hypothetical protein FLJ12438; 6.71
446f f 3; AW967553; Hs.32351fl; Homo sapiens mRNA for FLJ00083 protein, partial cds; Homo sapiens mRNA for FLJ00083 protein, ; 6.70
445363; NM_005993; Hs.12570; tubulin-specific chaperone d; tubulin-specific chaperone d; 6.70
433701; AW445023; Hs.15155; ESTs; ESTs; 6.69
418054; NM_002318; Hs.83354; lysyl oxidase-like 2; lysyl oxidase-like 2; 6.63
441031; AI1106B4; HsJ645; fibrinogen, B beta polypeptide; fibrinogen, B beta polypeptide; 6.62
439237; AW408158; Hs.318693; ESTs, Weakly similar to A47582 B-cell growth factor precursor [H.sapiens]; ESTs, Weakly similar to A47582 B-cell gr; 6.59
424381; AA285249; Hs.146329; protein kinase Chk2 (CHEK2); protein kinase Chk2 (CHEK2); 6.66
414821; M63635; Hs.77424; Fc fragment of IgG, high affinity Ia, receptor for (CD64); Fc fragment of IgG, high affinity Ia, re; 6.58
418883; BE3B7036; Hs.1211; acid phosphatase 5, tartrate resistant; acid phosphatase 5, tartrate resistant; 6.57
414161; M136106; Hs.184652; KIAA1553 protein; KIAA1553 protein; 6.57
437437; AA226BB9; Hs.351623; hypothetical protein DKFZp762L0311; hypothetical protein DKFZp762L0311; 6.55
425769; U725l3;HsJ59486;Hum^RPL13-2rK^^ RFL13-2 p^dogwemRl^.cornplBta;fi.S6 

452363; AI582743; Hs.94953; Homo sapiens, Similar to complement component 1, q subcomponent, c polypeptide, clone MGC:17279, mRNA, complete cds; Homo sapiens, Similar to complement comp; 6.65
422765; AW40970J; Hs.1578; baculoviral IAP repeat-containing 5 (survivin); baculoviral IAP repeat-containing 5 (sur; 6.51
409893; AW247090; Hs.57101; minichromosome maintenance deficient (S. cerevisiae) 2 (mitotin); minichromosome maintenance deficient (S.; 6.50
418918; X07871; Hs.69476; CD2 antigen (p50), sheep red blood cell receptor; CD2 antigen (p50), sheep red blood cell ; 6.49

417911;AA3333B7; te.B291fcchaperanincontriito^ subunit 6A fc 6.44 

427747; AW411425; Hs.180655; serine/threonine kinase 12; serine/threonine kinase 12; 6.43
409142; AL136B77; Hs.5075B; SMC4 (structural maintenance of chromosomes 4, yeast)-like 1; SMC4 (structural maintenance of chromoso; 6.42
430280; AA36125B; Hs.237B63; interleukin 7 receptor; interleukin 7 receptor; 6.42
432938; T270J3; Hs.3132; steroidogenic acute regulatory protein; steroidogenic acute regulatory protein; 6.42
ZD 422809; AW001379; Hs.121026j hypothetical protein FLJ10549; hypothetical protein FU10549; 6.40 

427578; AI5913G5; Hs.169064; ESTs, Highly similar to TUL3_HUMAN TUBBY RELATED PROTEIN 3 [H.sapiens]; ESTs, Highly similar to TUL3_HUMAN TUBBY; 6.40
417115; AW952792; Hs.334612; small nuclear ribonucleoprotein polypeptide E; small nuclear ribonucleoprotein polypept; 6.39
42B227; AA321649; Hs.2248; small inducible cytokine subfamily B (Cys-X-Cys), member 10; small inducible cytokine subfamily B (Cy; 6.39
441364; AA44784S; Hs.288660; retinoic acid induced 3; retinoic acid induced 3; 6.38
438915; AA280174; Hs.285681; Williams-Beuren syndrome chromosome region 14; Williams-Beuren syndrome chromosome regi; 6.34
416773; AK000340; Hs.79828; hypothetical protein FLJ20333; hypothetical protein FLJ20333; 6.34
45199ft AW176401; Hs.27424; DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 11 (S.cerevisiae CHL1-like helicase); DEAD/H (Asp-Glu-Ala-Asp/His) box polypep; 6.31
444159; AF116846; Hs.10431; dead ringer (Drosophila)-like 2 (bright and dead ringer); dead ringer (Drosophila)-like 2 (bright ; 6.31
425274; BE2B1191; Hs.155462; minichromosome maintenance deficient (mis5, S. pombe) 6; minichromosome maintenance deficient (mi; 6.31
434649; AA736254; Hs.165390; ESTs, Highly similar to A40350 transcription repressor protein YY1 [H.sapiens]; ESTs, Highly similar to A40350 transcrip; 6.30
420507; AF093408; Hs.9B397; A kinase (PRKA) anchor protein 3; A kinase (PRKA) anchor protein 3; 6.30
415B29; AW45019B; Hs.163742; ESTs; ESTs; 6.28
410342; R31350; Hs.743; Fc fragment of IgE, high affinity I, receptor for, gamma polypeptide; Fc fragment of IgE, high affinity I, rec; 6.26
446B39:BEX91926;Hs.16244; miotic spindle coHeo^ralm^^ 
457465; AW30T344; Hs.?2290fl; DNA replication factor; DNA replication factor; 6.25
428918; AL0369B7; Hs.2324; protamine 2; protamine 2; 6.24
443523; AK00157S; Hs.9536; hypothetical protein FLJ10713; hypothetical protein FLJ10713; 6.19
424415; NM_001975; Hs.146560; enolase 2, (gamma, neuronal); enolase 2, (gamma, neuronal); 6.19
407245; X9056B; Hs.172004; titin; titin; 6.18
45B627; AW068642; Hs.97964; SRY (sex determining region Y)-box 17 (SOX17); SRY (sex determining region Y)-box 17 (S; 6.18
421379; Y16221; Hs.103982; small inducible cytokine subfamily B (Cys-X-Cys), member 11; small inducible cytokine subfamily B (Cy; 6.16

435099; AC004770; Hs.4756; flap structure-specific endonuclease 1; flap structure-specific endonuclease 1; 6.13

424308; AW975531; Hs.154443; minichromosome maintenance deficient (S. cerevisiae) 4; minichromosome maintenance deficient (S.; 6.12
430521; NM_016383; Hs.242163; HOM-TES-85 tumor antigen; HOM-TES-85 tumor antigen; 6.10
444823; BE262989; Hs.12045; putative protein; putative protein; 6.10
42B4B4; AF104032; Hs.184601; solute carrier family 7 (cationic amino acid transporter, y system), member 6; solute carrier family 7 (cationic amino ; 6.09

402260; ; ; NM_001436*:Homo sapiens fibrillarin (FBL), mRNA, transcript) (F8A), mRNA.; NM_001436*:Homo sapiens fibrillarin (FBL; 6.09
441321; H171B2; Hs.7771; B-cell associated protein; B-cell associated protein; 6.05
^ * f^E? ^il 1307 ' Hs.««^; CDC45 (cell dJwston cycle 45, &cerevisi». homoJogH^ COC4S {ceil oVvisfan cycte 45, Rcerevis; 6.04 
43162P4 AU077025; Hs.265627; interferon, alpha-inducible protein (clone IFI-6-16); interferon, alpha-inducible protein (clo; 6.04
402678; ; ; Target Exon; Target Exon; 6.03
453884; AA355925; Hs.36232; KIAA0186 gene product; KIAA0186 gene product; 6.01
439753; BE262233; Hs.7423; hypothetical protein from EUROIMAGE 2168212; hypothetical protein from EUROIMAGE 2168; 6.01
420586; NM_002692; Hs.99165; polymerase (DNA directed), epsilon 2; polymerase (DNA directed), epsilon 2; 6.01
42067ft AI4347B0; Hs^248; vav 2 oncogene; vav 2 oncogene; 6.00
418756; AA252254; Hs.226949; ESTs; ESTs; 5.99
454438; AA224053; Hs.172405; cell division cycle 27; cell division cycle 27; 5.98
407818; AI021938; Hs.40454; jumonji (mouse) homolog; jumonji (mouse) homolog; 5.98
413313; NM_002047; Hs.2938B5; glycyl-tRNA synthetase; glycyl-tRNA synthetase; 5.96
424247; X14008; Hs.234734; lysozyme (renal amyloidosis); lysozyme (renal amyloidosis); 5.95

fJ?J} il s S E ^J S ' aimi,a " * 179695 ^rthreonln&apBdfc protein kinase [H.sapiens]; ESTs, Weakly similar to I7BB85 sertneflh; 5.94 

449569; AI656634; Hs.195389; ESTs; ESTs; 5.92
436576; AJ458213; Hs.77542; ESTs; ESTs; 5.90
70 42OT0S; AiftW1ir^rfe133^ H E^^^ saK 9 en P 97 (niefanotransferrin) mRNA, 3" flairti; Human nr^lanoma-assoclated anHgon p97 (m; 5.89 

417208; S 67773; Hs.B1666; v-kit Hardy-Zuckerman 4 feline sarcoma viral oncogene homolog; v-kit Hardy-Zuckerman 4 feline sarcoma v; 5.88

iHiiHl ^?^J^\^^^\ iO&STSI sperm membrane protein BS-63 (Homo sapfenslll; r^01472*:gi|580967B|aWAAB4184a2| (U64; 6.87 

44873$ AB032983; Hs.21894; KIAA1157 protein; KIAA1157 protein; 5.87
1* i251 37: : ! MM_W0l79*iHomo sapiens mutS (E. cdl) homotog 6 (W8H6), mRNA. VERSION NMJJ00178.1 Gl; KM_000179*iHorno sapiens mutS (E. cdil h; 5^5 
423787; AJ295745; Hs.236204; nuclear pore complex protein; nuclear pore complex protein; 5.85
425126; N327S9; Hs.172944; chorionic gonadotropin, beta polypeptide; chorionic gonadotropin, beta polypeptide; 5.84
452796; AB011100; Hs^0656; KIAA0528 gene product; KIAA0528 gene product; 5.84

HS^fi??^ 1 ^^ 6 * 541131 ^i^P^entidnasetahfcltor 3 (COK2-assodated dual specific^ phe^hatase); cyclrKlependenlWnese Inhibitor 3 (CDK; 6.80 
447359; NM_012093; Hs.1S2W; adenylate kinase 5; adenylate kinase 5;
420344; BE463721; Hs.97101; putative G protein-coupled receptor; putative G protein-coupled receptor; 5.7B

^ 97 ^ fi 2^^^7^^Ts. Wea% similar to AIU1J1UMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [H^apiensl; ESTs. Weakly similar to 

414761; AU07722B; Hs.77256; enhancer of zeste (Drosophila) homolog 2; enhancer of zeste (Drosophila) homolog 2; 5.75
430287; AW182459; Hs.125759; ESTs, Weakly similar to LEU5_HUMAN LEUKEMIA ASSOCIATED PROTEIN 5 [H.sapiens]; ESTs, Weakly similar to LEU5_HUMAN LEUKE; 5.74
436251; BE515065; Hs.296595; nucleolar protein (KKE/D repeat); nucleolar protein (KKE/D repeat); 5.73
421535; AB002359; Hs.105478; phosphoribosylformylglycinamidine synthase (FGAR amidotransferase); phosphoribosylformylglycinamidine syntha; 5.71
4148B3; AA926960; Hs.34S6S9; CDC28 protein kinase 1; CDC28 protein kinase 1; 5.69
425159; NM_004341; Hs.15466B; carbamoyl-phosphate synthetase 2, aspartate transcarbamylase, and dihydroorotase; carbamoyl-phosphate synthetase 2, aspart; 5.69
401704; ; ; NM_021195*:Homo sapiens claudin 6 (CLDN6), mRNA. VERSION NM_0209B21 Gl; NM_021195*:Homo sapiens claudin 6 (CLDN6; 5.66
425358; AL079658; Hs.338207; FK506 binding protein 12-rapamycin associated protein 1; FK506 binding protein 12-rapamycin assoc; 5.65

402677; ; ; NM_000478:Homo sapiens alkaline phosphatase, liver/bone/kidney (ALPL), mRNA. alpha-hydroxylase), polypeptide 1 (CYP7B1), mRNA.; NM_000478:Homo sapiens alkaline phosphat; 5.54
409264; NM_014937; Hs.52463; KIAA0966 protein; KIAA0966 protein; 5.63
432185; AA221032; Hs.272638; hypothetical protein FLJ10494; hypothetical protein FLJ10494; 5.63
409012; AL117435; HM9725; DKFZP434I216 protein; DKFZP434I216 protein; 5.63
430252; AI638774; Hs.105328; testes development-related NYD-SP20; testes development-related NYD-SP20; 5.61
419369; AL043202; Hs.90073; chromosome segregation 1 (yeast homolog)-like; chromosome segregation 1 (yeast homolog); 5.61
46281S; AA1317B9; Hs.61909; ESTs; ESTs; 5.60
402679; ; ; NM_000478:Homo sapiens alkaline phosphatase, liver/bone/kidney (ALPL), mRNA. alpha-hydroxylase), polypeptide 1 (CYP7B1), mRNA.; NM_000478:Homo sapiens alkaline phosphat; 5.59
414291; AI2B9619; Hs.13040; G protein-coupled receptor 86; G protein-coupled receptor 86; 5.58
453028; AB006532; Hs.31442; RecQ protein-like 4; RecQ protein-like 4; 5.58
453905; NM_002314; Hs.3B56B; LIM domain kinase 1; LIM domain kinase 1; 5.56
41126% BE297802; Hs.69360; kinesin-like 6 (mitotic centromere-associated kinesin); kinesin-like 6 (mitotic centromere-assoc; 5.55
4196BG; BE280337; Hs.194693; solute carrier family 7 (cationic amino acid transporter, y system), member 7; solute carrier family 7 (cationic amino ; 5.55

446979; AI654443; Hs.197683; ESTs; ESTs; 5.54
424321; W74048; Hs.1765; lymphocyte-specific protein tyrosine kinase; lymphocyte-specific protein tyrosine kin; 5.53
416962; AA714835; Hs.271863; ESTs; ESTs; 5.53
447388; AW630534; Hs.76277; Homo sapiens, clone MGC:9381, mRNA, complete cds; Homo sapiens, clone MGC:9381, mRNA, comp; 5.52
427247; AW504221; Hs.174103; integrin, alpha L (antigen CD11A (p180), lymphocyte function-associated antigen 1; alpha polypeptide); integrin, alpha L (antigen CD11A (p180),;
449322; AI638616; Hs.196565; ESTs; ESTs; 5.51
428450; NM_014791; Hs.1S4339; KIAA0175 gene product; KIAA0175 gene product; 5.51

415141; AA189099; Hs.268171; ESTs, Weakly similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; ESTs, Weakly similar to ALU7_HUMAN ALU S; 5.48
45404B; W05626; Hs.6921; ESTs; ESTs; 5.46
417079; U65590; Hs.81134; interleukin 1 receptor antagonist; interleukin 1 receptor antagonist; 5.45
434699; AA643687; Hs.149425; Homo sapiens cDNA FLJ11980 fis, clone HEMBB1001304; Homo sapiens cDNA FLJ11980 fis, clone HE; 5.44
414334; AA824298; Hs.21331; hypothetical protein FLJ10036; hypothetical protein FLJ10036; 5.44
452291; AF01S592; Hs.28853; CDC7 (cell division cycle 7, S. cerevisiae, homolog)-like 1; CDC7 (cell division cycle 7, S. cerevisi; 5.44
438554; AA381553; Hs.198253; major histocompatibility complex, class II, DQ alpha 1; major histocompatibility complex, class ; 5.44
427668; AA298760; Hs.180191; hypothetical protein FLJ14904; hypothetical protein FLJ14904; 5.43
449437; AJ702038; Hs.100057; Homo sapiens cDNA: FLJ22902 fis, clone KAT0558f; Homo sapiens cDNA: FLJ22902 fis, clone K; 5.41
4S3633; AA357001; Hs.34045; hypothetical protein FLJ20764; hypothetical protein FLJ20764; 5.40
450746; 062673; Hs.278589; general transcription factor II, i; general transcription factor II, i; 5.40
425966; NM_001761; Hs.1973; cyclin F; cyclin F; 5.39
41B134; AAa97769; Hs.86617; ESTs; ESTs; 6.38
432141; BE4?0964; Hs.272736; nuclear receptor binding protein; nuclear receptor binding protein; 5.37
417141; U22662; Hs.347353; nuclear receptor subfamily 1, group H, member 3; nuclear receptor subfamily 1, group H, m; 5.36
42832$ AA426091; Hs.98453; ESTs, Moderately similar to R27326 2 [H.sapiens]; ESTs, Moderately similar to R27328 2 [H.; 5.35
406811; U82979; Hs.67846; leukocyte immunoglobulin-like receptor, subfamily B (with TM and ITIM domains), member 4; leukocyte immunoglobulin-like receptor, ; 5.34
41581S; AU077330; Hs.360791; transcription elongation factor A (SII), 1; transcription elongation factor A (SII),; 5.33
DU 44B^AArai57;Hs.7376^fblateiTO^ 

424762; AL119442; Hs.183S84; eukaryotic translation initiation factor 4 gamma, 2; eukaryotic translation initiation factor; 5.32
421959; AW751497; Hs.98370; cytochrome P450, subfamily IIS, polypeptide 1; cytochrome P450, subfamily IIS, polypept; 5.32
429271; AF039B50; Hs.198615; dead ringer (Drosophila)-like 1; dead ringer (Drosophila)-like 1; 5.32
458373; BE247706; Hs.89693; membrane-spanning 4-domains, subfamily A, member 2 (CD20 antigen); membrane-spanning 4-domains, subfamily A; 5.30
414907; X90725; Hs.77597; polo (Drosophila)-like kinase; polo (Drosophila)-like kinase; 5.30
42299T7; BE016212; Hs.122906; DNA replication factor; DNA replication factor; 5.29
440014; AWS60782; Hs.68565; ash2 (absent, small, or homeotic, Drosophila, homolog)-like; ash2 (absent, small, or homeotic, Drosop; 5.28
418399; AF131761; Hs.84753; hypothetical protein FLJ12442; hypothetical protein FLJ12442; 5.26
416178; AI608627; Hs.192822; serologically defined breast cancer antigen NY-BR-81; serologically defined breast cancer anti; 5.21
450377; AB033091; Hs.355925; KIAA1265 protein; KIAA1265 protein; 5.20
409670; AI368109; Hs.375604; KIAA1856 protein; KIAA1856 protein; 5.20
42906*3; Y09397; Hs.227817; BCL2-related protein A1; BCL2-related protein A1; 5.20
449523; NM_000579; Hs.54443; chemokine (C-C motif) receptor 5; chemokine (C-C motif) receptor 5; 5.20
40B908; BE296227; Hs.250822; serine/threonine kinase 15; serine/threonine kinase 15; 5.19
42973% U20158; Hs.2488; lymphocyte cytosolic protein 2 (SH2 domain-containing leukocyte protein of 76kD); lymphocyte cytosolic protein 2 (SH2 doma; 5.19
4380B9; W05391; Hs.351546; nuclear receptor subfamily 1, group I, member 3; nuclear receptor subfamily 1, group I, m; 5.16
437623; D63880; Hs.5719; chromosome condensation-related SMC-associated protein 1; chromosome condensation-related SMC-asso; 5.15
448181; AF272833; Hs.279783; hypothetical protein FLJ10504; hypothetical protein FLJ10504; 5.15
436540; BE397032; Hs.14468; hypothetical protein MGC14226; hypothetical protein MGC14226; 5.14
422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor type, C; protein tyrosine phosphatase, receptor t; 5.14
426752; XB949Q; Hs.172004; titin; titin; 5.13
415007; BE244332; Hs.77770; adaptor-related protein complex 3, mu 2 subunit; adaptor-related protein complex 3, mu 2; 5.13
4002S3; ; Hs.76309; Eos Control; Eos Control; 5.13
437099; M77793; HSw48659; ESTs, Highly similar to S14458 laminin alpha-1 chain precursor [H.sapiens]; ESTs, Highly similar to S14458 laminin a; 5.12
/!> 427209; H0550$ H&92423; KIAA1566 

407347; AA829Q47; ; gb:od40d07.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:1370413 3' similar to contains Alu repetitive element, mRNA sequence.; gb:od40d07.s1 NCI_CGAP_GCB1 Homo sapiens; 5.10

456933; AI638429; Hs.24763; RAN binding protein 1; RAN binding protein 1; 5.10
45043t; AW1367S7; Hs.266041; ESTs; ESTs; 5.09

434608; AA805443; Hs.179909; hypothetical protein FLJ22995; hypothetical protein FLJ22995; 5.0S

410423; AW402432; Hs.63489; protein tyrosine phosphatase, non-receptor type 6; protein tyrosine phosphatase, non-recept; 5.08

417929; R27219; Hs.74647; Human T-cell receptor active alpha-chain mRNA from JM cell line, complete cds; Human T-cell receptor active alpha-chain; 5.05

412723; AA648459; Hs.335951; hypothetical protein AF301222; hypothetical protein AF301222; 5.05
447321; AW271217; Hs.281434; Homo sapiens cDNA FLJ14028 fis, clone HEMBA1003838; Homo sapiens cDNA FLJ14028 fis, clone HE; 5.03

417866; AW067S03; Hs.82772; collagen, type XI, alpha 1; collagen, type XI, alpha 1; 5.03

410082; AA081594; Hs.158311; Musashi (Drosophila) homolog 1; Musashi (Drosophila) homolog 1; 5.02

433592; NM_004642; HsJ436; deleted in oral cancer (mouse, homolog) 1; deleted in oral cancer (mouse, homolog) ; 5.02

425811; AL039104; Hs.159557; karyopherin alpha 2 (RAG cohort 1, importin alpha 1); karyopherin alpha 2 (RAG cohort 1, impor; 5.00

425237; U07695; Hs.155227; EphB4; EphB4; 5.00

414809; AI434699; Hs.77356; transferrin receptor (p90, CD71); transferrin receptor (p90, CD71); 4.99
402145; ; ; Target Exon; Target Exon; 4.99
432126; AAaB5239; Hs.37196; ESTs; ESTs; 4.99

40B27S; AF216965; Hs>44095; Homo sapiens, clone MGC:12617, mRNA, complete cds; Homo sapiens, clone MGC:12617, mRNA, com; 4.98
419525; T79257; rfc,12K; asialoglycoprotein receptor 2; asialoglycoprotein receptor 2; 4.97
424439; AA579635; Hs.1770; ligase I, DNA, ATP-dependent; ligase I, DNA, ATP-dependent; 4.97

427667; AK001279; Hs.180171; Homo sapiens cDNA FLJ10417 fis, clone NT2RP1000112; Homo sapiens cDNA FLJ10417 fis, clone NT; 4.96
457313; AF047002; Hs.241520; transcriptional coactivator; transcriptional coactivator; 4.96

448569; BE382657; Hs.21486; signal transducer and activator of transcription 1, 91kD; signal transducer and activator of trans; 4.94
426427; M86699; Hs.169840; TTK protein kinase; TTK protein kinase; 4.91

440129; AA86581B; Hs.369523; ESTs, Weakly similar to S71886 Ste20-like protein kinase [H.sapiens]; ESTs, Weakly similar to S71886 Ste20-lik; 4.91
453922; AF053306; Hs.36708; budding uninhibited by benzimidazoles 1 (yeast homolog), beta; budding uninhibited by benzimidazoles 1 ; 4.90
435602; AF217515; Hs.283532; uncharacterized bone marrow protein BM039; uncharacterized bone marrow protein BM03; 4.89
452698; NM_001295; Hs.301921; chemokine (C-C motif) receptor 1; chemokine (C-C motif) receptor 1; 4.88

445333; BE537641; Hs.44278; hypothetical protein FLJ12538 similar to ras-related protein RAB17; hypothetical protein FLJ12538 similar to; 4.88
437182; AW005505; Hs.5464; thyroid hormone receptor coactivating protein; thyroid hormone receptor coactivating pr; 4.87
441600; AA939347; Hs.127223; Homo sapiens cysteine knot protein (ZSIG51) mRNA, complete cds; Homo sapiens cysteine knot protein (ZSIG; 4.87
421350; AW301608; Hs.278188; ESTs, Moderately similar to I54374 gene NF2 protein [H.sapiens]; ESTs, Moderately similar to I54374 gene ; 4.87
409093; BE243834; Hs.50441; CGI-04 protein; CGI-04 protein; 4.86

424304; NM_001395; Hs.144879; dual specificity phosphatase 9; dual specificity phosphatase 9; 4.86
437696; ZB3844; Hs.6790; hypothetical protein dJ37E16.5; hypothetical protein dJ37E16.5; 4.86

417381; AF164142; Hs.82042; solute carrier family 23 (nucleobase transporters), member 1; solute carrier family 23 (nucleobase tra; 4.84

416445; AL043004; Hs.79337; KIAA0135 protein; KIAA0135 protein; 4.83

448839; BE267795; Hs.22595; hypothetical protein FLJ10637; hypothetical protein FLJ10637; 4.82

413566; AW604451; Hs.285814; sprouty (Drosophila) homolog 4; sprouty (Drosophila) homolog 4; 4.82

424GB1; NM_006413; Hs.139120; ribonuclease P (30kD); ribonuclease P (30kD); 4.81

425474; Z48054; Hs.158084; peroxisome receptor 1; peroxisome receptor 1; 4.81

412276; BE262621; Hs.73798; macrophage migration inhibitory factor (glycosylation-inhibiting factor); macrophage migration inhibitory factor ; 4.81

422689; AWB56BB5; HaJ99797; gb:RC3-CT0297-29CtO(M]13-d03 CT0257 Homo sapians cDNA, mRNA sequence; fjb:RC3-CT0297-23t]10D-O13-dO3 CT0297 Homo; 4.80 

409101; NM_004297; Hs.50612; guanine nucleotide binding protein (G protein), alpha 14; guanine nucleotide binding protein (G pr; 4.79

412760; AW379030; Hs.41324; ESTs; ESTs; 4.79 

447250; AI878909; Hs.17883; protein phosphatase 1G (formerly 2C), magnesium-dependent, gamma isoform; protein phosphatase 1G (formerly 2C), ma; 4.79
429345; R11141; Hs.199695; hypothetical protein; hypothetical protein; 4.78
448950; AF2886G7; Hs.9275c CG1-162 protein; CGt-152 protein: 4.78 

412926; AI879076; Hs.79D61; macrophage myristoylated alanine-rich C kinase substrate; macrophage myristoylated alanine-rich C ; 4.78
412641; M16660; Hs.74335; heat shock 90kD protein 1, beta; heat shock 90kD protein 1, beta; 4.76

4202E1; AW2C6093; Hs.748; fibroblast growth factor receptor 1 (fms-related tyrosine kinase 2, Pfeiffer syndrome); fibroblast growth factor receptor 1 (fms; 4.76
421905; AI660247; Hs^2699; ESTs, Weakly similar to LIV-1 protein [H.sapiens]; ESTs, Weakly similar to LIV-1 protein [H; 4.75
413860; AI660842; Hs.110915; interleukin 22 receptor; interleukin 22 receptor; 4.75

424905; NM_002497; Hs.153704; NIMA (never in mitosis gene a)-related kinase 2 (NEK2); NIMA (never in mitosis gene a)-related k; 4.74
418355; L42563; Hs.1165; ATPase, H? transporting, nongastric, alpha polypeptide; ATPase, H? transporting, nongastric, al; 4.74
435905; AW997484; H8J003; KIAAD45B protein; WAA0455 protein; 4.74 

428024; Z29067; HsJ!236; NIMA (never in mitosis gene a)-related kinase 3; NIMA (never in mitosis gene a)-related k; 4.74
421846; AA017707; Hs.1432; protein kinase C substrate 80K-H; protein kinase C substrate 80K-H; 4.72
419138; U48508; Hs.89631; ryanodine receptor 1 (skeletal); ryanodine receptor 1 (skeletal); 4.72
437296; AA350994; Hs.20281; KIAA1700; KIAA1700; 4.70

450142; AW207469; Hs.24485; chondroitin sulfate proteoglycan 6 (bamacan); chondroitin sulfate proteoglycan 6 (bama; 4.70

421733; AL119671; Hs.1420; fibroblast growth factor receptor 3 (achondroplasia, thanatophoric dwarfism); fibroblast growth factor receptor 3 (ach; 4.69

449475; AJ348027; Hs.129828; hypothetical protein PP1057; hypothetical protein PP1057; 4.69

420062; AW411096; Hs.94785; TGF(beta)-induced transcription factor 2; TGF(beta)-induced transcription factor 2; 4.69

429336; AB005038; Hs.199270; cytochrome P450, subfamily XXVIIB (25-hydroxyvitamin D-1-alpha-hydroxylase), polypeptide 1; cytochrome P450, subfamily XXVIIB (25-hy; 4.69
436856; AI469355; Hs.127310; ESTs; ESTs; 4.68

406937; U14622; ; gb:Human transketolase-like protein gene, partial cds.; gb:Human transketolase-like protein gene; 4.67
411296; BE207307; Hs.10114; growth suppressor 1; growth suppressor 1; 4.67

426726; AA488915; Hs.171955; trophinin associated protein (tastin); trophinin associated protein (tastin); 4.67

409132; AJ224636; Hs^0732; protein kinase, AMP-activated, beta 2 non-catalytic subunit; protein kinase, AMP-activated, beta 2 no; 4.67

449230; BE613348; Hs.356392; melanoma cell adhesion molecule; melanoma cell adhesion molecule; 4.66

431681; AK000378; Hs.267566; hypothetical protein FLJ20371; hypothetical protein FLJ20371; 4.65

443623; AA345519; Hs.9641; complement component 1, q subcomponent, alpha polypeptide; complement component 1, q subcomponent, ; 4.65
441595; AW2Q6035; Hs.356457; ESTs; ESTs; 4.64 
423419; R55336; Hs.23539; ESTs; ESTs; 4.64 

415724; NM_003580; Hs.78687; neutral sphingomyelinase (N-SMase) activation associated factor; neutral sphingomyelinase (N-SMase) activ; 4.63
435045; BE297155; Hs.143698; ESTs; ESTs; 4.62 

424441; X14850; Hs.147097; H2A histone family, member X; H2A histone family, member X; 4.62
414972; BE283782; Hs.77695; KIAA0008 gene product; KIAA0008 gene product; 4.62

436885; W28661; Hs.5286; Homo sapiens mRNA; cDNA DKFZp434M245 (from clone DKFZp434M245); Homo sapiens mRNA; cDNA DKFZp434M245 (fr; 4.62
449515; AI653378; Hs.302012; ESTs; ESTs; 4.61

425998; AU076629; Hs.165950; fibroblast growth factor receptor 4; fibroblast growth factor receptor 4; 4.61

420027; AF009746; Hs.94395; ATP-binding cassette, sub-family D (ALD), member 4; ATP-binding cassette, sub-family D (ALD); 4.61

436469; AK001455; Hs.5199; Down syndrome critical region gene 2; Down syndrome critical region gene 2; 4.61

413441; AI929374; Hs.75367; Src-like-adapter; Src-like-adapter; 4.60

456847; AI360456; Hs.86088; ESTs; ESTs; 4.58

421506; BE302795; Hs.105097; thymidine kinase 1, soluble; thymidine kinase 1, soluble; 4.57
426935; NM_000088; Hs.172926; collagen, type I, alpha 1; collagen, type I, alpha 1; 4.57
428782; X12830; Hs.193400; interleukin 6 receptor; interleukin 6 receptor; 4.56

409430; R21945; Hs.346735; splicing factor, arginine/serine-rich 5; splicing factor, arginine/serine-rich 5; 4.56
412773; K157B5; Hs.74573; similar to vaccinia virus HindIII K4L ORF; similar to vaccinia virus HindIII K4L OR; 4.55
443068; AI18B710; Hs.374460; ESTs; ESTs; 4.55

441607; NM_005010; Hs.7912; neuronal cell adhesion molecule; neuronal cell adhesion molecule; 4.54
453227; AW135852; Hs.243991; ESTs; ESTs; 4.52

438459; T49300; Hs.35304; Homo sapiens cDNA FLJ13655 fis, clone PLACE1011503; Homo sapiens cDNA FLJ13655 fis, clone PL; 4.51
422565; BE259035; Hs.118400; singed (Drosophila)-like (sea urchin fascin homolog like); singed (Drosophila)-like (sea urchin fas; 4.51
463513; F06S38; Hs.374476; ESTs; ESTs; 4.50

413900; AW409747; Hs.75612; stress-induced-phosphoprotein 1 (Hsp70/Hsp90-organizing protein); stress-induced-phosphoprotein 1 (Hsp70/H; 4.50
412507; L36545; Hs.73964; EphA4; EphA4; 4.50

419034; NM_002110; Hs.89555; hemopoietic cell kinase; hemopoietic cell kinase; 4.49
413431; AW246428; Hs.75355; ubiquitin-conjugating enzyme E2N (homologous to yeast UBC13); ubiquitin-conjugating enzyme E2N (homolo; 4.49
406547; ; ; Target Exon; Target Exon; 4.49

443216; W60487; Hs.324521; hypothetical protein DC50; hypothetical protein DC50; 4.48

417497; AW402482; Hs.82212; CD53 antigen; CD53 antigen; 4.47 

445595; AB014544; Hs.21572; KIAA0644 gene product; KIAA0644 gene product; 4.47
445350; AF052112; Hs.12540; lysophospholipase I; lysophospholipase I; 4.46

446236; NM_006293; Hs.301; TYRO3 protein tyrosine kinase; TYRO3 protein tyrosine kinase; 4.46

429150; AF120103; Hs.197366; smoothened (Drosophila) homolog; smoothened (Drosophila) homolog; 4.46

420340; NM_000734; Hs.97087; CD3Z antigen, zeta polypeptide (TiT3 complex); CD3Z antigen, zeta polypeptide (TiT3 com; 4.46

413426; U88837; Hs.75354; GCN1 (general control of amino-acid synthesis 1, yeast)-like 1; GCN1 (general control of amino-acid synt; 4.44
421819; NM_013403; Hs.108665; zinedin; zinedin; 4.44

409512; AW979167; Hs.293591; melanoma differentiation associated protein-5; melanoma differentiation associated prot; 4.44

423995; AW00497S; Hs.194716; MAD (mothers against decapentaplegic, Drosophila) homolog interacting protein, receptor activation anchor; MAD (mothers against decapentaplegic, Dr; 4.43

434551; BE3871G2; Hs.280858; ESTs, Highly similar to A35661 DNA excision repair cross-complementing protein ERCC3 [H.sapiens]; ESTs, Highly similar to A35661 DNA excis; 4.43

418295; AW970043; Hs.238039; hypothetical protein FLJ11090; hypothetical protein FLJ11090; 4.42

409243; AB037761; Hs.51743; KIAA1340 protein; KIAA1340 protein; 4.42

437103; AW13940&; Hs.152940; ESTs; ESTs; 4.42 
413186; AU077141; Hs.374S48; solute carrier family 16 (monocarboxylic acid transporters), member 1; solute carrier family 16 (monocarboxylic; 4.42

456362; AW973GQ3; Hs.179909; hypothetical protein FLJ22995; hypothetical protein FLJ22995; 4.40

424078; AB006625; Hs.139033; paternally expressed 3; paternally expressed 3; 4.39

448153; Y10805; Hs.20521; HMT1 (hnRNP methyltransferase, S. cerevisiae)-like 2; HMT1 (hnRNP methyltransferase, S. cerevi; 4.37
410134; U6B140; Hs.58927; nuclear VCP-like; nuclear VCP-like; 4.36

435523; T62849; Hs.11090; membrane-spanning 4-domains, subfamily A, member 7; membrane-spanning 4-domains, subfamily A; 4.35
400440; X83957; Hs.63670; nebulin; nebulin; 4.35

437216; AI117497; Hs.5B185; ESTs, Weakly similar to T42727 proliferation potential-related protein - mouse [M.musculus]; ESTs, Weakly similar to T42727 prolifera; 4.34

430478; NM_014349; Hs.241535; apolipoprotein L, 3; apolipoprotein L, 3; 4.34

432744; AA98863&; Hs.3B664; ESTs; ESTs; 4.33
423173; AA442655; Hs.124942; protein phosphatase 2A 48 kDa regulatory subunit; protein phosphatase 2A 48 kDa regulatory; 4.33
415995; NM_004573; Hs^55888; phospholipase C, beta 2; phospholipase C, beta 2; 4.33

408728; AU37379; Hs.47125; hypothetical protein FLJ13912; hypothetical protein FLJ13912; 4.30

431222; X56777; Hs.273790; zona pellucida glycoprotein 3A (sperm receptor); zona pellucida glycoprotein 3A (sperm re; 4.30

427792; M63926; Hs.160841; tumor necrosis factor receptor superfamily, member 7; tumor necrosis factor receptor superfami; 4.29

426227; U67058; Hs.154299; Human proteinase activated receptor-2 mRNA, 3'UTR; Human proteinase activated receptor-2 mR; 4.29
427337; Z46223; Hs.176663; Fc fragment of IgG, low affinity IIIb, receptor for (CD16); Fc fragment of IgG, low affinity IIIb, r; 4.29

434826; AF155661; Hs^2265; pyruvate dehydrogenase phosphatase; pyruvate dehydrogenase phosphatase; 4.29

412314; AA625247; Hs.356084; downstream of: G protein-coupled receptor 27 (GPR27) (SREB1); downstream of: G protein-coupled recepto; 4.28
447827; U73727; Hs.19718; protein tyrosine phosphatase, receptor type, U; protein tyrosine phosphatase, receptor t; 4.28
426106; AA622037; Hs.166468; programmed cell death 5; programmed cell death 5; 4.28
42&B20; AA4361B7; Hs.172631; integrin, alpha M (complement component receptor 3, alpha; also known as CD11b (p170), macrophage antigen alpha polypeptide); integrin, alpha M (complement component ; 4.27
437908; AI082424; Hs.351043; ESTs; ESTs; 4.27

444664; M263&2; Hs.11615; map kinase phosphatase-like protein MK-STYX; map kinase phosphatase-like protein MK-S; 4.27
429Q02; AW246439; Hs.2340; junction plakoglobin; junction plakoglobin; 4.25
439334; AI148976; Hs.112062; ESTs; ESTs; 4.26

425308; M97639; Hs.155585; receptor tyrosine kinase-like orphan receptor 2; receptor tyrosine kinase-like orphan rec; 4.25
413869; NM_000878; Hs.75596; interleukin 2 receptor, beta; interleukin 2 receptor, beta; 4.25
453648; W21493; Ha^80100; hypothetical protein FLJ14005; hypothetical protein FLJ14005; 4.24

447200; BE543146; Hs.281434; Homo sapiens cDNA FLJ14028 fis, clone HEMBA1003838; Homo sapiens cDNA FLJ14028 fis, clone HE; 4.24
447528; AI612027; Hs.76277; Homo sapiens, clone MGC:9381, mRNA, complete cds; Homo sapiens, clone MGC:9381, mRNA, comp; 4.23

452721; AJ269529; Hs.301871; solute carrier family 37 (glycerol-3-phosphate transporter), member 1; solute carrier family 37 (glycerol-3-pho; 4.22
449810; AB008681; Hs.23994; activin A receptor, type IIB; activin A receptor, type IIB; 4.22
447196; D51523; Hs.283435; ESTs; ESTs; 4.22

416714; AF2a3770; Hs.79530; CD79A antigen (immunoglobulin-associated alpha); CD79A antigen (immunoglobulin-associated; 4.22
425356; BE244B79; Hs.155939; inositol polyphosphate-5-phosphatase, 145kD; inositol polyphosphate-5-phosphatase, 14; 4.22
422605; H16646; Hs.118666; hypothetical protein PP591; hypothetical protein PP591; 4.21

444535; AFDl14G6; Hs.122575; EDG4 (endothelial differentiation, lysophosphatidic acid G-protein-coupled receptor; EDG4 (endothelial differentiation, lysop; 4.21

417086; M54915; Hs.81170; pim-1 oncogene; pim-1 oncogene; 4.20

421707; Nf^01492l;Hs.1W0^;terjtomedin-2;rBcteinsd^ 
408717; AF04545S; Hs.47061; unc-51 (C. elegans)-like kinase 1; unc-51 (C. elegans)-like kinase 1; 4.20

436485; W5757B; Hs J7B71B; RAB7, member RAS oncogene family; RAB7, member RAS oncogene family; 4.19

419452; U33635; Hs.90572; PTK7 protein tyrosine kinase 7; PTK7 protein tyrosine kinase 7; 4*16 

418755; Y14443; Hs.8&219; zinc finger protein 200; zinc finger protein 200; 4.15

417212; AW952B23; Hs.351547; NS1-binding protein; NS1-binding protein; 4.17
4136B6; AI469213; Hs.71404; ESTs; ESTs; 4.17

419344; U94905; Hs.277445; diacylglycerol kinase, zeta (104kD); diacylglycerol kinase, zeta (104kD); 4.16

418870; AF147204; Hs.69414; chemokine (C-X-C motif), receptor 4 (fusin); chemokine (C-X-C motif), receptor 4 (fus; 4.16

406482; NM_000676; Hs.45743; adenosine A2b receptor; adenosine A2b receptor; 4.16
422391; D63479; Hs.115907; diacylglycerol kinase, delta (130kD); diacylglycerol kinase, delta (130kD); 4.15
409421; AA189B83; Hs.67624; ESTs; ESTs; 4.15

415938; BE333507; Hs.78921; A kinase (PRKA) anchor protein 1; A kinase (PRKA) anchor protein 1; 4.14

415196; AW00&48O; Hs.943; natural killer cell transcript 4; natural killer cell transcript 4; 4.14

4246S5; W21223; Hs.151734; nuclear transport factor 2 (placental protein 15); nuclear transport factor 2 (placental pr; 4.13
428479; Y00272; Hs.334562; cell division cycle 2, G1 to S and G2 to M; cell division cycle 2, G1 to S and G2 to; 4.13

423728; AW891294; Hs.132136; solute carrier family 4, sodium bicarbonate cotransporter, member 8; solute carrier family 4, sodium bicarbon; 4.13
433435; BE545277; Hs.340959; Ts translation elongation factor, mitochondrial; Ts translation elongation factor, mitoch; 4.13
420253; AI65BEJ55; Hs.96200; neighbor of A-kinase anchoring protein 95; neighbor of A-kinase anchoring protein 9; 4.12

412584; X54870; Hs.74085; DNA segment on chromosome 12 (unique) 2483 expressed sequence; DNA segment on chromosome 12 (unique) 24; 4.12

400205; ; Hs.81846; NM_006265*:Homo sapiens RAD21 (S. pombe) homolog (RAD21), mRNA. (APO-1/CD95 (Fas)-associated phosphatase) (PTPN13), mRNA.; NM_006265*:Homo sapiens RAD21 (S. pombe); 4.12
425322; U63630; Hs.155637; protein kinase, DNA-activated, catalytic polypeptide; protein kinase, DNA-activated, catalytic; 4.11
419607; R52557; Hs.91579; Homo sapiens clone 23783 mRNA sequence; Homo sapiens clone 23763 mRNA sequence; 4.10

459035; AW291109; Hs.332563; ESTs, Weakly similar to T31611 hypothetical protein Y50E8A.g - Caenorhabditis elegans [C.elegans]; ESTs, Weakly similar to T31611 hypotheti; 4.10

432512; NM_003284; Hs.3017; transition protein 1 (during histone to protamine replacement); transition protein 1 (during histone to ; 4.10

452075; BE275760; Hs.30928; DNA segment on chromosome 19 (unique) 1177 expressed sequence; DNA segment on chromosome 19 (unique) 11; 4.09

423804; AW4Q3448; Hs.1706; interferon-stimulated transcription factor 3, gamma (48kD); interferon-stimulated transcription fact; 4.09

413745; AW247252; Hs.75514; nucleoside phosphorylase; nucleoside phosphorylase; 4.09

442091; AW770493; Hs.182674; guanine nucleotide binding protein (G protein) alpha 12; guanine nucleotide binding protein (G pr; 4.09

425003; AF119046; Hs.154149; apurinic/apyrimidinic endonuclease(APEX nuclease)-like 2 protein; apurinic/apyrimidinic endonuclease(APEX ; 4.08

421859; AA356620; Hs.108947; KIAA0050 gene product; KIAA0050 gene product; 4.08

432841; M93425; Hs.62; protein tyrosine phosphatase, non-receptor type 12; protein tyrosine phosphatase, non-recept; 4.08
452069; AB028949; Hs.183994; KIAA1026 protein; KIAA1026 protein; 4.08

425AS9; AA687465; Hs.298184; potassium voltage-gated channel, shaker-related subfamily, beta member 2; potassium voltage-gated channel, shaker-; 4.07
418526; BE019020; Hs.85838; solute carrier family 16 (monocarboxylic acid transporters), member 3; solute carrier family 16 (monocarboxylic; 4.07
424517; A1539443; Hs.137447; Homo sapiens cDNA FU12159fls, clone MAMMA100Q643; Homo sapiens cONA FU 12153 fis, clone MA; 4.07 
434224; AA3B0731; Hs.84; interleukin 2 receptor, gamma (severe combined immunodeficiency); interleukin 2 receptor, gamma (severe co; 4.0S
446791; AI632276; Hs.195922; ESTs; ESTs; 4.06

4320B5; AA4O1039; Hs.2903; protein phosphatase 4 (formerly X), catalytic subunit; protein phosphatase 4 (formerly X), cata; 4.06

431194; D43704; Hs.250712; calcium channel, voltage-dependent, beta 3 subunit; calcium channel, voltage-dependent, beta; 4.06

418751; BE389014; Hs.372546; phosphoinositide-3-kinase, regulatory subunit, polypeptide 3 (p55, gamma); phosphoinositide-3-kinase, regulatory su; 4.06

425923; NM_005026; Ha.162B0B; phosphoinositide-3-kinase, catalytic, delta polypeptide; phosphoinositide-3-kinase, catalytic, de; 4.05

408892; AL040127; Hs.34074; dipeptidylpeptidase VI; dipeptidylpeptidase VI; 4.04

446272; BE268912; Hs.14601; hematopoietic cell-specific Lyn substrate 1; hematopoietic cell-specific Lyn substrat; 4.04

439176; AI446444; Hs.190394; ESTs, Weakly similar to B28096 line-1 protein ORF2 [H.sapiens]; ESTs, Weakly similar to B28096 line-1 pr; 4.04

4178B0; BE241595; Hs.B2a4fl; selectin L (lymphocyte adhesion molecule 1); selectin L (lymphocyte adhesion molecule; 4.04

410068; AI633888; Hs.58435; FYN-binding protein (FYB-120/130); FYN-binding protein (FYB-120/130); 4.03

410639; BE269047; Hs.65234; hypothetical protein FLJ20596; hypothetical protein FLJ20596; 4.03

427716; L38951; Hs.1B044B; karyopherin (importin) beta 1; karyopherin (importin) beta 1; 4.03

451050; AW93742Q; Hs.351859; ESTs; ESTs; 4.02 

449667; AB023227; Hs.23860; KIAA1010 protein; KIAA1010 protein; 4.02

448499; BE613280; Hs.77550; p53-regulated DDA3; p53-regulated DDA3; 4.01

437527; AI241019; Hs.145644; ESTs; ESTs; 4.01

425118; ADQ76611; Hs.154672; methylene tetrahydrofolate dehydrogenase (NAD dependent), methenyltetrahydrofolate cyclohydrolase; methylene tetrahydrofolate dehydrogenase; 4.00

451931; AK000208; Hs.27267; Homo sapiens cDNA FLJ20201 fis, clone COLF1210; Homo sapiens cDNA FLJ20201 fis, clone CO; 4.00
412939; AW411491; Hs.75069; eukaryotic translation elongation factor 1 gamma; eukaryotic translation elongation factor; 4.00
409561; U66243; Hs.55039; mitogen-activated protein kinase 12; mitogen-activated protein kinase 12; 3.99
433577; AW007080; Hs.284192; ESTs; ESTs; 3.99

439963; AW247529; Hs.6793; platelet-activating factor acetylhydrolase, isoform Ib, gamma subunit (29kD); platelet-activating factor acetylhydrola; 3.99
418629; BE247550; Hs.86859; growth factor receptor-bound protein 7; growth factor receptor-bound protein 7; 3.99
44BS33; AA311426; Hs.21635; tubulin, gamma 1; tubulin, gamma 1; 3.98 

402398; ; ; Ct9000263Dgl|3106Q23|gb|AAC15755,1| (AO0O4659) BC62940.2 [Homo aapiens]HS6335; Cl900O263:gi|310B023JgbjAAC15755,1| (AGO; 3J7 
408414; AM 14688; Hs.193400; ESTs, Weakly similar to 2109260A B cell growth factor [H.sapiens]; ESTs, Weakly similar to 2109260A B cell ; 3.97
415012; NM_004383; Hs.77793; c-src tyrosine kinase; c-src tyrosine kinase; 3.97

4160B4; L16S91; Hs.79Q06; deoxythymidylate kinase (thymidylate kinase); deoxythymidylate kinase (thymidylate kin; 3.95

422051; AW327546; Hs.111024; solute carrier family 25 (mitochondrial carrier; citrate transporter), member 1; solute carrier family 25 (mitochondrial ; 3.95
447867; AA114050; Hs.211610; caspase 8, apoptosis-related cysteine protease; caspase 8, apoptosis-related cysteine pr; 3.95
430770; AA765694; Hs.123236; ESTs; ESTs; 3.94 
442994; AI026718; Hs.16954; ESTs; ESTs; 3.94

420333; AJ001383; Hs.97064; lymphocyte antigen 94 (mouse) homolog (activating NK-receptor; NK-p46); lymphocyte antigen 94 (mouse) homolog (a; 3.94
438458; AA913381; Hs.279763; ESTs; ESTs; 3.94 

422599; BE387202; Hs.11B636; non-metastatic cells 1, protein (NM23A) expressed in; non-metastatic cells 1, protein (NM23A) ; 3.93
420162; BE37B432; Hs.95577; cyclin-dependent kinase 4; cyclin-dependent kinase 4; 3.93

424829; NM_002507; Hs.1827; nerve growth factor receptor (TNFR superfamily, member 16); nerve growth factor receptor (TNFR super; 3.93
447574; AF162666; Hs. 18&95; tousled-like kinase 1; tousled-like kinase 1; 3.93

425797; AF00298B; Hs.159545; platelet activating receptor homolog; platelet activating receptor homolog; 3.93

421910; NM_014586; Hs.109437; hormonally upregulated neu tumor-associated kinase; hormonally upregulated neu tumor-associa; 3.92

413781; J05272; Hs.850; IMP (inosine monophosphate) dehydrogenase 1; IMP (inosine monophosphate) dehydrogenas; 3.92

434334; AA912476; MB.11B750; Homo sapiens cDNA FLJ13221 fis, clone NT2RP4002075; Homo sapiens cDNA FLJ13221 fis, clone NT; 3.92

400262; ; Hs.75309; Eos Control; Eos Control; 3.90

424977; AA3492B9; Hs.100057; Homo sapiens cDNA: FLJ22902 fis, clone KAT05581; Homo sapiens cDNA: FLJ22902 fis, clone K; 3.90
409799; D11926; Hs.76845; phosphoserine phosphatase-like; phosphoserine phosphatase-like; 3.90
435206; AI432364; Hs.160594; ESTs; ESTs; 3.90

439863; BE547B30; Hs.37520fl; paired immunoglobulin-like receptor beta; paired immunoglobulin-like receptor beta; 3.90
413627; BE182082; Hs J 246S73; intron of Bicaudal D homolog 1; intron of Bicaudal D homolog 1; 3.90
426265; AA421059; Hs.97898; ESTs; ESTs; 3.89

451063; AW163702; Hs.25911; HLA-B associated transcript-2; HLA-B associated transcript-2; 3.89
407013; U35637; Hs.63870; gb:Human nebulin mRNA, partial cds; gb:Human nebulin mRNA, partial cds; 3.89
437239; AW503395; Hs.5541; ATPase, Ca transporting, ubiquitous; ATPase, Ca transporting, ubiquitous; 3.88
400261; ; Hs.1B02; Eos Control; Eos Control; 3.88 

460447; AF212223; Hil25010; hypothetical protein P15-2; hypothetical protein P15-2; 3.88

422293; X94453; Hs.1143G6; pyrroline-5-carboxylate synthetase (glutamate gamma-semialdehyde synthetase); pyrroline-5-carboxylate synthetase (glut; 3.87
414251; AU042306; Hs.97669; VASA protein; VASA protein; 3.87

417767; BE242241; Hs.82542; acyloxyacyl hydrolase (neutrophil); acyloxyacyl hydrolase (neutrophil); 3.87

414443; AU077268; Hs.76144; platelet-derived growth factor receptor, beta polypeptide; platelet-derived growth factor receptor,; 3.87
444099; D87432; Hs.10315; solute carrier family 7 (cationic amino acid transporter, y+ system), member 6; solute carrier family 7 (cationic amino ; 3.86
419596; BE379320; Hs.91448; MKP-1 like protein tyrosine phosphatase; MKP-1 like protein tyrosine phosphatase; 3.86
427022; AW245839; Hs.173255; small nuclear ribonucleoprotein polypeptide A; small nuclear ribonucleoprotein polypept; 3.86
443661; AA3366Q9; Hs.10862; Homo sapiens cDNA: FLJ23313 fis, clone HEP11919; Homo sapiens cDNA: FLJ23313 fis, clone H; 3.86
408056; AA312329; Hs.42331; ephrin-A4; ephrin-A4; 3.86

410552; X66945; Hs.748; fibroblast growth factor receptor 1 (fms-related tyrosine kinase 2, Pfeiffer syndrome); fibroblast growth factor receptor 1 (fms; 3.6$
450778; U81375; Hs.25450; solute carrier family 29 (nucleoside transporters), member 1; solute carrier family 29 (nucleoside tra; 3.85
418976; T85295; Hs.268606; ESTs; ESTs; 3.84

429903; AL134197; Hs.93597; cyclin-dependent kinase 5, regulatory subunit 1 (p35); cyclin-dependent kinase 5, regulatory su; 3.84
10 442980; AAa57025;fe.8878;lanBs^^ 

447232; AW499834; Hs.327; interleukin 10 receptor, alpha; interleukin 10 receptor, alpha; 3.84

434689; AF154115; Hs.4076; CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) phosphatase, subunit 1; CTD (carboxy-terminal domain, RNA polyme; 3.83
4325a9;Ai13ai69;HsJ^378;kaTyophen^beta2b l lrans W ^ 
415684; D59356; Hs.374480; sorbitol dehydrogenase; sorbitol dehydrogenase; 3.83
461598; N2910£; Hs.79658; ESTs; ESTs; 3.82

449433; AJ672096; Hs.901 % ESTs. WeaWy similar to S26650 DNA-Wndlng protein 5 [rtsapiens}; ESTs, Weakly smilar to S26650 DNA-bindr 3 32 
426359; AA376409; Hs.10862; Homo sapiens cDNA: FLJ23313 fis, clone HEP11919; Homo sapiens cDNA: FLJ23313 fis, clone H; 3.81
435160; AB002374; Hs.4791; KIAA0376 protein; KIAA0376 protein; 3.80

443402; U77846; Hs.9295; elastin (supravalvular aortic stenosis, Williams-Beuren syndrome); elastin (supravalvular aortic stenosis, ; 3.80
422753; AJ92B996; Hs.1575; small nuclear ribonucleoprotein D3 polypeptide (18kD); small nuclear ribonucleoprotein D3 polyp; 3.79
421S08; NM_004833; Hs.105115; absent in melanoma 2; absent in melanoma 2; 3.79
414806; D14694; Hs.77329; phosphatidylserine synthase 1; phosphatidylserine synthase 1; 3.79

428023; A103B843; Hs.374530; Homo sapiens cDNA: FLJ23602 fis, clone LNG15735; Homo sapiens cDNA: FLJ23602 fis, clone L; 3.79

421654; AW163267; Hs.106469; suppressor of var1 (S.cerevisiae) 3-like 1; suppressor of var1 (S.cerevisiae) 3-like; 3.79
439668; AI091277; Hs.302634; frizzled (Drosophila) homolog 8; frizzled (Drosophila) homolog 8; 3.79

411125; AA151647; Hs.68877; cytochrome b-245, alpha polypeptide; cytochrome b-245, alpha polypeptide; 3.78

429170; NM_001394; Hs.2359; dual specificity phosphatase 4; dual specificity phosphatase 4; 3.78

425354; U62027; Hs.155935; complement component 3a receptor 1; complement component 3a receptor 1; 3.78
423809; AJ223183; Hs.135194; immunoglobulin superfamily, member 6; immunoglobulin superfamily, member 6; 3.78
429667; AI675749; Hs.211608; nucleoporin 153kD; nucleoporin 153kD; 3.77

414177; AI351355; Hs.356303; uncharacterized hypothalamus protein HARP11; uncharacterized hypothalamus protein HAR; 3.77

414135; NM_004419; Hs.2128; dual specificity phosphatase 5; dual specificity phosphatase 5; 3.77

445817; NM_003642; Hs.13340; histone acetyltransferase 1; histone acetyltransferase 1; 3.77

410422; AL042014; Hs.63348; Homo sapiens, clone MGC:15203, mRNA, complete cds; Homo sapiens, clone MGC:15203, mRNA, com; 3.76
412146; M92444; Hs.73722; APEX nuclease (multifunctional DNA repair enzyme); APEX nuclease (multifunctional DNA repai; 3.76
41301 1;AV^B115;ris.W1;M9lycan;KntyDan;a76 
428157; AJ73B719; Hs.196427; hexokinase 2; hexokinase 2; 3.76

400288; X082S6; Hs.149509; integrin, alpha 5 (fibronectin receptor, alpha polypeptide); integrin, alpha 5 (fibronectin receptor,; 3.75
427378; BE515837; Hs.177555; melanoma antigen, family D, 1; melanoma antigen, family D, 1; 3.75

405484; ; ; C30rj2I24^gll1273728u1reft^J»S6BZ2|kera1in 18 [Homo saojens]|J6S33: C3002124^ili2737280|refp(P_OG66B2.2| k; £75 
450998; BE387814; Hs.2S797; splicing factor 3b, subunit 4, 49kD; splicing factor 3b, subunit 4, 49kD; 3.75
432460; H12912; Hs.274691; adenylate kinase 3; adenylate kinase 3; 3.75

428816; AA004986; Hs.193852; ATP-binding cassette, sub-family C (CFTR/MRP), member 2; ATP-binding cassette, sub-family C (CFTR; 3.74

431884; AA521246; Hs^10792; ESTs, Weakly similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]; ESTs, Weakly similar to ALU8_HUMAN ALU S; 3.74
453329; T97205; Hs.193400; ESTs, Weakly similar to 2109260A B cell growth factor [H.sapiens]; ESTs, Weakly similar to 2109260A B cell ; 3.74
426440; BE382756; Hs.169902; solute carrier family 2 (facilitated glucose transporter), member 1; solute carrier family 2 (facilitated glu; 3.74
43B330; AW450572; Hs.257316; ESTs; ESTs; 3.74

419911; L153Q1; Hs.1276; BN51 (BHK21) temperature sensitivity complementing; BN51 (BHK21) temperature sensitivity com; 3.74
452695; AW7B0199; Hs.30327; mitogen-activated protein kinase-activated protein kinase 5; mitogen-activated protein kinase-activat; 3.73

44223^; AW987149; Hs.28439; ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens]; ESTs, Weakly similar to I38022 hypotheti; 3.73
451295; AI557212; Hs.17132; ESTs, Moderately similar to I54374 gene NF2 protein [H.sapiens]; ESTs, Moderately similar to I54374 gene ; 3.73
41077% BE275297; rls.194685; Homo sapiens done 24875 mRNA sequence; Homo sapiens clone 24675 mRNA sequence; 3.73 
426251; M24283; Hs.1 6B393; biterceTtular adhesion molecule 1 (GD54), human rhinovvus receptor; InterceTuter adhesion motecuTe 1 [CD541* 3 72 
33 44W3;RB5337:Hs^4090;w)lubcai^^ 

423523; AW299828; Hs.193580; ESTs; ESTs; 3.71
413407; AB56283; Hs.75339; inositol polyphosphate phosphatase-like 1; inositol polyphosphate phosphatase-like ; 3.71
448336; R53848; Hsv4497fi; ESTs; ESTs; 3.70 

422083; NM_001141; Hs.111256; arachidonate 15-lipoxygenase, second type; arachidonate 15-lipoxygenase, second typ; 3.70
416087; AF045184; Hs.79008; SKI-INTERACTING PROTEIN; SKI-INTERACTING PROTEIN; 3.70
442200; AW590572; Hb^35766; ESTs; ESTs; 3.70
414280; BE410769; Hs.75873; zyxin; zyxin; 3.69
409354; N68188; Hs.159472; Homo sapiens cDNA: FLJ22224 fis, clone HRC01703; Homo sapiens cDNA: FLJ22224 fis, clone H; 3.69
A< 4152re;U8B6^H3.7B353;SFRSpittonta>a©e2;SFRSpr^ 

0!> 15!!S ^SlSSi"^ 594 ^ !ouclne ' rich ^peat-containing G proteiiwoupled receptor 6 (LGR6) mRNA; feucine-rlch repeahjontatotng G protein; 3.69 
446522; NM_003876; Hs.15198; putative receptor protein; putative receptor protein; 3.69
422785; Al6^114; Hs.289088; heat shock 90kD protein 1, alpha; heat shock 90kD protein 1, alpha; 3.68

401OK5; ; ; NM_016582*:Homo sapiens peptide transporter 3 (LOC5129S), mRNA. VERSION NMJJ1 6579,1 GI; NM_016582*:Homo sapiens peptide transpor; 3.68
413048; M93221;Hs.751B2;matuwsere^^ 
452690; AI536070; Hs.16085; ESTs; ESTs; 3.68

42 ^to^Ul^l^i?S^ W ^ 8lniBarto ALU?J<UMMI ALU SUBFAMILY SB SEQUENCE CONTAMINATION WARNING ENTRY TKUaprere]; ESTs, Weakly similar 

415010; NM_004203; Hs.77783; membrane-associated tyrosine- and threonine-specific cdc2-inhibitory kinase; membrane-associated tyrosine- and threon; 3.68
428579; NM_005755; Hs.184942; G protein-coupled receptor 64; G protein-coupled receptor 64; 3.68
446430; AA346637; Hs.15075; hypothetical protein DKFZp434E2216; hypothetical protein DKFZp434E2216; 3.68

442013; AA506476; te.375KH; Human DNA sequence from clone RP11-353C18 on chromosome 20 Contains ESTs, STSs, GSSs and CpG islands. Contains the NFS gene for cysteine desulfurase, two genes for novel proteins and the gene for the; Human DNA sequence from clone RP11-353C1; 3.66
416602; NM_006159; Hs.367895; protein kinase C-binding protein NELL2; protein kinase C-binding protein NELL2; 3.65
44122^; BE563042; Hs.118820; Homo sapiens, Similar to RIKEN cDNA 0610012G03 gene, clone MGC:14132, mRNA, complete cds; Homo sapiens, Similar to RIKEN cDNA 0610;
413076; U10584; Hs.75186; wee1 (S. pombe) homolog; wee1 (S. pombe) homolog; 3.65

429303; AW137635; Hs.44238; ESTs, Weakly similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens]; ESTs, Weakly similar to S65657 alpha-1C-; 3.65
452060; W26980; Hs.349089; ATP-binding cassette, sub-family F (GCN20), member 2; ATP-binding cassette, sub-family F (GCN2; 3.65
451558; NM_001089; Hs.25630; ATP-binding cassette, sub-family A (ABC1), member 3; ATP-binding cassette, sub-family A (ABC1; 3.65
414774; X02419; Hs.77274; plasminogen activator, urokinase; plasminogen activator, urokinase; 3.65
417428; NM_002291; Hs.82124; laminin, beta 1; laminin, beta 1; 3.65

450296; AL041949; Hs.24756; hepatocyte growth factor-regulated tyrosine kinase substrate; hepatocyte growth factor-regulated tyros; 3.64
437669; AI353105; Hs.123164; ESTs, Weakly similar to match to ESTs AA657999 [H.sapiens]; ESTs, Weakly similar to match to ESTs AA; 3.64
414029; BE297731; Hs.75709; mannose-6-phosphate receptor (cation dependent); mannose-6-phosphate receptor (cation dep; 3.64
444383; AB033058; Hs.11101; KIAA1232 protein; KIAA1232 protein; 3.64

425910; AA830797; Hs.184760; CCAAT-box-binding transcription factor; CCAAT-box-binding transcription factor; 3.63
451484; AV648896; Hs.283771; hypothetical protein; hypothetical protein; 3.63
444613; H29627; Hs.79092; hypothetical protein FLJ14427; hypothetical protein FLJ14427; 3.63
447495; AW401664; Hs.ie72Q; programmed cell death 8 (apoptosis-inducing factor); programmed cell death 8 (apoptosis-induc; 3.62
408101; AW965504; Hs.276348; CDC2-related protein kinase 7; CDC2-related protein kinase 7; 3.62
424732; DBO0D1; Hs.152629; KIAA0179 protein; KIAA0179 protein; 3.62
411165; NM_000169; Hs.69089; galactosidase, alpha; galactosidase, alpha; 3.62
422112; BE540240; Hs.111783; Lsm1 protein; Lsm1 protein; 3.62

453020; AL162039; Hs.31422; Homo sapiens mRNA; cDNA DKFZp434M229 (from clone DKFZp434M229); Homo sapiens mRNA; cDNA DKFZp434M229 (fr; 3.61
438795; AA825792; Hs.377119; gb:od64b1 U1 NCI_CGAP_Ov2 Homo sapiens cDNA clone, mRNA sequence; gb:od64b1 U1 NCI_CGAP_Ov2 Homo sapiens; 3.61
445515; BE388665; Hs.179999; Homo sapiens, clone IMAGE:3457003, mRNA; Homo sapiens, clone IMAGE:3457003, mRNA; 3.61
407797; AK000524; Hs.3965Q; hypothetical protein FLJ20517; hypothetical protein FLJ20517; 3.60
423217; NM_000094; Hs.1640; collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive); collagen, type VII, alpha 1 (epidermolys; 3.60
444985; AI677737; Hs.380100; hypothetical protein FLJ14005; hypothetical protein FLJ14005; 3.60
433764; AW753676; Hs.39982; zinc finger protein RINZF (NM_023929); zinc finger protein RINZF (NM_023929); 3.59
427657; AL133017; Hs*28ab79; hypothetical protein FLJ22865; hypothetical protein FLJ22865; 3.59

415020; BE24991S; Hs.293533; Human DNA sequence from clone RP11-127L20 on chromosome 10. Contains ESTs, STSs, GSSs and CpG islands. Contains the gene for a novel glutathione-S-transferase and five novel genes; Human DNA sequence from clone RP11-127L2; 3.59
415149; X12451; HsJBOSS; cathepsin L; cathepsin L; 3.57
458715; AK000973; Hs.1706; hypothetical protein FLJ10111; hypothetical protein FLJ10111; 3.57
423576; NM_000383; Hs.129829; autoimmune regulator (automimmune polyendocrinopathy candidiasis ectodermal dystrophy); autoimmune regulator (automimmune polyen; 3.57
440270; NM_0159B5^HS.7120; cytokine rece^ikarr^ ^ 1 ^ ' 

434883; AW381533; Hs.19a07; hypothetical protein MGC12959; hypothetical protein MGC12959; 3.57
404976; ; ; NM_014323*:Homo sapiens zinc finger protein 278 (ZNF278), transcript variant t, mRNA.; NM_014323*:Homo sapiens zinc finger prot; 3.57
449656; AA002008; Hs.188633; ESTs; ESTs; 3.56
413795; AL040178; Hs.142003; ESTs; ESTs; 3.56
40685S; AI581134; Hs.161357; laminin receptor 1 (67kD, ribosomal protein SA); laminin receptor 1 (67kD, ribosomal prot; 3.56
411030; BE387193; Hs.67896; 7-60 protein; 7-60 protein; 3.56

447079; AA280057; Hs.105280; ESTs, Weakly similar to dJ963K23.2 [H.sapiens]; ESTs, Weakly similar to dJ963K23.2 [H.sa; 3.56
424263; M77640; Hs.1757; L1 cell adhesion molecule (hydrocephalus, stenosis of aqueduct of Sylvius 1, MASA (mental retardation, aphasia, shuffling gait and adducted thumbs) syndrome, spastic paraplegia 1); L1 cell adhesion molecule (hydrocephalus,; 3.55
413472; BE242870; Hs.75379; solute carrier family 1 (glial high affinity glutamate transporter), member 3; solute carrier family 1 (glial high affi; 3.55
443466; BE243123; Hs.321045; IKK-related kinase epsilon; inducible IkappaB kinase; IKK-related kinase epsilon; inducible Ik; 3.55

i? 3 ^ 5 '' ^- 2D57; uridine monophosphate synthetase (orotate phosphoribosyl transferase and orotidine-5'-decarboxylase); uridine monophosphate synthetase (orotat; 3.56
450931; N25158; Hs.25648; tumor necrosis factor receptor superfamily, member 5; tumor necrosis factor receptor superfami; 3.55
425536; AW955696; Hs.90950; ESTs; ESTs; 3.54
441054; AAW3591; Hs.126480; ESTs; ESTs; 3.54
440592; AL137266; Hs.7285; KIAA0759 protein; KIAA0759 protein; 3.54
45S046; AAD09716; Hs.42311; ESTs; ESTs; 3.53
449027; AJ271216; Hs.22680; dipeptidylpeptidase III; dipeptidylpeptidase III; 3.53
421662; NM_014141; Hs.106552; cell recognition molecule Caspr2; cell recognition molecule Caspr2; 3.53
422732; AA577455; Hs.24937; transformer-2 alpha (htra-2 alpha); transformer-2 alpha (htra-2 alpha); 3.53
424370; T16545; Hs.244624; ESTs; ESTs; 3.52

442794; AJ744130; Hs.356753; hypothetical protein MGC2975; hypothetical protein MGC2975; 3.52
417640; D30857; Hs.82353; protein C receptor, endothelial (EPCR); protein C receptor, endothelial (EPCR); 3.51
*]*gP>; AA400027; Hs.296234; ESTs, Weakly similar to T31613 hypothetical protein Y50E8A1 - Caenorhabditis elegans [C.elegans]; ESTs, Weakly similar to T31613 hypotheti; 3.51
410257; BE244044; Hs.61469; hypothetical protein; hypothetical protein; 3.51
424637; BE276113; Hs.333034; N-acetyltransferase, homolog of S. cerevisiae ARD1; N-acetyltransferase, homolog of S. cerev; 3.51
421921; H83363; Hs.355993; translocase of inner mitochondrial membrane 10 (yeast) homolog; translocase of inner mitochondrial membr; 3.50
454128; AL031259; Hs.367S0D; programmed cell death 2; programmed cell death 2; 3.50
43404^; AA501430; Hs.5771; amyotrophic lateral sclerosis 2 (juvenile) chromosome region, candidate 2; amyotrophic lateral sclerosis 2 (juvenil; 3.50
453641; AA444140; Hs.90960; ESTs; ESTs; 3.50
429592; AB029041; Hs.209646; KIAA1118 protein; KIAA1118 protein; 3.49

«rj847;ACTJ036B2;rte.127&8B; ESTs^^ 

410855; X9779S Hs.6b718; rU05^ (Sj^^ ' 

413372; H55532; Hs^49695; tubulin, alpha 2; tubulin, alpha 2; 3.07
437224; AL117628; Hs.97808; ESTs; ESTs; 2.77
430439; AL133661; ; DKFZP434B061 protein; DKFZP434B061 protein; 2.78
435BA7; AF269223; Hs.128322; t-complex 11 (a murine tcp homolog); t-complex 11 (a murine tcp homolog); 2.53

tt^™ 7 ! 6 ? 13 ' Slm ? 3r 10 154383 chromcso ™ segrcg^Drt protein smel rjisaplens]; ESTs. WeaWy stmllar to l543B3chramosom; 2AB 
412026; AA383618; Hs.73073; testis-specific ankyrin motif containing protein; testis-specific ankyrin motif containing; 2^5
4227fl9;AK001113;Hs.120842;hypometkdprotekiFU10251;^ 
426827; AF0123S9; Hs.1S5685; ESTs; ESTs; 2.12
438981; AF085884; Hs.20029; proacrosin binding protein sp32 precursor; proacrosin binding protein sp32 precurso; 2.07
425709; AA383076; Hs.159274; outer dense fibre of sperm tails 1; outer dense fibre of sperm tails 1; 159
433724; AI827749; Hs.144924; serine/threonine protein kinase SSTK; serine/threonine protein kinase SSTK; 1.68
420710; NM_007009; Hs.9flB75; zona pellucida binding protein; zona pellucida binding protein; 1.54

TABLE 57B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT Number Accession 



417866 
432407 



434414 



1031334J 
MHI429J2 



35978 1 



427298 
427521 

407347 
430439 

TABLE 57C



115241 J 
513212_1 

810943J 
6750^2 



AA21 0987 D57294 AA21 45B4 AA207006 D56572 

BG036675 BF772O05BF771866 BG960386 BG9603B1 NMJB5712AF11D315 BE074534 BE16277B BE158000 BE157999 BE714315 AWHlftma 

5S2S2 BF739224 BG98B^5AK057283 BIB61466 AA663341 AA457591 BG949294 AW3928B6 AA0711ZT AA^Wg 

AA5a4918BG95957OBF773465AJ,O41698BF959313R57170C1BB59BF770411 BF77129BAW75321 L13^AA2l67008miB&^S1fl5q 
B6537068 C18935AA1 55719 BF771172BF769107 8F604064 AWB18172 AVWB1«AW392^^ 

AW8616S7 AW321B26 BIQ55726 BF242643 AA2071 89 BF770412 BF771 157 BG430Q30 AA055M2 F74S2 " M179928 

AF134164 BF8Q9407 AA21B567 BF842863 A1267168 BF876178 BGS99253 AWBS1851 AWB58362 AJ817548 BF771300 AA1 13928 AA27H3? 
i*™ B^^BPnaSB BBW393 BE073424 BE142245H59571 HmBHnHBBmSS 

A«23267 BG997B95 BG997897 AW991857 AA534354 BG319501 BF736309 AJ6342B5 AA045564 BG950256 AI8293D9 BG987850 BE09317S 
AA933717 BF061B97 AW628327 AA64178B AA400495 

AW&73352BF222929AW01G853 BF05&13G A1651829 BE561767 AA568414 A1339359 BF059601 A1961162AI341422 AI2D5248 AI20B16«i 
AA548736 AA768578 A1539081 AW025957 AA73B837 r^75 AW5943S7 AA480892 *»"«"*™ AIZD5248AI20B165 

T23614AB66765 

AL133561 AI117481 AI122069 AW438292 AI968826 AL041090



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
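The Nt_position entries in this table are comma-separated lists of start-end pairs, one pair per predicted exon. A minimal Python sketch of how such a field could be parsed is given here for illustration. The function names `parse_exon_ranges` and `total_exon_length` are hypothetical, and treating the coordinates as inclusive is an assumption that the table does not state.

```python
from typing import List, Tuple


def parse_exon_ranges(field: str) -> List[Tuple[int, int]]:
    """Parse an Nt_position field such as '37395-37514,37806-37981'."""
    ranges: List[Tuple[int, int]] = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma
        start_text, end_text = part.split("-", 1)
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start exceeds end in range {part!r}")
        ranges.append((start, end))
    return ranges


def total_exon_length(ranges: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in ranges)


# One of the Nt_position entries listed in this table.
exons = parse_exon_ranges("37395-37514,37806-37981")
print(exons)
print(total_exon_length(exons))
```

Entries whose digits were damaged in extraction would raise an error in `int()` rather than be silently accepted, which seems the safer behaviour for this kind of data.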

Nt_position

84187-84744 

137634-137768,139702-139893,140475-14059
113765-113910,115653-115765,11680^11694 
37395-37514,37806-37981 
74502-74703 
30487-31058 
24712-25374 

22135^2309,23063-23238 
132079-132216 
113086-114800 
172780-17435B 
24019^4973 

199214-199579,199672-199920,200262-20049
33192-33360 
139625-140632 



Pkey; Ref; Strand
402199; 8576116; Minus
402680; 8113438; Plus
402260; 3399665; Minus
402678; 8113438; Plus
403171; 9838164; Minus
406137; 9166422; Minus
401704; 3097841; Plus
402677; 8113438; Plus
402679; 8113438; Plus
402145; 8018280; Plus
406547; 7711513; Minus
402395; 4092817; Minus
405484; 5922025; Plus
401083; 3242744; Plus
404976; 3419864; Minus






TABLE 58A: 434 genes upregulated in bladder cancer relative to normal body tissues

Table 58A lists about 434 genes upregulated in bladder cancer relative to normal body tissues that are likely to encode proteins amenable to modulation by small molecules. [illegible] the relative level of mRNA expression. The protein products of these genes often contain one or more domains

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar accession number, GenBank accession number
UnigeneID: UniGene number
PredProtDomains:
UnigeneTitle: UniGene gene title
R1: 90th percentile of bladder tumor AIs divided by the 50th percentile of normal tissue AIs
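The R1 column is defined as a ratio of two percentiles, which can be illustrated with a short Python sketch. This is only an illustration under stated assumptions: the text does not define "AI" (it is taken here to be a per-sample expression intensity value), the linear-interpolation percentile method is an assumption, and the function names and sample values are hypothetical.

```python
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile with linear interpolation between ranks (assumed method)."""
    if not values:
        raise ValueError("at least one value is required")
    ordered = sorted(values)
    position = (pct / 100.0) * (len(ordered) - 1)
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


def r1_ratio(tumor_ai: Sequence[float], normal_ai: Sequence[float]) -> float:
    """90th percentile of tumor AIs divided by 50th percentile of normal AIs."""
    denominator = percentile(normal_ai, 50)
    if denominator <= 0:
        raise ValueError("normal-tissue median must be positive")
    return percentile(tumor_ai, 90) / denominator


# Hypothetical intensity values, not taken from the table.
tumor_samples = [120.0, 340.0, 560.0, 410.0, 980.0]
normal_samples = [25.0, 40.0, 30.0, 55.0, 35.0]
print(round(r1_ratio(tumor_samples, normal_samples), 2))
```

Using a high percentile for tumors and the median for normal tissue means a gene can score well when only a subset of tumors express it strongly, which is consistent with how the column is described.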

Pkey; ExAccn; UnigeneID; Unigene Title; PredProtDomains; R1

430630; AW269920; Hs.2621; cystatin A (stefin A); cystatin;TM=M;; 35.25
422282; AF019225; Hs.114309; apolipoprotein L; MotA_ExbB;TM=Y;SS=M; 33.25

1155? I?? 85591 Hs76d 22: phosphdlpase A2, group ||a (platelet ; phoslipcTM^M:SS^Y; 31.6B 

TjSJSf M2J^ ; ]^^a^redUdw»ftirtliy 1 f member C3; aWoJceLrediTM^Mi^l^ 

417771; AA804698; Hs.82547; retinoic acid receptor responder (tazaro; none,none; 2&50
413859; AW992356; Hs.8364; Homo sapiens pyruvate dehydrogenase kina; SAM_PNT,none; 25.38
418818; AA228899; Hs.101307; Homo sapiens HUT11 protein mRNA, partial; UT,none; 25.28

255? iJSSi^i! 5 ? 165 taP** 0 ™™ (DNA) li alpha {170kD); rMAjyrasBB.DNAJopoisol^ 23.58 

421733; AL119671; Hs.1420; fibroblast growth factor receptor 3 (ach; ig,pkinase;TM=Y;SS=M; 21.24
424008; R02740; Hs.137555; putative chemokine receptor; GTP-binding; 7tm_1;TM=Y;SS=M; 21X45
447343; AA256641; Hs.236594; ESTs, Highly similar to S02392 alpha-2-m; none,none; 19J8

408243; Y00787; Hs.624; interleukin 8; HLH,PAS,IL8;TM=M;; 18.90
427490; Z95152; Hs.178695; mitogen-activated protein kinase 13; pkinase;TM=M;; 18.75
410887; U24389; Hs.G5436; lysyl oxidase-like 1; Lysyl_oxidase;SS=M; 18.63

444381; BE387335; Hs.283713; ESTs, Weakly similar to S64054 hypotheti; Collagen;TM=M;SS=M; 18.60
452747; BE153855; Hs.61460; Ig superfamily receptor LNIR; fg.FUiabd.glycopcTM^SS-M; 18.55
41S444; BE247295; Hs.78452; solute carrier family 20 (phosphate tran; PHO4,LIM;TM=M;; 18.25
413132; NM_006823; Hs.75209; protein kinase (cAMP-dependent, catalyti; PKI;SS=M; 17.73
427337; Z46223; Hs.176663; Fc fragment of IgG, low affinity IIIb, r; ig;TM=Y;SS=M; 17.68
45074S; D82673; Hs.276559; general transcription factor II, I; none,SH3,PX; 17.12
418945; BE246762; Hs.89499; arachidonate lipoxygenase; lipoxygenase,PLAT;TM=M;; 16.88
42G9B1; L409O4; Hs.100724; peroxisome proliferative activated recep; hormone_rec,zf-C4;TM=M;; 16.78
439941; AI392640; Hs.18272; amino acid transporter system A1; Aa_trans;TM=Y;; 16.75
431848; BE019924; Hs.271580; uroplakin 1B; transmembrane4;TM=Y;SS=M; 16.56

?™ 7: X14008 J ^S-234734; tysozyme (renal amyloidosis); lys.i0,FAD_Synth Jdh.ldh C,ptoiasB;SS=ft/l; 16.43 

414883;M92696ft;C0«8 protein Wnase1;CKS;: 16.20 
438091; AW373C62; ; nuclear receptor subfamily 1, group I, m; hormone_rec,zf-C4,none; 15.90

439963; AW247529; Ks.6793; platetet-acHvaNng factor acetylhydrola; PAF-^H Ib.Upase.GDSUTf^M- 15l70 

428450; NM_014791; Hs.184339; KIAA0175 gene product; KA1,pkinase;TM=M;; 15.63
AF072873; Hs.1 1421*; frizzled (Drosophila) homolog 6; Fz,Frizzled,7tm_2;TM=Y;SS=M; 15.45
434293; NM_004445; Hs.3796; EphB6; EPH_lbd,fn3,pkinase,SAM;TM=Y;SS=M; 15.43

HISS? S^ISS!^? 4 * 66,6451111 L ^P 1100 ^ ^edbB molecule; EGF,)ecIln_c^ oshi;TM=M;SS=M; 15,28 

zi^V ^S? 2250: Hs - 1008 ^ Potassium IntarrnedJate/smaJlcOTdudance; CaMBO,^hmr^JonJrans;TM= ; Y;SS=rvI; 15 10 

416305; AU076828; Hs.79187; coxsackie virus and adenovirus receptor; ig;TM=Y;SS=M; 14.90
432306; Y18207; Hs.303090; protein phosphatase 1, regulatory (inhib; CBM_21;TM=M;; 14.80
413076; U10SS4; Hs.75188; wee1 (S. pombe) homolog; pkinase;TM=M;; 14.73
429345; R11141; Hs.199695; hypothetical protein; K_tetra,SAH; 14.58

^230; BE613348; Hs.21 1579; mefanoma cell adhesion rrrtecde; ig/rsoih^b^^ 14.55 

421508; NM_004833; Hs.105115; absent in melanoma 2; PAAD_DAPIN,HIN;TM=M;; 14.53
446006; NM_004403; Hs.13530; deafness, autosomal dominant 5; none;TM=M;SS=M; 14.35
429556; AW13S399; Hs.99988; ESTs; none;TM=M;; 14.18
416224; NM_002902; Hs.J90B8; reticulocalbin 2, EF-hand calcium bindin; efhand;SS=M; 14.13
429573; AA884407; Hs.211595; protein tyrosine phosphatase, non-recept; Y_phosphatase,Band_41,PDZ;SS=M; 13.90

426657; NMJ>15865;Hs.171731;soluta carrier My 14 {orea Iraispcrt liTH^; 13» 

426157; AI738719; Hs.198427; hexokinase 2; hexokinase,hexokinase2,none; 13.80

5225?? ^^^3^H^aap<"nR8^n^elated; EGF^WLrecepU-loljeci^L^ 13.78 
439453; BE264974; Hs.6566; thyroid hormone receptor interactor 13; AAA,ABC_tran,Co*;TM=M;; 13.38
432314; AA533447; Hs.312989; ESTs; Xlink,none; 13.25
413109; AW389845; Hs.110855; ESTs; PHO4,none; 13.15
42449ft; AJ278016; Hs.55565; ankyrin repeat domain 3; ank,pkinase;TM=M;; 13.13
426490; NM_001621; Hs.170087; aryl hydrocarbon receptor; PAC,PAS;TM=M;; 12.93
425177; AF127577; Hs.155017; nuclear receptor interacting protein 1; none;SS=M; 12.38
404942; U30825; ; splicing factor, arginine/serine-rich 9; CD36;TM=Y;SS=M; 12.03

A(\ 1^? SS!Sf e: i? t2 ^5? ! , CEGP1 protein: EGF ^Kj*.granulln t CuB,KaratruB2 ( 7lLrTM-M;SS=M; 11,83 
4U W195;BE148235;Hs.193063;Homosarien6cDNAFlJ14201 fls,ctcrw AT;AOans,none; 11.88 

A* 20558 * Hs.lSSieujSpiicrngfactor, argininefeerinerfch. 4; mn,hormonB_rec,zf.C4 ( siigar tr;; 11.85 
433470; AW9BQ584; ; transmembrane 4 superfamily member 1; none;TM=Y;SS=M; 11.60
HO 4251 18; AU07661 1; Hs.154672; methylene telrahydrofblala dehydrogenase; myb DMA- 

11521? *J!1 472I,4; Ha-89414; chemolone <C-XrC motif), receptor 4 t^a; 7tnru1,71n^2;TM=Y^S=M; 1 1.50 
426761; AI015709; Hs.172089; Homo sapiens mRNA; cDNA DKFZp586I2022 (f; none;TM=Y;SS=M; 11.48

^S^? tlP^IS^ Homo ^P^ 5 mRNA full lengKi thBort cON; I^H^CJMPDftN 1 C8S^te 9 rin^ t Rldn_0J«*n; 11.38 
446742; AA2321 19; Hs/I6XH5;puletto 

436729; BE621807; ; transmembrane 4 superfamily member 1; none;TM=Y;SS=M; 11.18
409960; BE261944; Hs.339673; hexokinase 1; none,none; 11.02
428539; AB011155; Hs.170290; discs, large (Drosophila) homolog 5; SH3,PDZ,Guanylate_kin;TM=M;; 10.78
417821; BE245149; Hs.82643; protein tyrosine kinase 9; cofilin_ADF;SS=M; 10.63
J D 427654; AA410183; Hs. 137475; ESTs; tocutrans,wrc l IGFBP l tsp_1; 1058 

418004; U37519; Hs.87S39; aldehyde dehydrogenase 3 family, member ; aldedh;TM=M;SS=M; 10.53
447365; BE383676; Hs.334; Rho guanine nucleotide exchange factor (; SH3,PH,RhoGEF;TM=M;; 10.53
?ES2S? Ha.100057; Homo sapiens cDNA: FU22S02 fis. clone K; norwione; 10.S2 
^ 436856; A1469355; Hs.127310: ESTs; pknsse p rnn;TM=Afc ta48 

OU 451035; AU076785;Hsw430;pta5fri 1 Obota);efhand,CH 1 /id«j|lrLN;SS=M; 10.38 

VJ%$ i? 27853 * Hs.83968; integrin, beta 2 (antigen CD18 (p95K V; lntegrin_B 1 EGF 1 P8l;TM=Y I -SS=W; 1035 
431890; X17033; Hs.271986; integrin, alpha 2 (CD49B, alpha 2 subuni; vwa,integrin_A,FG-GAP;TM=Y;SS=M; 10.34
440006; AK000517; Hs.6B44; hypothetical protein FLJ20510; AAA,NB-ARC,PAAD_DAPIN;NA;NA; 10.25
A* ^ 97101 I P uWve G pmte'Bi-coupIed reoeptan Methytlransf_6iTM=Y;BS=M; 10.18 

Q> 437852; BE001836; Hs.255897; ESTs, Weakly similar to dJ385012.1 [rfia; Gre.7trn_2;™^;; 1113 

V*fSl5r5? 31 . , 3 5 * :Ho,no sapfens ^"-related; EGF.frWLrecepU.1dUecepU.granuHn.BW ia08 
427700; AA262294; Hs.180383; spec^ phosphatase fi; Rrcdan^^DSP^TNtwi 10.05 

S!i ^S^S^JiS^l^^ *? m * n (PTPRF interac ^ SH3 1 i ff( DWnase ( PH. a pecWn,RhoGEF;TI^M;; 10.05 
Id ' : ENSP0^00224716*^TP^IrKfing protein SAR; none;TM=Y;; 10.01 

InSS ^^SSiS-SP 0 * Wj uncfo,t P roteta 2(zo ra occludens: SH3,PDZ 1 Guanyrate_Wn;TM=M;; 10.00 

404568; ; ; NM_022071*:Homo sapiens hypothetical pro; SH2;TM=M;; 10.00

444823; BE262989; Hs.12045; putative protein; MralJWBOATtTM^^Y; 9.93 

iVJ$ ^I?o D1564; Hs1W222 : hypotfiafeal protein FU1O702; emand,kazal 1 aif 1 ras ( 7trrLl;TM-M;;9.90 

413441; AI929374; Hs.75367; Src-like-adaptor; SH2,SH3;TM=M;; 9.90
424954; NM_000546; Hs.1846; tumor protein p53 (Li-Fraumeni syndrome); P53,WE>4aiRK;TM=M;; 9.88
452289; BE568205; Hs.28827; mitogen-activated protein kinase kinase ; pkinase;TM=M;; 9.65
^3 5Hlf?5S? HsJZ50fi18 : UL16 blridbg protein 2; ldLrecept_e l PKD,MHCJ;TM=M;SS=Y; 9.83 
SfJ38: S'i^ 8 ? PR"""**™™ Phosphatese. raceptorl; Y jl»sphatase;TM=Y;S^M; 9.80 

Q/\ SSS!Jl 9 i HsJ!128 : &d speclflchy phosphatase 5; Rhcdane^DSP^Yjjrwphdtase;TM=«;; 9.73 

452239; AW379378; Hs.170121; protein tyrosine phosphatase, receptor t; none,none; 9.73
418322; AA284166; Hs.84113; cyclin-dependent kinase inhibitor 3 (CDK; Y_phosphatase,DSPc;TM=M;; 9.72
403912; ; ; CSD00394^127372BO|iel|XPJD066B2jq ^^^.u^inrn,. *u 

427315; M179949; Hs.175563; Homo sapiens mRNA; cDNA DKFZp564N0763 (f; none,spectrin,SH3,PH,CH; 9.70
428423; AL037544; Hs.184298; cyclin-dependent kinase 7 (homolog of Xe; pkinase;TM=M;; 9.68
422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor t; kinesin,fn3,Y_phosphatase;TM=M;; 9.53
429053; Y09397; Hs.227817; BCL2-related protein A1; Bcl-2;TM=M;; 9.63
425322; U63630; Hs.155637; protein kinase, DNA-activated, catalytic; PI3_PI4_kinase,FAT,FATC;TM=M;; 9.56
430259; BE550182; Hs.127826; RalGEF-like protein 3, mouse homolog; fn3,RA,RasGEF;TM=M;SS=M; 9.50
428620; AA331901; Hs.184736; hypothetical protein FLJ10097; none;TM=M;; 9.50
418969; W33191; Hs.28907; hypothetical protein FLJ20258; SH3;TM=M;; 9.50

448913jAAl94422; Hs^2564; myosin VI; muzTI^BP,pkin^^^ Myosin N hZIPaf- 

414911; NM_000107; Hs.77602; damage-specific DNA binding protein 2 (4; WD40,homeobox,LIM;TM=M;; 9.48
451295; AI557212; Hs.17132; ESTs, Moderately similar to I54374 gene ; pknasevDAG P&Nr^ptfnase J),OPR.none; 9.45
402328; ; ; Target Exon; pkinase;TM=M;; 9.44
443710; AI928136; Hs.9691; Homo sapiens cDNA: FLJ23249 fis, clone C; G-alpha,none; 9.42
414987; AA524394; Hs.294022; hypothetical protein FLJ14950; SH2;TM=M;; 9.42

ni?^!? ll 8 ^^ 3 ^^ 0 ^^?? » *f*^*»** W APSjoi^eATP^lfurylase.PRK.Thyn^dylateJdniSS^ 9.40 
418827; BE327311; Hs.47166; HT021; none;TM=M;; 9.40
440675; AW005054; Hs.47883; ESTs, Weakly similar to KCC1_HUMAN CALCI; pkinase,none; 9.35
433376; AT249361; Hs.74122; caspsse4, apoptosis-ialaled cysteine pr, CARD,ICF^DlO 1 lCE ta D20vSS=M- 928 
410663; BE379794; Hs.65403; hypothetical protein; iiealh.TNFIU$TM*Y;SS=M; 9.25 
430024; AJB06780; Hs.227730; integrin, alpha 6; integrin_A,FG-GAP;TM=Y;SS=M; 9.23
452696; AI826645; Hs.211534; ESTs; ArfGap,PH,ank,Guanylate_kin,PDZ,SH3; 9.13
434263; N34895; Hs.44648; ESTs; ig,none; 9.13

407949; W21874; Hs^47057; ESTs, WeaWy similar to 2109260A B cell ; RlbosomaLSt^k, pkhase,death,none; 9.10 

42S332;AF030403:Hs.ig9263;Ste-20 related Wiiase; p^as^metah^rojM^SS^M; 908 
jL 3 417428; NMLO02291; Hs.82124: larnlnin, bets 1; larrfrfrt_EGF;eniWnJ^lerm l inlegrir^:SS=M; 9.08 

443951; F13272; Hs.111334;fBrriSn. Tght pdypaplide; PMP2^aaudm,none; 9,07 

414368; W70171; HsJ5939; uridine monophosphate kinase; PRK,CoaE;; 8.SB 

451292; AB037716; Hs.262Q4; KIAA1295 protein; SH3;TM=M;; 8.93
438000; AI825880; Hs.5985; non-kinase Cdc42 effector protein SPEC2; none;TM=M;; 8.90
446620; AA128808; Hs.179902; transporter-like protein; none;TM=Y;SS=M; 8.90
436075; BE090176; Hs.179902; transporter-like protein; none;TM=Y;SS=M; 8.88

437056; AJ147061; ; 0fcok33a11,5l Soares_MSF_FBJ9W^OTJ J A _P_S; none,spectrin l SH3.PH.CH; B.7B 

445496; AB0078B0; Hs.12802; development and differentiation ennancin; SH3,anfc,PliArfGH>;TM«Nfc 8J8 

418203; X54942; Hs.83758; CDC28 protein kinase 2; CKS;; 8.75
434608; AA805443; Hs.179909; hypothetical protein FLJ22995; none;TM=M;; 8.70
445033; AV652402; Hs.72901; mucin 13, epithelial transmembrane; ank;; 8.68
417640; D30857; Hs.82353; protein C receptor, endothelial (EPCR); none;TM=M;SS=M; 8.65
432841; M93425; Hs.62; protein tyrosine phosphatase, non-recept; Y_phosphatase;SS=M; 8.65
430397; AI924533; Hs.105SO7; bicarbonate transporter related protein ; HCO3_cotransp;TM=Y;; 8.64
448888; AW19S563; Hs.200242; caspase recruitment domain protein 6; CARD;TM=M;; 8.60

442994; Ai026716; Hs.16954; ESTs; ank,p^3se,<ieatrwWbosornalJS14; 8.60 

429109; AUM6637; Hs.196352; neutrophil cytosolic factor 4 (40kD); SH3,OPR,PX;TM=M;; 8.60

455439; AW94S4B4; Hs.1 84252; ESTs, Weakly similar to ALUBJ4UMAN AlU S; «one,7frn_1: 055 
a <- 437763; AA469369; Hs.5631; tissue inhibitor of metafopralBlnase 1 ; TlMP.pkinase, DAGJ'E-bind.RBD; 8.43 
H3 417035; AA192455; Hs.22968; Homo sapiens done IMAGE451939, mRNA se? nona.none; 8,40 

418478; U38945; Hs.t174; cycfewlependent kinase Inhibitor 2A (ma; ank- 6.39 

448209; AW160489; Hs.20709; tetraspan 5; transmembrane4;TM=Y;SS=M; 8.33
450139; AK001838; Hs.296323; serum/glucocorticoid regulated kinase; none,none; 8.33
446350; AF052112; Hs.12540; lysophospholipase I; abhydrolase;TM=M;; 8^1
427509; M62505; Hs.2161; complement component 5 receptor 1 (C5a l; 7tm_1;TM=Y;SS=M; 8.30

445633; AI453386; Hs.17287; ESTs, V^akV similar to S266B9hypotheti; IRK^ne; R2B 

446719; W39500; 14^301872; hypothelica! protein MGC4840; AAA,SKI;TM=M;; 8^3 

424762; AL1 19442; Hs.183684; eiAaTotte franslaCoii In Wen facton iwne,none; «.20 
S S SISJ 1 i!^£5L 1 i ^ 174142 ^ Coto| V s«mdatln9 factor 1 receptor, fo; \g.pkina5e;TM=Y;SS=M; 8.18 
33 416094; AW995512; Hs.225977; nuclear receptarcoacttvator 3; nonB,none; 8,16 

419262; AA834664; Hs.29131; nudB^ieceptcccoacbvHtor 2; PAS,zf-C2H^SET; ai5 

417386; AL037228; HsuB2043: D1235flna product; NUDIX^Y.EI.dehydraa^ 8.13 

431236; AV656840; Hs.285116; interleukin 13 receptor, alpha 1; fn3;TM=Y;SS=M; 8.10

439658; AA332D57; Hs.6539; hypothBScal protein MGC15440; none;TM^M^S«M; 6V09 
OU 425424; NM-004954; Hs.157199; ELKL mo$f kinase; pWnase t UBA,KA1;TM=H; 8.03 

x!?SS?? V s295726 : ]a *X* n > *Plw v (vltroneclin receptor; FG-GAPJntegrfn J^none; 8.05 

408912; AB01 10B4; Hs.48924; K1AA0512 gene product; AUEX2; Armadilto jeg;TM=M;S8=«; 8.03 

447674; BE270640; Hs.19192; cyclin-dependent kinase 2; pkinase;SS=M; 8.03
404691; ; ; Target Exon; none,none; 7.95
414279; M3301 16; Hs.77273; Human glucose transporter pseudogene; none,none; 7.93

462888; AW955454; Hs.30B42; ephrfn-B2; EphrIh f fh2;TM=Y;SS«M; 7.93 

433211; H11850; Hs.t2aOB; MARK; pkinase > UBA.KA1;SS=H 7.91 

438485; W57578; H&237955; RAB7, member RAS onccgene famKy; pWnase.ABCI.none; 7.90 
H(\ AA ? 78200; Hs.118727; Homo sapiens cDNA FU13692 fe, done PU Hm,death.TWFRjc6 1 Acyl-CoVUiydro: 7^0 

70 ?™£ ^ 820421 soluta ^fenlly 23 (nucleobase tra; ^njirjwnease.RA; 7^8 7 ^ 

TIB ^SyS?™ u 39101 H ? R2 ^^tyrostoe WnaseC^rtMi?.; Fi^lkfi^na 8 e,RecepJJomdn l YI^T^S^ 7.86 

405036; ; ; NrU.02t628*:Homo sapiens arachWonate Hp; lTpoxYflenase,wmpl9xU49Kd,PLATrrj^M;; 7^3 

7*5 ^S2?' SP2£i3 s ;? ^ ESTs ' simferto A56154 Abl subst f»ne^<^tfh 1 Acy!-CoA u <^^M^dh^ 7^0 
/3 432931; NM_002733; Hs.3I3S; protein kinase, AMP-acSvated, gaiYna 1 n; CeS^a-lrBnsiTT^M;; 7J8 kJV " n " , " rfUnu U ' 

li^c rei^^S^t 1104 ^ Hqto; ^PH^NF^lor^an ( WD40.1RK; 7.78 

446636; AC002563; Hs.15767; cilron (rhoHnteractrng, sennafthreonln; CNH.DAG_PE4 3 lruJtPliliwrfucrin 1 M;Tf^Wt; 7 78 
431 183; NM-OOBBSS; Hs.2S0696; KDa (LysJtep-eu-Leu) endoplasmic retfe; ErUumBruecepl;TM=rA-SS=M; 7.76* 

Hs ' 507 5S; SMC4 (structural melntenenca of chromoso; ABCjran,M,SMC_riSMC^C l DUF164,none; 7.73 
437192; AW975786; Hs.75355; ublquWrwonJugating enzyme E2N (homoio: Ua.con,Y_phosphatase,SH2; 7.70 
S?? 12: i;^Sy ) !? 995:Hofnosa P rBns Wereecffn 2 (IT; SH3,erHand 1 C^PH,RhoGEFtTM=M;;7.70 
441 190; H09073; Hs.25046^ ESTs; E1-F^Pa 5 e,C a t^TPase_^ 7.6B 
447898; AW969638; Hs.112318; 6.2 kd protein; none,none; 7.65

S ^?/^^ 1 5S?ii rtwferofi * gamma-inducfble protein IB: PAAD_DAHM1M;SS=M; 7.63 

O 401927; ; - C170009l4>ia:594367|ref|NP-058549.1| s; none; 7,50 

S^r^/ 7 ^?^ 0 ^ 07 - 81 Na - C ^6CB1 ^sapiens; rtto^SH^pKnas^POLO box.nane; 7^8 

1 U 421489; AI922B21; Hs^2433; ESTs: none,PI-PLC-X ( PI-PLC-Y 1 C2; 7.53 

Hs.241551; chlorftte channel, cate'um acUvaterf,^ ncoe;TM=Y;SS=M; 753 

SEX o^L 8 ^ 1 ^?^ 2 OTCO0e,l6: Rh "GEF > PH,CH.SH2,SH3^AG_PE^1hcJ f non a: 7.48 
444252; R21 135; Hs.54985; ESTs; nona/iane; 7.47 

a^S. HS : 93Z9: chrom050mfi 20 0P«» wading frame 1; none;TM=M;; 7.45 

20 U 9b ^o? 4 L fl NC, - CG/Vf, 7^ Homosaptens ; ^bitegria^FG-GAP^one; 7.45 

Hs L 34906: Weakly similar to T1 721 0 hypatheti; r^pWnase; 7U0 

30 SmKS 

JESS XEiE? 8,5113 * PTK6 prDteln ^ rosinQ Masse 6; SH2 1 SH3. P kJnas B ;TMsM;; 7^4 
2£5£I2£i J- 11 ? 1 ? Kiyirfd hormone recaptor-assodated pn* no^TM=M;; 7.23 

^ S/v^fi 2 /^ Bap ™ 5 PAKZ^iRNA. comptetecds; pfctoase,PED;TM=Mr: 7.19 
407581; NM_000910; Hs.37125; neuropeptide Y receptor Y2; 7tm_1;TM=Y;; 7.18

^ S 5SS2S SHS? 1 * Bprauly Pro^P^) twmoteo 4; SH2,SH3?TM=M;SS=M; 7.18 

403344; ; ; Nro 00341 :Hamo sapiens solute canter fee Elpha-amvlaserTM^Y- 7 i«? 
42T2(*X7852a;HU74l39^ 

iSS ^SSSSF ^^^rfamlly 2(fa C nilalBd"glu; suaarjrjTM^^M; 7.14 
iJSI 5 ; : » NMJ>228l9':Homo sapiens phospliollpase A2; pnoslfcSS=M; 7.11 
40 433B18; AA802539: Hs.345404; ESTs; G^^AjdMm; 7.10 

J^IS 7 ^^^i Sapfera f°^ FU21425fi 8l clone Q 8ulfa]Bjransp 1 STAS,HM6_box;7JlB 

Hs - 1717 ?* fouronalcalldeallKelaJed proton: TRID-31;TM=W;;7.05 
S«:^?^;2fS^ ll ^ W peptide receptor 1; 7lm_?.KRIACSD;TM=Y;8S=M; 7.03 

2S ESSE? hrofema-g growth factor, beta reoe^^jmScSwio™; 6.93 
2n£?i' H1251531 ; piDteasome (prosome. mscropefn} subunfi,; prctoasornecTM^M;; 6.93 

SSi2E?S fcaBaW * EST * P^Bse.haP^m^ojeg.noiK; 6.B8 

2»rS SSSSK? h «^P^ J^fl™P IVAfcytosolfc, ; C2,ttA?JB;TJ*=M; ; 6.85 

A *SSSS5J2S2! MBs ^P^^apcplM^ated cysteine pr; K:e^10JCEj20^;TM=mS 
W1S8;; ; Ehter^0024W:C^ ^ ' ' ^ U 

^r?iSS!P 1: rrit °9 en - at * vated P^teln kinase kinase ; pjtfnase;; R73 
ii.«^L^ WTOsap,ensmeteno,na artfi ^ MAGE;TM^ 6.73 

f^ 7 ^?* 1 ^ MGC4248; rijne.traKrnembrane4: 6.85 

f^055iAA312329;Hs.42331;ephrfn^4; 6phrln;TM=M;SS=M; 6.61 

Si ^u^^S 901 HomosapIan6cDNAfU13357fle l dona PL; nonage; 6j60 

' ' N ^-020975*:HcmD sapiens rei proto-ojiooge; cadhei1w>WnasB:TM=Y-SSsftA G 58 
418562; R60669; Hs.124831; CQ-67 protein; nor^^pl.AA^CT 

7S ^ : ^^^ Mte2:TNF ^ F1A ^ 550ci8ted vtodaaft domain! daaliTilHit: 6.45 
AW391B27; Hs.7946; KlAA^pioWn; Alra;TM=ftfc E44 

^^SSJ2«i : » S, l! 0OO9; «^-C0«ttyra« Aoxidasa 1. pantoyl; pkinaseLTIrrul; 6.43 

BE27453D ' Hs.273333; hypofhetfcai protein FU109B^ FGGY CTTM^ - &43 
432945;AU)43683; Hs^173; hy Aal pr£fn FuW S5«Wm 

Hs - 54828: ESTs: P*»^<2H2WW3jion»; 6.33 
S^tJJSSlS," 8 * 1258151 ESTb - Wea ** B ^ to 138022 hypothec llpoxygenasa.PLAT^none; 6 33 
4Z78Q1, AW979155; HS.23B275; amTnoacM lransportBr2; Aajraas;TM=M;SS=M; R32 
407970; AW403814; Hs.41714; BCL2-associated athanogene; ubiquitin,BAG,Tropomyosin;; 6.30
438464; AA669735; Hs.324743; protein phosphatase 4 regulatory subunit; none,none; 6.30
453977; AA866006; Hs.250427; ESTs; pkinase,P2X_receptor,E1-E2_ATPase,Hydrolase; 6.30
412491; W31589; Hs.73S57; RAB5A, member RAS oncogene family; ras,arf,PP2C;TM=M;; 6.30

413235; BE243445; Hs.75248; topoisomerase (DNA) II beta (180kD); DNA_gyraseB,DNA_topoisoIV,HATPase_c,DNA_gyraseB,DNA_topoisoIV,HATPase_c; 6.29
404342^ ;;C70O219r:g1[7299207}gblAAF544O4.1l{Ae0; none JM=^. 6^7 " 
409274; NM_003930; Hs.52644; SKAP55 homologue; SH3,PH;SS=M; 6.25

41SS93; AA133749; Hs.301350; FXYD domain-containing ion transport reg; ATP1G1_PLM_MAT8;TM=Y;SS=M; 6.23
405429; ; ; Target Exon; Y_phosphatase,none; 6.23
404975; AL042279; ; uncharacterized hypothalamus protein HTO; fcnngte;TM=Y;SS=M; 6.20
452929; AW354938; Hs.172816; neuregulin 1; Neuregulin,EGF,ig,Neuregulin,EGF,ig; 6.18

446BB3; AW452756; Hs.16364; Jiypofhefcrf protein FU10955; DEAD jielicase J^nm.lWr, Cysjknotjl Uvwa, vwc.vwd ! la i RHa,abhydrolase TGF- 

beta,DUF139.TPRDSPc,1spJ^rj^^ 
433233; AB040927; Hs.301604; KIAA1494 protein; SH3,zf-C3HC4;TM=M;; 6.15 
458731; BE615453; Hs.346609; dedicator of cyto-kinesis 1; none;TM=Y;; 6.15

421836; AF109219; Hs.10B787; phosphatidylinositol glycan, class N; none,none; 6.14

447727; AI421079; ; tumor necrosis factor receptor superfami; none,synaptobrevin; 6.13

401S36; ; ; NM_00253C:HomQ sapiens neurotrophic tyr; Ig.pftina^LRaiH^^CT^M-^M; 6.11 

OCi 1^?!I^ 14056fl: Hs - 143436 : ESTs- similar to PLHU pTasmln [H.: PAN.krfng^trypsln^l^LC-X^SHZ.W.SH^PLC-Y.PAN- 6.10 
427557; N^002659;Hs.l79657;pl3BnTlrfflp^adhrator,u 
450447; AF212223; Hs.25010; hypothetical protein P15-2; NTF2;TM=M;; 6.08
450107; AI873287; Hs.257812; ESTs; ICE_p20,DED;TM=M;; 6.05

418175; AW967054; Hs.206312; ESTs, Weakly similar to I38022 hypotheti; ^Cffl^BTB.K.^Synlaxin.rnriB; 6.05
408983; NM_000492; Hs.663; cystic fibrosis transmembrane conductanc; ABC_tran,ABC_membrane,PRK,Bac_export_3;TM=Y; 6.05
Z:> 43973B;BE246502;Hs.959B;serjra domain,™™^ 

427625; AF00B216; Hs.285013; putative human HLA class II associated p; none,none; 6.03

409430; R21945; Hs.346735; splicing factor, arginine/serine-rich 5; DSPc,Rhodanese,none; 6.03

446650; AB016825; Hs.15B13; solute carrier family 22 (organic cation; sugar_tr;TM^;SS=M; 6.03

405102; ; ; Cl500122(r«l]4469558Igb|AAD21311.1|(AF; DAG_PE-bind,PH,RhoGEF,DC1;SS=M; 6.03

400121;;;EosContro1;SH3,PH t Rhc<^ 

415327; H22769; ; gb:ym54c02.r1 Soares infant brain 1NIB H; SH3,PDZ,Guanylate_kin;SS=M; 6.03

404143; ; ; NM_002944*:Homo sapiens v-ros avian UR2 ; fn3,pkinase,DUF139;TM=Y;SS=M; 6.03

405531; ; ; Target Exon; PDZ,CARD,Guanylate_kin;TM=M;; 6.00

433363; AA584629; Hs.275163; non-metastatic cells 2, protein (NM23B) ; NDK,none; 6.00
427270; H47921; Hs.174139; chloride channel 3; voltage_CLC,CBS,none; 5.99

423774; 153064; Hs.1702; interleukin 9 receptor; none;TM=M;SS=M; 5.98

424124; AA335609; Hs.7589; ESTs, Weakly similar to A46010 X-linked ; pkinase,TBC; 5.96

411040; AF007393; Hs.177574; protein kinase, interferon-inducible dou; HLH;TM=M;; 5.95

423422; AO0Q5175; Hs.126425; NY-REN-24 antigen; Sulfotransfer,7tm_1,none; 5.88
453902; BE502341; Hs.3402; ESTs; none,none; 5.88

40963S; AA305729; Hs.18272; amino acid transporter system A1; Aa_trans;TM=Y;; 5.86

436154; AA764950;Hs.119B9B; ESTs; efhand.!W<U 3 E-l^ > DAGI^ 1 P}tD l DAGKc t PSI J jior>e; 585 

455359; AWS02641; ; gb^V3-NM1 024.100500-1 81-dDB NN 1024 Homo; Su^a^.Scmatorr^^Phosphodtestno^ 5.83 
- 41410S; AI267592; HbJ5761; SFRS protein kfnase 1; anKPaOxystero! BP lP Wna3a:TM=M;; 5.82 

45 452547; AA335295;H5.74120;ad P osespec^^ " 

415204; T27434; ; gb:hbc2294 Human pancreatic feist Homo s; Na_Ce^Ex.Calx-beta 1 nonfl; 6.80 

435S63; AF210317; Hs.95497; solute carrier family 2 (facilitated gtir; 6UgarJr;TM=Y^ 5.78 

456097; C15702; Hs.288028; ESTs, Moderately similar to J54374 gene ; dsrm,FKBP; 5.78
422445; M23114; Hs.1526; ATPase, Ca++ transporting, cardiac muscle; E1-E2_ATPase,Cation_ATPase_C,Cation_ATPase_N,Hydrolase;TM=Y; 5.77
436246; AW450953; Hs.119991; ESTs; none^DMA.iyre^.DMA.torrf^ 5.77

422953; AA488B60; Hs.245043; hypothetical protein FLJ14297; ABC_tran,PRK;TM=Y;SS=M; 5.75

425654; AA74919D; ; ESTs; RhoGAP,SH2,pkinase,POLO_box,none; 5.74

424160; T7406^ ; gbvc81f0l.r1 Soares Infant braki 1MB H; ROKnone; 6.70 

4173B9; BE260964; Hs.62045; m |dWne (neurlfe growth-pro fnt*"ng factor PTN MK:TM-M;SS=Y; 5.69 
55 437613; R1S892;H8.10267;MIL1 P rotBrn;none # none; 5.68 

410B2O; BE391493; Hs.16476; Human DNA sequence from clone RP5-852M4; TBC;SS=M; 5.68

423393; R37772; Hs.21420; p21-activated protein kinase 6; pkinase,PBD;TM=M;; 5.66

426500; NMJM638; Hs.170156; KIAA0450 gene product; C2,PI-PLC-Y;TM=M; 5.63
406930; U04891; ; gb:Human olfactory receptor (OR17-219) g; none;TM=Y;SS=M; 5.60
OU 401(m;;;T«pMB(orKnonB 1 lCE_p20JCE_piaCWD,Pep 

42B479; Y00272; Hs-334562; cdl division cycle % 61 to S and G2 to; ptineso JCEj10 1 ICEd20;TM=M;SS=M; 5l55 

«t970; AF227156; Hs. 110 103; RNA potymerase I transcription fiactorRR; ' ' 

4262485 T1896B; Hs.29366B; ESTs; pkinase,none; 5.50

416426; NM_003804; Hs.296327; receptor (TNFRSF)-interacting serine-thr; pkinase,death;TM=M;; 5.43

41708$ AA194446; ; ESTs, Weakly similar to S56024 nebulin, ; anMe3m.2U5.EG? Jmngfctre^ 5.43

447437; U07225; Hs.339; purinergic receptor P2Y, G-protein coupl; 7tm_1,SH2;TM=Y;SS=M; 5.40

412247; AF022375; Hs.73793; vascular endothelial growth factor; PDGF;SS=M; 5.40
-7H 2i!^^JE 1 « 8: ^ S n B ll 5: Homoaa ^ns, clone MGC16169, mRNA, com; P Wii a5 e,TBC 1 Rhod a nese;TM=M;; 5.36 
fl) 419355;M42B520;Hs.90aBl:progestetOTeWnoTngpro^ 

411180; BE161168; ; gfcPMO-KT 0425-170100*02^10 HT0425 Homo; adanytatoktnase.none; 5V35 

422461 ; NM_003417; Hs.117077; iinc finger prolefn 26<- 2f-C2H2 1 KRAB,TI : «S;TM=M; : 5^8 

426348; BE466586; Hs.17433; hypothetical protein FLJ20967; none,none; 5.25

7 ^ SnI75 1?!^? 01 ^" 5 - 23531 dual s P edfcB y P^«P^atese 4; Rhodanesa, DSP(^Yjptio fl phBtose.Rfbosom B LS3JN:TM=W;; 5.23 
/5 429592; AB029041; Hs.209646; KIAA1118 pratd^Troponb.Exo.erido^M.tQTwJM;; 5.22 

^l : ^™JLU^~^ ; J Hum8 ! 1 p4,teav8ribosomal P"**>S1 rnRNA; ErUuir«nLjecertRibos«naUl1,Wx«^ 
434368; AW519020; Hs.73853; dopamine receptor D2; pkinase,SH3,none; 5.15
405586; ; ; NM_000299:Homo sapiens plakophilin 1 (ec; Armadillo_seg;TM=M;; 5.13
40B176; AK001553; rtU3436; adenylate kinase 3 alpha like; adenylatekinase,none; 5.12
407443; AF227138; ; gb:Homo sapiens candidate taste receptor; none;TM=Y;SS=M; 5.11
416817; AA33BD45; Hs.104679; ESTs; Furin-like,pkinase,Recep_L_domain,fn3,none; 5.10
401B86; ; ; NM_021783:Homo sapiens XEDAR (XEDAR),; TNFR_c6;TM=M;SS=M; 5.08

410314; AW860708; Hs.18851; hypothefcal protein FLM0875; mybjm^indn^PAHBAH.bromodomatn.PHa 5.0a 
401579; AL031447; ; Homo sapiens, clone IMAGE:4053044, mRNA,; Neur_chan_LBD,Neur_chan_memb,none; 5.05

445873; AA25G970; Hs.251946; poly(A)-binding protein, cytoplasmic 1-1; PABP,rrm,pkinase,14-3-3; 5.05

417529; AA203634; ; gb:zx58b09.r1 Soares_fetal_liver_spleen_; pkinase,UBA,KA1,none; 5.03

417527; AA203524; ; gb:zx56e10x1 Soares_fetal_liver_spleen_; SH3;SS=M; 4.90

407722; BE252241; Hs.3B041; pyridoxal (pyridoxine, vitamin B6) kinas; pfkB;TM=M;; 4.96

431321; AW136372; Hs.1852; acid phosphatase, prostate; acid_phosphat,none; 4.93

404298; ; ; C6G0123B*:gl|1 21715tsplP26697|GTA3_CHlCK; ncrie.GSTJXGSTJY.rMnase; 485 

407603; AW95570S; Hs.62604; Homo sapiens, clone IMAGE:4299322, mRNA,; none;TM=M; 4.82

424095; AF071202; Hs.139335; ATP-binding cassette, sub-family C (CFTR; ABC_tran,ABC_membrane;TM=Y; 4.73

422356; TB3882; Hs.97927; ESTs; pkinase,none; 4.64

424905; NM_002497; Hs.163704; NIMA (never in mitosis gene a)-related k; pkinase;TM=M;; 4.48

425925; NMJ101196; Hs.3156B9; Homo sapiens cDNA: FLJ22373 fis, clone H; BterasB,eralaae 1 PBpKd3se^S9;TM=M;; 4.45

439606; W79123; Hs.58561; G protein-coupled receptor 87; 7tm_1;TM=Y;SS=M; 4.40

400749; ; ; NM_003105*:Homo sapiens sortilin-related; EGF,fn3JdlrecerA.aJdljBoerj^ 4.38

425721; AC002115; Hs.159309; uroplakin 1A; transmembrane4;TM=Y;SS=M; 4.33

444006; BE395085; Hs.100B6; type I transmembrane protein Fn14; WLrecept - a > PKD.MHCJ;TM=M;SS=Y; 4.31

400751; ; ; NM_003105*:Homo sapiens sortilin-related; EGF,1r)3#UecerAJJ*ILrecepl^ 4.18

408908; BE296227; Hs.250822; serine/threonine kinase 15; pkinase;SS=M; 3.90

4221 52; AA909249; Hs. 1 1 2262; solute carrier family 30 (zmc transport; nomyione; 3,86 

458760; AI498631; Hs.1 11334; ferritin, light polypeptide; rjystadn.ferritfn.hlstor^HCX^^ 3.85 
441218; BE327561; Hs.202345; ESTs; none,WE>40 1 E1-€2^TP8se > CarJonJVTPase C p CarJorLATPase_N,Hydrolase; 3.78 
419073; AW372170; Hs.1 8391 8; Homo sapiens cDNA FU12797fis,done NT; desth,ZU5;SS=W; 3.76 

451385; AA017656; ; gbze39h01 j1 Scares retina N2b4HR Homo ; AJtophlri-I.ejiotase^tro^ 3.60 

412604; AW976324: Hs.1904; protein Idfiase C iota; prdr^DAG_PE-Wnd p pWnase_C l OPR;TM=M;;3.30 

409582, R27430; Hs.271565; ESTs; TOne.Nei^charU^^rjiharunemb; 128 

441155; AW161D08; Hs.7719; GABA(A} receptor-associated protein; MAPI J_C3;&&=M; 3,23 

436740; AWB75133; ; gb:EST387239 MAGE resequences, MAGN Homo; nope.EPR Jbd.fn3.pWnase.SAW; 3.20 

418319; AW611703; Hs.190173; ESTs. Weakly Mark) A4601OX4inked ; rwnejRK: 3.20 

409744; AW675258; Hs.58265; Homo sapiens piRNA; cDNA DKFZp586P2321 ft none; NA;NA; 3.13 

418764; N30531 ; Hs.42215; protein phosphatase 1, regtialory auburn; nor^none; 3.10 

400846c ; I aortffin-reiBied receptor. L(DLR class) : EGF,fh3,WI jecepLa.lo1jececA>.flraimlii 1 BNi^Tl^Y:SS=M; 3.09 
422005; BE266556; Hs.1 10702; Ho™ sap»erwmRNA:c^^ 

426440; BE3B275&. Hs.169902; solute carrier family 2 (facilitated gb; sugarJnTiyhY^M; 3.02 

424187; AA335561; Hs.17257; ESTs, Weakly stmBar to S26689 hypomef; lRK.nonB; 298 

425852; AK001504; Hs.159651; death receptor 6, TNF superfamily member; death,TNFR_c6;TM=Y;SS=M; 2.93

401279; ;;C13000351^i[2494033isp|Q5439B|KDGDJrilES; nons.none; 2.68 

406671; AA129547; Hs.285754; met proto-oncogene (hepatocyte growth fa; Sema,pkinase,TIG,PSI,none; 2.83
447081; Y13896; Hs.17287; potassium inwardly-rectifying channel, s; IRK;TM=Y;; 2.80
453619; HB7648; Hs.33922; Homo sapiens, clone MGC:9084, mRNA, comp; pkinase;TM=M; 2.75
441699; AW51112S; Hs.127572; ESTs; none,Aa_trans; 2.73

45B781; AI444821; Hs.630B5; ESTs, WeaWy similar to MPP3JKJMAM MAGUK; SH3.PDZ.Guanylata Wn,L27;TM=M^ 2.73 

446913; AA430650; Hs.1 6529; transmembrane 4 superfamlly member {tetn trarsmembran84;TM=Y;SS=M: 2.70 

453487* R31770; Hs.56562; ESTs; 7tm_1 l nonec 26B 

421279; AW664B78; Hs.106645; ESTs; pkinase,none; 2.68

419720; AA249131; Hs.337778; hypothetical protein FLJ11068; none,none; 2.65

452345; AA293279; Hs.29173; hypothetical protein FLJ20515; DSPc;TM=M;; 2.63

422247; U18244; Hs.113602; solute carrier family 1 (high affinity a; SDF;TM=Y;; 2.62

425212; AW962253; Hs.171618; ESTs; pkinase,none; 2.60

427344; NMJ1008S9; Hs.2142; 5-rr/draxytjyptarnfr\e (serotonin) receptor; Meur_chBnJBD^Ieur chanjromb;TM=Y;SSs=M; 256 

423629; AW021 173; Hs.18612; Homo sapiens cDNA; FU21909 fls, done H; voltage CLC.CBSjane; 255 

456737; BE247203; Hs.1 24831; CGf-67 protein; abhydrdase_2;TM=M;SS=M; 253 

421585; U95626; Ks.302043; cliamoWne (C-C moQf) reoeptor-lke 2; 7lm_1;TM=Y;SS^t 253 

424Q2B; AF055084: H5.153692; Homo sapiens cDNA FU14354 fis, done Y7; none.none; 250 

448324; Al57t356; Hs.34174; ESTs, Moderately similar to ALUB_HUWAN A; lCE_p2aCARD,lCE_p10^one; 250 

402256; ; ; Target Exon; pklnase.UBA/ione; 243 

452256; AKM0933; Hs.26661; Homo sapiens cOMA FU10O71 fls, done HE; GOl.7tnu1,none; 240 

419169; AW851980; HbJ262346; ESTs, Weakly similar ta S72482 hypotheB; none,BpacWh,SH3,PriCH; 240 

420634; S42457J Hs.1323; cydic nucleofirJa gated channel alpha 1; cfW_b1i^g,tonJtans;TM=Ys 2^5 

419630; W57756; ; S hczd20g10.r1 Soares JeteLtea/LNbHHI ^ zf-C3HC4none; 2^5 

426227; U67058; Hs.154299; Human proteinase activated receptor-2 mR; 7tm_1;TM=Y^S=ftI; 235 

400704; ; ;Target Exon; nrjL^han,SBPJ>a^ANF^re t »|toT^Y;SS=^ 233 

400149; ; ; Eos Control; adoLphospha1;TM=Y;SB=M; 230 

459327; AW149706; Hs.7850; glxxW1d02jt1 Na^OGAP_Bm50 Homo sapien; PHD,PWWP,SET r pkmase^; 230 
45222D; BE156006; Hs.212296; ESTs; integrin_A,FG-GAP,none; 2.25
416690; H84076; Hs.108551; ESTs; pkinase,none; 2.23
408354; AI3B2803; Hs.159235; ESTs; none,none; 2.23

452203; X57522; ; transporter 1, ATP-bindmg cassette, sub; ABCJra*,ABC_meiriKara,SRP54T^ 221 
405093; ;; Ct2001 101*^l|7522643lpir||T32733 AWPAg; none,none; 220 
412723; AA648459; Hs.335951; hypothetical protein AF301222; none;TM=M;; 2.20
416738; AW3&B633; Hs.6662; solute carrier family 7, (cationic amino; none,none; 2.18

417185; NMJ0O2484; Hs.814G9; nucleotide biirjlng protein 1 (Ecoll Mn; ParA,rer4JJirH.AraAJVTPasa;TM=4fc 218 
433222; AW514472; Hs.238415; dickkopf (Xenopus laevis) homolog 4; none,PHO4; 2.18
413627; BE182082 Hs.24B973t ESTs; AnTBdiao_seg.1BB;TM=fcfc 2.18 
407415; AF073328; ; gb:Homo sapiens tetracycline transporter-; none,none; 2.15
450592; AI701555; Hs.202562; ESTs; pkinase,none; 2.15

428767iAl42l972;Hs.98802; ESTs, Moderately similar to T14342 NSD1 ; none.pklnase.lg; 215 

429012; AW629596; Ha. 194726; BO^assocfated amanogana 4; Sm,BAG;SS=M; 215 

419122; AI401360; Hs^44410; ESTs; ABCJrar^BCjnembrane,none; 2.10 

446420; AW015693; Hs.135614; ESTs; tonJrans.nons^OS 

420076; AA827B60; Hs.293717; ESTs; DUFS&pMnase; 205 

409415; AW38B359; Hs.10667; ESTs; haismembrane4;TM=Y;SS=M; 203 

426766; AA477989; Hs38B0a ESTs; TPR.7tm_l; 2.03 

427001; NM_006482; Ha.173135;duaVspedricJty tyrosine-(Y>pho5phoryi; pWnase;TM=M;; 203 
45370$ AL119133; ; protein kinase C substrate 80K-H; none,histone; 2.03
423341; AW242394; Hs.108660; ESTs; none,none; 2.00

456772; AW604297; Hs.129711; hepatitis A virus cellular receptor 1; ig;TM=Y;SS=M; 1.88
42778ft BFJ4Q7863; Hs.256B71; ESTs; none,FG-GAP,7tm_1; 1.65
423508; AW604297; Hs.129711; hepatitis A virus cellular receptor 1; ig;TM=Y;SS=M; 1.00
447993; AW139525; Hs.170362; ESTs; none,none; 1.00

TABLE 56B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

414883 637 V2 AF274943 BG4M834 A171 3075 AA908783 AI9351 50 AI422691 AA910644 AA5831 87 BM2721 67 A1B2B99B AA527373 AW972459 AI831360 AA772418 
A1033892 AA10Q926 AU154749 A1459432 A1423513 AI094597 AA740817 AI991988 AI090262 AI312104 BI256707 AA459522 AA416871 A)Q75239 
A1339996 AA701623 AU39549 AJ33G8BQ AA633B48 A1989380 AJ362835 AA399239 AI146965 BF514270 N92892AI348243 AI278Q87 AA459292 
AI494230 BF507531 AJ492600 AA962596 AW613002 AA29314Q AA235549 BF108854 AA954344 N496B2 AI4571 00 AW589407 AW3Q0758 BE22071S 
BE22Q698 BE569Q91 BM009647 BF9003S1 A1537692 AI203723 AI85757S AA584410 AW371667 BM172363 

438091 22448J AK054860 AV652198 AV652192AV6S213BAV652127AV652194 BE935913AV6S2017 AV651995 AVB51548AV546053 AV6519B5AVB461B4 
AV646179 AW8BO409 AA345002 BF155183 BED68931 X56197 AIJ5Q3014 AW953629 BM263546 BE550772 M7O1084 A16B1352 AA35B689 
AW938841 BF438147 W0S391 H75313 BF326185 AV646335 AV6515B9 AV646340 AV651992 AV646384 AV646364 AV6B7497 BF15S183 AV646370 
AVW97876 AJ906821 X56196 BE833B35 AA626440 BE633808 BF224205 AA709126 BE673807 AI923886 AA947932 AI2761 25 AH 85720 AW51 0698 
AA987230 BE46770B AWB98623 AW898544 AM469S4 AW043642 A128B245 AI1 B5932 A1635262 AI139455 AJ29B739 A1813854 AI02476B BE699445 
BE699444 AI707807 D52654 AI2145W AI004723 AI6980B5 AW087420 AI565133 AA845571 AW898622 BF110144 AW513280AI081 126 BF3B2770 
AI268939 AM35B18 BF475318 AI024767BE174213 AA757598 AA613019 AA902959 A1BS0794 AJ3347B4BF108411 BM31 0532 AVU51 3771 AI951391 
AI337671 Bf 095606 BF095B01 BF095488 AW890091 BR395753 AW243400 AWB98607 AW898616 BF362762 AI922204 AW898625 BE699468 
BE17419B AW102923 D52715 BE699456 D52477 D55017 BF955933 BG623563 AV646254 AA463522 B1003244 AJ29919Q W40186 BE174210 
BF939Q91 BF434180 AW579001 T55662 H01B11 T52S22 BF945037 BF955938 D54G79 D53933 R67100 BG925552 BF999056 RB3430 Z29922 T85791 
W03942 H63289 AID91537 BF0B6583 AA345570 H48B70HB07207B3523 BI039626 BI037700 R00353BF1 55184 N98343 N79072 H01B12T55581 

433470 6624.1 X75684 AL573167 A1445461 AJ4S3743 AI963655 A15B4644 AA9771 80 AI6941 11 AI591355 AW071625 AI67871 2 AI72Q939 A1927769 BE43979B 
AIS63432 AA292B55 AW192593 AI86583B A1&96905 AI424384 AK6131 2 A191 1921 AI597801 BI494959 A12409B8 A1492554 AW262737 BE044033 
AW00B570 AW629S05 BM94958 AA0B8439 AA706057 BF222820 BF593608 BE501957AA524526 BE044134 AW572531 AW0 15724 BE3481B6 
, AA043217 BE219784 A1799B14 AA129575 AI671727 AM70033 BE6461 95 AW779725 AA903050 AA14722B AA40457O AI075878 W38161 AI972739 
AWB73152 AA723200 C08123 BF057I47 AA6276B6 AA157944 AI99Q245 AA662517 T32487 AI800106 AI333170 AI859160 W45410 AI990827 
AW27504S M1B2640 AA478328 A1293935 AW08S15S AW471421 AW103470 AW300456 AW191997AI823466M962397M13665BAJ251817 
AW339104 AA724739 AA41 1 1 00 AA191349 AA757735 AA03769B AI769516 AW772283 AA0 10631 AI6B2646 AI0610B5 H80983 R79933 AI950B93 
AI24S632 AI349390 AA1 48284 A1798502 AA487893 AI621320 AW194272 C06355 AA953883 BE858935 AI91B523 AI87262B AJ927217 A1453453 
A118B366 AW338678 AI2813S9 AI500576 BF477735 AI032569 A1972899 AI985583 Z28771 AI363829 AI593030 AAB03586 BE7734B8 AW339301 
BE773489 BE773462 BE773495 AI650338 BE773499 A1745717 BE81 1475 BE811470 BE81 1464 BE811418 BE811415 BE81 1400 BE81139B 
BE8113B8BE811352BE773501 BE773494 BE773486 BE773474 BE773473 BE773470BE773461 BE81 1 350 BE81 1337 BF593847 BGQ55071 
AW675302 BF0O3O68 AA719173 BEB11348 AI5B2462 AI666240 BE773500 AI244845 AI565439 AI918453 AI472527 AI446740 AA035576 AA1 9141 4 
AW674145 C05782 AI5B9264 D57558 AI48B237 A1432033 AA989662 R21752 BF002457 AA988297 AL574095 AL57B200 AL571074 AL574525 
AI578B10 BG4SB3B1 A1928364 BE879732 AA479834AA479712C17732 BM091258 BF843901 AW8Z0230 C17478 BE327120 AA1 29574 AA13B645 
BF843900 AW80B193 AA5D2832 AA649494 AL568520 AL547S60 BE706937 BE811360 BE773498 BE8H40t BE773484 8E811437 BE81 1380 
BE811399 BF997171 BF757734 BE326037AI377595C0B111 AW08B968 BEB11404BEB1 1472AJ869912A1925607AI871950AI093510BE905927 
BEBU435 AA191387 AW772000 BEB11453 BE614379BFB44522 BI0448S6 A1744233 AW9B4527 C17504BFB43883AI248307 BE7734S3 A15679Q5 
W60075BF941183 AI73BB44 BE811458 BE773481 AI262930 AA948565BE706942SE1 55380 T65026AW24295B AW1 97954 BE905184AA722208 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
GwiBOilp may such that the ratio of "average" prostate cancer to "average" normal adult tissues was greater than or equal to fl ? Th^^^^» nr^^^!?^ . ,

TABLE 59A: ABOUT 1170 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of prostate tumor to normal adult body tissue

Pkey ExAccn UnigeneID Unigene Title R1

L22524 Hs.2255 matrix metalloproteinase 7 (matrilysin,

AA535210 Hs.171995 kallikrein 3, (prostate specific antigen
S39329 Hs.181350 kallikrein 2, prostatic

NM_006159 Hs.79389 nel (chicken)-like 2
AF109298 Hs.1t82Sa prostate cancer associated protein 1

AA80469S Hs.82547 retinoic acid receptor responder (tazaro

AF179274 Hs.22791 transmembrane protein with EGF-like and
431958 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta 41.9

409262 AK000531 Hs.52256 hypothetical protein FLJ20624 41.1

423582 BE000B31 Hs.23B37 Homo sapiens cDNA FLJ11912 fis, clone HE 40.5
439176 AI445444 Hs.190394 ESTs, Weakly similar to B26096 Bne-1 pr 39.3

416585 X541B2 Hs.793BS leiomodin 1 (smooth muscle) 39.0

447725 AL137S38 Hs.1936B matrilin 2 38.6
428819 AL135B23 Hs.193914 KIAA0575 gene product 34.3
432101 AI918950 Hs.123642 EphA3 32.7

427398 AW390020 Hs.20415 chromosome 21 open reading frame 11 32.4

426716 NM_006379 Hs.171921 sema domain, immunoglobulin domain (Ig), 32.4
446100 AW967109 Hs.13804 hypothetical protein <U46202&2 27.4
434078 AW880709 Hs.2S3683 chromosome 8 open reading frame 4 27.4

442670 N45018 Hs.87e9 hypothetical protein DKFZp761J17121 27.3

403047 NM_005656*:Homo sapiens transmembrane pr 27.2

407709 AA456135 Hs.23023 ESTs 27.0
415SB9 AI267700 Hs.317584 ESTs 26.8
437179 AA393506 Hs.300G42 serologically defined colon cancer antig 26.4
444484 AK002126 Hs.11260 hypothetical protein FLJ11264 26.2
448106 AI800470 Hs.171941 ESTs 25^
407910 AA650274 Hs.41298 fibronectin leucine rich transmembrane p 25.2

42B862 NM_000346 Hs.2316 SRY (sex determining region Y)-box 9 (ca 24.9

416961 AW967646 Hs.23023 ESTs 24.8
rVJ\AJf uuij


Hs,12017 


427707 


NM 00557A 


u, 1 Am aft 


453308 






442607 


AARD7S7fi 


14c 7fln^Ki 


433927 




ITS. 1 ItK-D/ 


443162 


T49951 




412490 






424580 


AA446539 


14c ^^Qfl9it 


443912 


R372S7 


no. lOTiov 


410297 


AA1dK71ft 

rl/i I to/ (V 


I4v 7CKHJ 


431448 






432306 


T lOAJr 




451652 


nnUlwTO 




406038 


Y14443 


He RR710 


408784 




i4c nwta 


453510 




rKMhcKn 


407894 


AJ278313 


Hs.41143 


43390B 




Mr 1R707*; 
n&. 19/9 to 


440529 


AW2 07640 


rij. 1 0410 


413249 


AF1671GD 

ill lUf IUU 




408430 


§79876 




430665 


BE3S019? 




447752 


ml Ot UU 


14c iriraia 


426044 


AA502490 




403362 






427982 






407634 






448362 


AAR417R7 




433226 




Uc QAtA 


427472 


AA59M3Q 


ns. ■ J i zau 


453661 


oJU^OOOd 


MS.JUlZu 


456082 






409264 


NNL014937 


He S2463 


43272S 


AKD00292 


Hs.278732 


422270 


AF114494 


Hs.114062 


410339 


AJ916499 


Hs.29B25S 


431992 


NMJ»2742 


Hs.2891 


424432 


AB037821 


H&.146B56 


431933 


A1187057 


HU132554 


440749 


W22335 


Hs.7392 


418434 


AW163045 


Hs.79334 


450382 


AA397658 


Hs.60257 


407021 


U62077 




413786 


AW613780 


Hs.13500 


437866 


AA156781 


Hs.74170 



hypothetical protein MGC2747

RAB1B, member RAS oncogene family

NM_004930*:Homo sapiens capping protein

Homo sapiens clone OT1P1B6 mRNA, CAG rep

ESTs 

ESTs 

ESTs 

peroxiredoxin 3

Homo sapiens HUT11 protein mRNA, partial
ESTs

glycoprotein M6B

sterile-alpha motif and leucine zipper c
hypothetical protein FLJ21986
ESTs
ESTs

six transmembrane epithelial antigen of
ESTs, Weakly similar to ALU1_HUMAN ALU S
ESTs

protein tyrosine phosphatase, receptor t

ESTs 

ESTs 

Homo sapiens prostein mRNA, complete cds

ESTs 

ESTs 

ESTs 

KIAA0217 protein 

ESTs, Weakly similar to ALU4_HUMAN ALU S

hypothetical protein FLJ20729

thrombospondin 1

prominin (mouse)-like 1

claudin 12

ESTs

hypothetical protein FLJ13732
ESTs

chromosome 1 open reading frame 21
chromosome 1 open reading frame 21
pleiomorphic adenoma gene-like 1
ESTs

homolog of yeast ubiquitin-protein ligas
LIM domain-containing preferred transloc
ESTs 

Homo sapiens cDNA: FLJ22696 fis, clone H

small nuclear protein PRAC

DKFZP434G032 protein

Homo sapiens cDNA: FLJ2252B fis, clone H

ESTs, Weakly similar to A46010 X-linked

ESTs 



hypothetical protein DKFZp5640l27a
protein phosphatase 1, regulatory (inhib
ESTs

zinc finger protein 200

ESTs 

ESTs 

phosphoinositide-specific phosphatase
ESTs

Homo sapiens cDNA: FLJ2171B fis, clone C
DEAD/H (Asp-Glu-Ala-Asp/His) box binding
dipeptidylpeptidase IV (CD26, adenosine
ESTs, Weakly similar to I78885 serine/th



ESTs 

NM_001615*:Homo sapiens actin, gamma 2,
KIAA1073 protein

UDP-GlcNAc:betaGal beta-1,3-N-acetylgluc
hypothetical protein DKFZp564L0«4 slm?
KIAA148B protein

transposon-derived Buster3 transposase-l
ESTs, Weakly similar to NUCL_HUMAN NUCLE
ESTs, Weakly similar to ALU1_HUMAN ALU S
KIAA0965 protein
hypothetical protein FLJ20285
protein tyrosine phosphatase-like (proli
ESTs

protein kinase C, mu
protocadherin 10
ESTs

hypothetical protein MGC3199
nuclear factor, interleukin 3 regulated
Homo sapiens cDNA FLJ1359B fis, clone PL
gb:Human mariner1 transposase gene, comp
ESTs

metallothionein 1E (functional)



15.9 
15.9 

15.8

15.8
15.7
15.7
15.7
15.7
15.7
15.7
15.7
15.6
15.6
15.6
15.6
15.6
15.6
15^
15^
15.5
15.5
15.4
15.4
15.4
15^
15^
15^
15.3
15.3
15.3
15.2
15.2
15.2
15.2
15.2
15.1
15.1
15.1
15.1
15.1
15.1
15.1
15.0
15.0
15.0
15.0
15.0
15.0
14.9
14.9
14.9
14.8
14.8
14.8
14.8
14.8
14.8
14.8
14.7 
14.7 
14.7 
14.7 
14.7 
14.7 
14.7 
14.7 
14.6 
14.6 
14.6 
14.6 
14.6 
14.5 
14.5 
14.5 
14.5 
14.5 
14.5 
14.5 
14.5 
14.5 
14.5 
14.5 
14.5 
454078


AA60151B 


Hs.22209 


411990 


AW963624 


Ks.31707 


443280 


AA299688 


Ks.24183 


416836 


D54745 


Hs.80247 


436860 


H12751 


Ha.5327 


425174 


D87450 


Hs.1 54978 


442242 


AV647908 


Hs.90424 


443780 


NM_003500 


Hs.9795 


418994 


AA29S520 


HS.8954S 


419563 


AA526235 


Hs.193162 


43771S 


AI927288 


Hs.196779 


41282a 


AL133395 


Hs.7462t 


432712 


A9016247 


Ks.288031 


433332 


A1367347 


Ks.44898 


451533 


NM 004657 


Hs.26530 


424701 


NM_005923 


Hs.15198B 


446494 


AA463276 


HS.28B905 


456506 


AA502764 


Hs.123469 


432251 


AW972S83 


Hs.232165 


426423 


NM_012446 


HS.169B33 


442281 


N34742 


Hs.170065 


442145 


A1022650 


Hs,B117 


41flB4d 


AI820961 


Hs.193465 


447072 


061594 


Hs.17279 


439518 


W76326 




443967 


AW294013 


Hs.200942 


424775 


AB014540 


H8.15302B 


4111S0 


AA306342 


Ha£9t71 


447384 


AI377221 


Hs.40528 


444BBO 


AW118683 


Hs-154150 


433409 


AI276B02 


Hs.25661 


423201 


MM_000163 


Hs.125180 


413724 


AA131466 


Hs.23767 


454219 


X75D42 


Hs.44313 


429165 


AW0098B6 


Hs.11825B 


430598 


AKD01764 


Hs.247112 


444172 


BE147740 


HS.10455B 


416774 


AI005169 


HaJZ8274 


410310 


J02931 


Hs.62192 


429738 


NMJXJQ390 


Hs.192803 


400296 


AA3D5627 


Hs.139336 


407939 


W05609 


Hs.312679 


452261 


T93500 


Ks.28792 


40B622 


AA056060 


Hs.202577 


428054 


AI94868B 


HsJ266619 


444636 


T96667 


Hs.17677 


416292 


M179233 


Hs.42390 


442315 


M173992 


Hs.7956 


452260 


AA45320B 


Hs.28726 


428966 


AF059214 


HS.1946B7 


450316 


W84446 


Hs.226434 


419839 


U24577 


H3.933Q4 


443634 


H73972 


Hs.134460 


449474 


AA019344 


Hs.2055 


422278 


AF072873 


Hs.114218 


445817 


NM_003642 


Hs.13340 


434629 


AA769061 


Hb,4Q29 


416795 


A1497778 


Hs.20509 


410001 


AB041035 


Hs.57771 


452242 


R50956 


Rs.159993 


413231 


DB7461 


Hs.75244 


404641 






428648 


AF05272B 


Hs-1 86021 


414279 


AVW21691 


Hs.101067 


411573 


AB029000 


Hs.70823 


417832 


R20855 


Hs.5422 


431467 


N71831 


Hs.256398 


450313 


AKH89B9 


Hs.332633 


445472 


AB0Q6631 


Hs.12784 


419264 


AA877104 


Hs.293872 


418663 


AK001100 


Hs.41fi90 


452277 


AL049013 


Hs.28783 


447896 


A1436124 


Hs.294069 


447082 


T85314 


Hs.42644 


451468 


AW503398 


Hs293663 


415443 


T07353 


Hs.7948 


427212 


AW293849 


Hs .58279 


442113 


BH622187 


H8.9967Q 


433S17 


AW022133 


Hs.189838 


430829 


AW451999 


Hs.194024 


453111 


AB014598 


Hs-31720 


435839 


AF249744 


Hs.25951 


429113 


D28235 


Hs.196384 



secreted modular calcium-binding protein
ESTs, Weakly similar to YEW4_YEAST HYPOT
ESTs

cholecystokinin
PRO1914 protein
KIAA0261 protein

Homo sapiens cDNA: FLJ232B5 fis, clone H
acyl-Coenzyme A oxidase 2, branched chai
selectin E (endothelial adhesion molecul
Homo sapiens cDNA FLJ119B3 fis, clone HE
ESTs

prion protein (p27-30) (Creutzfeld-Jakob
sterol-C5-desaturase (fungal ERG3, delta
Homo sapiens clone TCCCTA00151 mRNA sequ
serum deprivation response (phosphatidyl
mitogen-activated protein kinase kinase
WW Domain-Containing Gene
ESTs, Weakly similar to AF208855 1 BM-01
polycythemia rubra vera 1; cell surface
single-stranded-DNA-binding protein
Homo sapiens cDNA FLJ13492 fis, clone PL
erbb2-interacting protein ERBIN
ESTs 

tyrosylprotein sulfotransferase 1
gb:^)60d04.r1 Soares_fetal_heart_NbHH19W
ESTs

SWAP-70 protein

protein kinase C-like 2

ESTs 

ESTs 

ESTs 

growth hormone receptor 

hypothetical protein FU12666 

v-rel avian reticuloendotheliosis viral

prostate cancer associated protein 1

hypothetical protein FLJ10902

ESTs, Moderately similar to I38022 hypot

Homo sapiens cDNA: FLJ22049 fis, clone H

coagulation factor III (thromboplastin,

xeroderma pigmentosum, complementation g

ATP-binding cassette, sub-family C (CFTR

ESTs, Weakly similar to A49019 dynein he

Homo sapiens cDNA FLJ11041 fis, clone PL

Homo sapiens cDNA FLJ12166 fis, clone MA

ESTs 

ESTs 

nasopharyngeal carcinoma susceptibility 
ESTs, Moderately similar to ZN91_HUMAN Z
RAB9, member RAS oncogene family
cholesterol 25-hydroxylase
hypothetical protein MGC4643
phospholipase A2, group VII (platelet-ac
ESTs

ubiquitin-activating enzyme E1 (A1S9T an
frizzled (Drosophila) homolog 6
histone acetyltransferase 1
glioma-amplified sequence-41
HBV pX associated protein-8
kallikrein 11
glycosyltransferase
BCL2-like 2

NM_021965*:Homo sapiens phosphoglucomuta
potassium voltage-gated channel, subfami
GCN5 general control of amino-acid synt
KIAA1077 protein
glycoprotein M6B

Homo sapiens mRNA; cDNA DKFZp434E052B (f
Bardet-Biedl syndrome 2
Homo sapiens mRNA for KIAA0293 gene, par
ESTs, Weakly similar to ALUB_HUMAN 111!



] PL 



KIAA1223 protein 

Homo sapiens cDNA FLJ133B4 fis,
rnkjcedoxln-tlke 

ESTs, Moderately similar to I38022 hypot 
ESTs 

ESTs, Weakly similar to ALU7_HUMAN ALU S
ESTs, Weakly similar to I38022 hypotheti
ESTs 



Rho guanine nucleotide exchange factor ( 
ndoperowdo synthase 2 (p 



14.4 

14.4

14.4 

14.4 

14.3 

14.3 

14.3

14.3 

14.2 

14.2 

14.2

14.2 

14.2 

14.2 

14.2 

14.1 

14.1 

14.0 

14.0 

14.0 

14.0 

13.9 

13.9 

13.9

13.9 

13.9 

13.9 

13.9 

13.9 

13.9 

13.9 

13.9 

13.9 

133 

13.8 

13.8 

13.8 

13.8

13.8

13.8 

13.8 

13.8 

13.7

13.7 

13.7 

13.7 

13.7 

13.7 

13.7 

13.7 

13.7 

13.7

13.7

13.7

13.7

13.7 

13.6 

13.6 

13.6 

13.6

13.6

13.6

13.6 

13.6 

13.5 

13.5 

13.5 

13.5 

13.5 

13.5 

13.5 

13.5 

13.5 

13.5

13.4 

13.4 

13.4

13.3 

13.3 

13.3 

13.3 

13.3 

13.3 
AfTTtQ-


AW937826 


Hs. 103252 




AWfd^oUZ 


Hs.2132 




A KVt-i -iAC 
WUJ1140 


Hs.226627 


*tlO/tf 


a Ainneco 


Hs.111495


«kfJUdf 


\\\t in aa cfl 
NW^JJ14158 


Hs.279938






H6.8505D 










Wj/a43 


Hs.34992


iWC707 
lODfa / 


tvKl dl4Hl 


Hs.336454




T0555S 


Hs.1568B0 


4o4UJo 


A1059131 


Hs.197733 


44ob^U 


AB002367


Hs.21355 


450755 


AAQ109B4 


Hs.1594B4 




NM_001429


Hs.25272 


40 8495 


W68796 


Hs.237731 




U20158 


HS.248B 




AI767756 


Hs.82302


416937 


D86957 


Hs.807t2 


453006 


AJ362575 


Hs.167133


421750 


AKD00768 


Hs.107872 


415752 


BE314524 


Hs.78776 


411529 


AA430348 


Hs.317596 


41731ft 


AW953937 


Hs.12891 


410763 


AF279145 


Hs.8966 


4<£oUi4 


AI276SB9 


H £.56 123 


447966 


AA340505 


Hs.105887


428218 


AA424266 


Hs.123642 


409799 


D11928 


Hs.76845 


448779 


BE042877 


Hs.177135 


446944 


ABG14605 


Hs.22599


417315 


AKJ90042 


Hs.336901 


429697 


AW296451 


Hs.24605 


432674 


AA641092 


Hs.257339 


445309 


AL157474


Hs.12504 


451099 


R52795 


Hs.25954 


An* one 


AA382523 


Hs.105689


442343 


AA992480 


Hs.129874


432244 


AI669973 


Hs.200574 


459513 


Al 032946 




452512 


AW363486 


Hs.337635


415079 


R43179 


Hfi.22B95 


422365 


AF035537 


Hs.115521 


411057 


AI6B1006 


Hs.71721


442501 


AA315267 


Hs.2312B 


434466 


AB 037829 


Hs.3862 


437052 


AA861697 


Hs.120591 


433234 


A9 040923 


Hs.65366


453830 


AA534296 


Hs.20953 


409995 


AW9G0597 


Hs.30164 


414290 


AI568801 


Hs.71721 


417248 


AA329449 ' 


HsJ247302 


418624 


A [734080 


Hs.1 04211 


412654 


AW93480 


Hs.29263


450253 


AU 33047 


Hs^4715 


44wty 


AUQ75643 


Hsv313 


42707 B 


AI675062 


Hs,1 11902 


42093a 


AW11B210 


Hs.5244


4497B4 


AW161319 


Hs.1 291 5 


431325 


AWD26751 


Ks.5794 


41 5270 


UBB666 


Ks.76353 


426110 


NM_002913


Hs.166563


40D3Q1 


X03635 


Hs.1 657 


434926 


BE543289 


Hs.50252 


426316 


NM_002430


Hs.268515 


420345 


AW295230 


Hs.25231 


432205 


AJ 806583 


Hg,125291 


451893 


AW1 92083 


Ha.290B55 


435266 


AK001942 


Hs.4863 


427003 


U19487 


Hs.2090 


404642 










Hs.4007 


429716 


R25685 


Hs£11933 


420871 


AA702972 


Hs.65300 


448072 


AI459306 


Hs.24908 


441269 


AW0152O6 


Hs.1 78784 


427761 


AA412205 


Hs.140996 


425322 


U63630 


Hs.155637


417733 


A1048678 


HS.B25D3 


436521 


AW2039S6 


Hs.213003 


4QB920 


AL120071 


Hs4899a 


445332 


AK001635 


Hs.14838


453994 


BE180964 


Hs.165590 


420397 


NM_007018


Hs.97437 



ESTs, Weakly similar to ZN91_HUMAN ZINC 13.3

epidermal growth factor receptor pathway 13.3

leptin receptor 13.2

Homo sapiens cDNA FLJ11643 fis, clone HE 13.2

HSPC067 protein 13.2

phospholamban 13.2

Target Exon 13.2

KIAA1323 protein 13.2

hypothetical protein MGC14879 13.2

ESTs 13.2

hypothetical protein MGC2349 13.2

doublecortin and CaM kinase-like 1 13.2

ESTs 13.1

E1A binding protein p300 13.1

ESTs 13.1

lymphocyte cytosolic protein 2 (SH2 doma 13.1

Homo sapiens cDNA FLJ14314 fis, clone NT 13.1

KIAA0202 protein 13.0

ESTs 13.0

hypothetical protein FLJ20761 13.0

putative transmembrane protein 13.0

Homo sapiens cDNA FLJ12927 fis, clone NT 13.0

ESTs 13.0

hypothetical protein FLJ21776 12.9

Homo sapiens cDNA FLJ13443 fis, clone PL 12.9

ESTs, Weakly similar to Homolog of rat Z 12.9

EphA3 12.9

phosphoserine phosphatase-like 12.9

ESTs 12.8

atrophin-1 interacting protein 1; activi 12.8

ribosomal protein S24 12.8

ESTs 12.8

ESTs, Weakly similar to I38022 hypotheti 12.8

likely ortholog of mouse Aikatfa 12.8

interleukin 13 receptor, alpha 2 12.8

MSTP031 protein 12.7

ESTs 12.7

ESTs 12.7

gbax06g09j1 SoaresJa1aTJrver_spleen_ 12.7

ESTs 12.7

hypothetical protein FLJ23548 12.7

REV3 (yeast homolog)-like, catalytic sub 12.7

ESTs 12.7

ESTs 12.7

regulator of nonsense transcripts 2; DKF 12.7

ESTs 12.7

KIAA1495 protein 12.6

ESTs 12.6

ESTs 12.6

ESTs 12.6

twisted gastrulation 12.6

ESTs 12.6

hypothetical protein FLJ11896 12.6
Homo sapiens mRNA; cDNA DKFZp434D0215 (f 12.6

secreted phosphoprotein 1 (osteopontin, 12.5

ESTs 12.5

ESTs 12.5

ESTs 12.5

ESTs, Weakly similar to 2109260A B cell 12.5

SFRS protein kinase 2 12.5

replication factor C (activator 1) 1 (14 12.5

estrogen receptor 1 12.5

mitochondrial ribosomal protein L32 12.5

meningioma (disrupted in balanced transl 12.5

ESTs 12.4

ESTs 12.4
ESTs 12.4

hypothetical protein DKFZp5S6A1524 12.4

prostaglandin E receptor 2 (subtype EP2) 12.4
NM_021965*:Homo sapiens phosphoglucomuta 12.4

Sarcolemmal-associated protein 12.4

collagen, type XIII, alpha 1 12.4

ESTs 12.4

ESTs 12.4

ESTs 12.3

ESTs 12.3

protein kinase, DNA-activated, catalytic 12.3

H.sapiens mRNA for 3' UTR of unknown prot 12.3

ESTs 12.3

fibronectin leucine rich transmembrane p 12.3

hypothetical protein FLJ10773 12.2

ribosomal protein S13 12.2

centrosomal protein 1 12.2
447471 AF039843
442562 BE3795B4 
400080 

425920 AL049977 
419743 AW408762 
414222 AL135173 
427051 BE178110 
452994 AWS62597 
4251S4 NMJM1851 
437124 AA55445B 
411450 H49B19 
421559 MMJ114720 
449625 NR1.014253 
413459 AW291128 
426252 BE1769B0 
414171 AA360328 
436169 AA8BB311 
435047 AA4549S5 
417625 U593G5 
439820 AJL3602O4 
443646 AJ065198 
434874 N5244Q 
443837 A1984525 
45t540 M1 95601 
424232 AB015982 
414555 N98S69 
446874 AW96B304 
454119 BE549773 
436746 AA730O45 
432359 AAQ76049 
416509 R39769 
44475B AL044878 
413991 H44725 
431645 AF07BB49 
421406 AF179897 
408826 AF216077 
424903 T26477 
412977 AA125910 
426981 AU044675 
410853 HD45BB 
44467D H5B373 
444381 BE387335 
434398 AA121098 
438032 BE045624 
433212 BE218049 
424330 AW073953 
452679 Z423S7 
430929 AA489166 
423782 AI472209 
425770 NMJH4363 
437106 AA434054 
430200 BE613337 
453857 AL080235 
411096 U80034 
422805 AA436989 
42253B NM_00644t 
412677 AW029603 
421896 N62293 
453930 AA419466 
443123 AA094538 
415293 R49462 
443161 AJ038316 
420185 ALD44056 
445527 W39694 
426829 R14050 
416B52 AF2B3776 
449919 AI674685 
414844 AA296674 
449543 AF070632 
429784 MB97S6 
407786 AA687538 
414407 M147026 

441247 Awnaeai 
protein kinase C, mu 11.9

phospholipase A2, group IIA (platelets, 11.9
uncoupling protein 4 11.9

ESTs 11.9
feJ>9197


444821 


AA053564 


Hs.12040 


451989 


AF169797 


Hs.37413


439731 


AI953135 


Kfe»45140 


453127 


AI6U6671 


Hs.294110


435705 


W31254 


Hs.7045 


410746 


BE383816 


Hs.12532 


416250 


U29926 


Hs.83918


406832 


AW085S90 


Hs.63428


418339 


AA639902 


Hs.104215


452367 


U71207 


Hs.29279


432867 


AI926047 


Hs.162859 


430291 


AV660345 


Hs.236126


440370 


AA88400O 


Hs.8173 


412350 


A1659308 


Hs.73826 


428 B25 


AJ0B4336 


Hs.128783


441054 


AA913591 


Hs.126480 


415675 


AA894876 


Hs.5687 


453078 


AF053551 


Hs,315B4 


409203 


AA760473 


Hs.687 


446720 


AI439136 


Hs.140546 


418475 


AI858732 


Hs.30443


417708 


N74392 


Hs.50495


407870 


AB032930 


Hs.40719 


420568 


F09247 


He ?A7TV> 


41B858 


AW961605 


Hs.2t145 


453548 


A1079983 


Hs.116774


427306 


D260S7 


Hs.174905 


434579 


T55958 




434826 


AF155661 


Hs.22265 


427528 


AU077143 


Hs.179565


413174 


AA723564 


Hs.191343 


443250 


AKW1530 


Hs.132107 


451957 


AT796320 


Hs.10299 


451405 


AI694320 


Hs.6295


422299 


AK000181 


Hs.114556 


427635 


BE39798B 


Hs.179962 


416533 


BE244053 


Hs.79362



ESTs

Homo sapiens mRNA for KIAA1657 protein,

gb:tL3-CT02l4-291Z99^52-Al2 CT0214 Homo

HlV©A4nduclble

hypothetical protein

ESTs

gb:yo11c07.s1 Stratagene lung (937210) H

laminin, beta 3 (nicein (125kD), kalinin

KIAA1034 protein

ESTs

ESTs

folate hydrolase (prostate-specific memb

Homo sapiens cDNA FLJ13446 fis, clone PL

hypothetical protein DKFZp434E23^B

serine (or cysteine) proteinase inhibito

KIAA1095 protein

carboxypeptidase A3 (mast cell)

ESTs, Weakly similar to ALU1_HUMAN ALU S

thyroid hormone receptor interactor 7

ESTs

Homo sapiens mRNA; cDNA DKFZp434P228 (fr
ELL-RELATED RNA POLYMERASE II, ELONGATI
N-acetylgalactosaminidase, alpha-
ubiquitin-conjugating enzyme E2M (homolo
ESTs

CDC14 (cell division cycle 14, S. cerevi

HBV pX associated protein-8

ESTs, Weakly similar to YC01_HUMAN HYPOT

hypothetical protein FLJ13117

Homo sapiens mRNA; cDNA DKFZp434E199 (fr

ESTs

forkhead box F1

Sam68-like phosphotyrosine protein, T-ST
ESTs

uncharacterized bone marrow protein BM04
ubiquitin-conjugating enzyme E2E 2 (homo
ESTs, Weakly similar to A3461S profilagg
ESTs

zinc finger protein 7 (KOX 4, clone HF.1
Novel human gene mapping to chomosome 20
hypothetical protein FLJ11273
ESTs, Weakly similar to I38022 hypotheti
Homo sapiens cDNA: FLJ23006 fis, clone L
hypothetical protein DKFZp762K222
ESTs, Weakly similar to A43932 mucin 2 p
ESTs

STE20-like kinase

adaptor protein containing pH domain, PT

hypothetical protein FLJ14084

ESTs

GL004 protein

chromosome 1 open reading frame 21
adenosine monophosphate deaminase (isofo
ESTs, Weakly similar to Z195_HUMAN ZINC
ESTs, Moderately similar to SPCN_HUMAN 8
eyes absent (Drosophila) homolog 2
ESTs

CQ-49 protein

hypothetical protein FLJ10803
protein tyrosine phosphatase, non-recept
ESTs, Weakly similar to I38022 hypotheti
ESTs

protein phosphatase 1B (formerly 2C), ma
matexln2

cytochrome P450, subfamily IVH, polypept
ESTs

sentrin/SUMO-specific protease
ESTs

hypothetical protein KIAA1164
protocadherin alpha 10
hypothetical protein RG083M05.2
integrin, alpha 1
WAAC033 protein

gb:yb35f05.r1 Stratagene fetal spleen p
pyruvate dehydrogenase phosphatase
minichromosome maintenance deficient (S.
ESTs
ESTs

Homo sapiens cDNA FLJ13545 fis, clone PL
ESTs, Weakly similar to T17248 hypotheti
hypothetical protein FLJ20174
tumor protein p53-binding protein
retinoblastoma-like 2 (p130)



10.3 
10.2 
10.2 
10.2 
10.2 
10.2 
10.2 
10.2 

10.2

10.2
10.2
10.2 
10.2 
10.2 
10.2 
10.1 
10.1 
10.1 
10.1 
10.1
10.1 
10.1 
10.1 
10.1 
10.1 
10.1 
10.1 
10.1 

10.1

10.1 

10.1 

10.0 

10.0 

10.0

10.0 

10.0 

10.0 

10.0 

10.0

10.0 

10.0 

10.0

10.0 

10.0 

10.0 

10.0

9.9

9.9

9.9 

9.9 

9.9 

9.9

9.9 

9.9 

9.9 

9.9 

9.9 

9.9 

9.9 

9.9 

9.9 

9.9 

9.9 

9.8 

9.8 

9.8 

9.8 

9.8

9.8

9.8

9.8

9.8

9.8

9.8

9.8

9.8

9.8 
9.8 
9.8 
9.8
9.8 
9.8 
9.8 
439091


AAB30144 


Hs.135613 


418575 


AA225313


Hs.222886


432426 


AW973152 


Hs.31050 


40457 1 






444427 


H25Q94 


Hs.293663 


439183 


AW97CS0O 


He 303261 


432B86 


BE15902B 


Hs.279704 


432689 


AB018320 


Hs.278B26 


417333 


AL157545 


Hs.42179


429302 


AU076674 


H&.198B99 


412324 


AW978439 


Hs.69504 


422072 


AB018255 


Hs.111138 


431318 


AA502700 


Hs.293147 


417171 


BE613486 


Hs.81412


421709 


AA1593S4 


Hr m70fifi 

1 IV. 1 Mr VOU 


415156 


X64908 


Hs.7B0BO 


436446 


AW016B09 


Hs.323795 


447462 


AB033059 


Hs.18705

r to* jo/ vw 


439726 


AW44SB93 

wU 




429525 


N92540 




4091 90 


AU076536 




419111 


AA234172 


Hs.137418


452487 


AW207659 


Hs BSin 


416039 


AA376389 


Hs.78989

iWif VWD7 


40B239 


AAQ53401 


Ha 771R9? 


419241 


AA523939 


Hs 1&525H 


409752 


AW953990 




453894 


AW937825 


Hs.56847 


439671 


AW162340 


Hs.6641 


407230 


AA157B57 




42E0B3 


AVU9B2712 


no. l.cO/ \£. 


452221 


C21322 


Hs.288057 


434263 


N34B95 


HS.4464B 


412766 


BE544475 


Hs.54347


418723 


AAS 04423 


Hs.10487


443970 


Al 290341 


Hs.166571

i la. 1 ww i | 


424534 


DB76B2 


He 1 


419407 


AW410377 


Hs.41502


447B89 


AW46918Q 




429504 


X99133 




433165 


AA578204 


Hs.292437 


418700 


A19639Q8 




440282 




Ho 71 T7 


411630 




nS,( Ills 


408915 


MM ftlRRKI 


He 4RQJin 


44B552 


AW973653 




432652 


A 1376400 


He IROJvfiR 
no. 1 39900 


435803 


Z44194 


Hs.4994


437444 


H46003 


nSiJ 13 Id 


410252 


AW821182 


Hs R1i1fl 


435497 


AAB0872S 

nnWvf Ml 


Hs.291712 


449226 


AB0Q2365 


Re 91111 


449455 


NM_004380


Hr 


420969 


A 1636310 


Hs 


449645 


AW971163 


Ks 9683 


434415 


BE177494 




420805 


L10333 


Hs.55947


415083 


AJ 632583 


Hs.27179 


430387 


AW372884 


Ha 240770 


445280 


AW055063 


Ms 306088 


425657 


T89839 


Hs 119471 


428250 


AW809208 


He 183297 


453293 


AA3B2267 


Hs 1fW5l 


412446 


Af76B015 


Hs 92127 


441 102 


AA9739D5 




421589 




Hr 1flRR9fi 


448073 


W197B9 




422244 


Y08B90 


rib. 1 




W443S5 


He 


441499 




Hr IMfifiQ 
n&. lUluoif 


420184 


AA1BB408 


Hs.95665


420061 


AW024937 


Hs.29410 


420337 


AW295B40 


Hs.14555 


417407 


AA023276 


Hs.290305


45O5B0 


N40QB7 


Hs.16248 


419926 


AW9Q0992 


Hs.93796 


421928 


AF013758 


Hs.109643 


412639 


AW961284 


Hs.296235 


429786 


AL0BQ232 


Hs.220696 


446131 


NM_000929


Hs.290 


451418 


BE387790 


Hs.26369 


416658 


U03272 


Hs.79432 


425139 


AW63046S 


HS.325B20 



ESTs, Moderately similar to I38022 hypot 9.8

ESTs, Weakly similar to TRHY_HUMAN TRICH 9.7

ESTs 9.7

NM_015902*:Homo sapiens progestin induce 9.7

ESTs, Moderately similar to I38022 hypot 9.7

ESTs 9.7

chromatin accessibility complex 1 9.7

Arg/Abl-interacting protein ArgBP2 9.7

bromodomain and PHD finger containing, 3 9.7

eukaryotic translation initiation factor 9.7

ESTs 9.7

KIAA0712 gene product 9.7

ESTs, Moderately similar to A4601D ^lin 9.7

Spin 1 9.7

CED-5 protein 9.7

phosphorylase kinase, beta 9.7

ESTs 9.7

KIAA1233 protein 9.7

ESTs, Weakly similar to I38598 zinc fing 9.7

ectonucleoside triphosphate diphosphohyd 9.7

sarcoma amplified sequence 9.6

ESTs 9.6

Homo sapiens cDNA FLJ13329 fis, clone OV 9.6

alcohol dehydrogenase 5 (class III), chi 9.6

ESTs, Moderately similar to ALU7_HUMAN A 9.6

ESTs 9.6
gb:EST376Q63 MAGE resequences, MAGH Homo 9.6

Homo sapiens cDNA FLJ12874 fis, clone NT 9.6

kinesin family member 5C 9.6

keratin 19 9.6

ESTs, Weakly similar to AF191020 1 E2JG5 9.6

hypothetical protein FLJ22242 9.6

ESTs 9.6

ESTs 9.6

Homo sapiens, clone IMAGE:3954132, mRNA, 9.6

ESTs 9.6

KIAA0241 protein 9.6

hypothetical protein FLJ21276 9.6

ESTs 9.5

lipocalin 2 (oncogene 24p3) 9.5

ESTs 9.5

ESTs, Moderately similar to ALU5_HUMAN A 9.5

clones 23667 and 23775 zinc finger prote 9.5

Putative prostate cancer tumor suppresso 9.5

hepatocellular carcinoma novel gene-3 pro 9.5

hypothetical protein FU0C052 9.5

ESTs 9.5

transducer of ERBB2, 2 9.5

ESTs 9.5

microfibrillar-associated protein 1 9.5

ESTs, Weakly similar to I38022 hypotheti 9.5

KIAA0367 protein 9.4

CREB binding protein (Rubinstein-Taybi s 9.4

ESTs 9.4

DnaJ (Hsp40) homolog, subfamily C, membe 9.4
SkROS4tT0596-270300.011-C05 HT0596 Homo 9.4

reticulon 1 9.4

Homo sapiens cDNA FLJ12933 fis, clone NT 9.4

nuclear cap binding protein subunit 2, 2 9.4

v-crk avian sarcoma virus CT10 oncogene 9.4

ESTs 9.4

DKFZP566F2124 protein 9.4

ESTs 9.4

ESTs 9.4

intermediate filament protein syncoilin 9.4

KIAA1696 protein 9.4

Homo sapiens, clone IMAGE:4179482, mRNA 9.4

karyopherin (importin) beta 3 9.4

ESTs, Weakly similar to T33458 hypotheti 9.4

ESTs 9.4

hypothetical protein 9.4

ESTs 9.4

Homo sapiens cDNA: FLJ21513 fis, clone C 9.4

ESTs, Weakly similar to protease [H.sapi 9.3

ESTs 9.3

DKFZP586D2223 protein 9.3

polyadenylate binding protein-interactin 9.3

ESTs 9.3
Homo sapiens mRNA; cDNA DKFZp586A061 (fr 9.3

phospholipase A2, group V 9.3

hypothetical protein FLJ20287 9.3

fibrillin 2 (congenital contractural ara 9.3
430027
424650
433862 
429227 
401558 



437678 
416853 
431117 
45050B 
431187 
421202 
429597 
430669 
435B69 
44BB22 
459161 
423B00 
432189 
434361 
443247 
419379 
41J92B 
445704 
44B430 
451246 
418821 
427213 
419103 
450654 
434194 
447033 
420757 
428593 
430223 
444984 
445574 
430294 
419852 
440349 
432161 



416662 
422763 
4461B3 
432897 
4489 Bfl 
449523 
407204 
413554 
414917 
434539 
441805 
412530 
424683 
426163 
413132 
425490 



441079 
437231 
431316 
403137 



4184B9 
453344 
410227 
417173 
408161 
420495 
433213 
451573 



434725 
41BB76 



A1571940 
AA974491 
AA825247 
AI493742 

AW970976 

A11480Q6 

AB023197 

AF131784 

AA151057 

D969BO 

AI961456 

AAB11B45 

AAB2986Q 

AA768553 

AF003522 

NMJXM460 

AW971146 

AF193339 

NMJM3B16 

AW969657 

AF255910 

BE149845 

BE2B4645 

AA331155 

AA527941 

AF129755 

BE614387 

Z17805 

AA888624 

AW08O932 

A150C642 

AW189232 

AA436002 

AW007211 

240229 

AJ245587 

AF119847 

AI357412 

X78592 

AW207440 

NMJH2514 

H15474 

BE410347 

AI538226 

AW503756 

AABB4196 

AKD0O40O 

T25B53 

AA033699 

AA354991 

AH55099 

Y09763 

NM_000579 

R41933 

AA319146 

C04863 

AW748078 

AA285136 

AA76826B 

NB7519 

NMJM1992 

NM_006823

NM_001621

T15767 

AW150G97 

BE3030DO 

AA502633 

AI8217B4 

U76421 

BE349075 

AB0Q9284 

U61397 

AW952912 

A1338247 

AW6G5130 

AW130351 

AW015318 

AK000796 

AA740616 



Hs.7549 
Hs.21734 
Hs,250B39 
Hs.165210

Hs.293553 

Hs.222120

Hs.227743 

Hs.25318 

Hs.153498 

Hs.3610 

Hs.21275 

Hs.106290 

Hs.122634 

Hs.74170 

Hs.250500 

Hs.418 

Hs.293187 

Hs.102506

Hs.2442 

Hs.291029 

Hs.54650 

Hs.289038

Hs.282093 



Hs.117772 

Hs.333893 

Hs.93564 

Hs.197288

Hs^49247 

Hs.289067 

Hs.39140 

Hs.183161

H8.1B131 

Hs.96423 

Hs.25275

RS.2B3940 

Hs.157601 

Hs.99915 

Hs.185973 

Hs.235935

Hs.132898

Hs.13063

HJk32976 

Hs^861B4 

Hs.31476 

HsJ92807 

Hs.7538

Hs.83936 

Hs.14222 

rte.279780 

Hs.22765 

Hs.54443 

Hs.140237

Hs.76426 

Hs.47191 

Hs.214410 

K3.301914 

Hs.266273 

H&.271S6 

Hs.199067

Hs.75209

Hs.170087 

Hs.22452 

Hs.10741fi 

Hs.288929 

Hs.145037 

Hs.ia8578 

Hs.fi 53 02 

Hs.44571 

Hs.61152 

Hs.81424 

Hs.300383

Hs.98314

Hs.137190

Hs.243549 

Hs.23165 

Hb^104 



ESTs 9.3

ESTs 9.3

heat shock factor binding protein 1 9.3

ESTs, Moderately similar to I38022 hypot 9.3

NM_001839*:Homo sapiens calponin 3, acid 9.3

ESTs 9.3

ESTs 9.3

KIAA0980 protein 9.2

Homo sapiens clone 25194 mRNA sequence 9.2

chromosome 18 open reading frame 1 9.2

KIAA0205 gene product 9.2

hypothetical protein FLJ11011 9.2

ENSP00000220478*:SECRETOGRANIN III. 9.2

Kelch motif containing protein 9.2

ESTs 9.2

metallothionein 1E (functional) 9.2

delta (Drosophila)-like 1 9.2

Myoblast activation protein, alpha 9.2

ESTs 9.2

eukaryotic translation initiation factor 9.1

a disintegrin and metalloproteinase doma 9.1
ESTs

junctional adhesion molecule 2 9.1

hypothetical protein MGC4126 9.1

hypothetical protein FLJ21918 9.1

gb:EST35034 Embryo, 6 week, subtracted ( 9.1

gb:nh30c04.s1 NCI_CGAP_Pr3 Homo sapiens 9.1

ESTs 9.1

c-Myc target JPO1 9.1

Homer, neuronal immediate early gene, 2 9.1

rab3 GTPase-activating protein, non-cata 9.1

heterogeneous nuclear protein similar to 9.1

Homo sapiens cDNA FLJ11404 fis, clone HE 9.1

cutaneous T-cell lymphoma tumor antigen 9.1

ESTs 9.1

hypothetical protein FLJ12876 9.1

hypothetical protein RJ23G33 9.1

Kruppel-type zinc finger protein 9.1

Homo sapiens PRO1550 mRNA, partial cds 9.1

ESTs 9.0

androgen receptor (dihydrotestosterone r 9.0

degenerative spermatocyte (homolog Droso 9.0

nephroblastoma overexpressed gene 9.0

fatty acid desaturase 1 9.0

transcription factor CA150 9.0

guanine nucleotide binding protein 4 9.0

hypothetical protein AI551D2.5 9.0

Homo sapiens cDNA FLJ13872 fis, clone TH 9.0

ESTs, Weakly similar to envelope [H.sapi 9.0

C800t409*:gIl744l226|p(rl|S31212colage 9.0

ESTs 9.0

ESTs, Moderately similar to MA52-HUMAN M 9.0
Homo sapiens mRNA; cDNA DKFZp761F019 (fr 9.0

NY-REN-16 antigen 9.0

gamma-aminobutyric acid (GABA) A recepto 8.9

cften^re(C-C nonreceptors 8.9

ESTs, Weakly similar to ALU1_HUMAN ALU S 8.9

secretogranin II (chromogranin C) 8.9

ESTs 8.9
ESTs, Weakly similar to MUC3_HUMAN MUCIN 8.9

neuronal specific transcription factor O 8.9

hypothetical protein FLJ13343 8.9

ESTs 8.9

v-erb-b2 avian erythroblastic leukemia v 8.9

protein kinase (cAMP-dependent, catalyti 8.9

aryl hydrocarbon receptor 8.9

Homo sapiens mRNA for KIAA1737 protein, 8.9

ESTs 8.9

hypothetical protein FLJ13258 similar to 8.9

ESTs 8.9

NM_005381*:Homo sapiens nucleolin (NCL), 8.9

ESTs 8.9

adenosine deaminase, RNA-specific, B1 (h 8.9

ESTs 8.9

exostoses (multiple)-like 2 8.9

ubiquitin-like 1 (sentrin) 8.9

hypothetical protein MGC3032 8.9
Homo sapiens mRNA; cDNA DKFZp586LD120 (f 8.9

ESTs 8.9

ESTs 8.9

ESTs 8.8

hypothetical protein 8.8
gb:ny97H Isl NCI_CGAP_GCB1 Homo sapiens 8.8
44455B


AW181975 


Hs.165892 


417355 


D13168 


Hs.62002


4240B4 


AI940675 


Hs.20914


431706 


Aiai60a6 


HsJ296341 


424956 


AW19B103 


Hs.158154 


441866 


BE464341 


Hs.21201


444647 


H14716 


Hs.11506


444855 


A1199736 


Hs.206275 


409643 


AW450866 


Hs.257359 


428847 


AA630050 


Hs.124344 


412330 


NM_005100


Hs,7B8 


410275 


U85658 


Hs.61795 


425465 


11 6964 


Hs.1904 


424113 


AJ7438B0 


Hs.12876 


459324 


AW080953 




451684 


AF216751 


Hs.26813 


451244 


AW008798 


HS.26744B 


439170 


AA332365 


Hs.165539 


424238 


AA337401 


Hs,13763S 


449686 


AW072813 


Hs_27096B 


44065B 


K29142 


HsJ43032 


410870 


U81599 


Hs.66731 


45B229 


AJS23S02 


Hs.177 


429831 


AA564489 


Hs.137526 


453463 


WD0712 


Hs.32990 


428340 


AF261Q8B 


HsU 54721 


453345 


AA302862 


Hs^0Q63 


452208 


AW340281 


Hs.33074 


440485 


BE243513 


Hs.7212 


445525 


BE149866 


Hs.14831 


408949 


AF1890J1 


Hs.49163 


424668 


D83702 


Hs.151573 


412719 


AW016610 


Hs.129911


423872 


A602031G 


Hs.134015 


408242 


AA251594 


Hs.43913 


430848 


AW021726 




421492 


BE1769S0 


Hs.104916 


449893 


T97999 


Hs.18214 


410442 


X73424 


Hs.63768 


425913 


AA365799 


Hs.50785 


425704 


U79293 


Hs.159264 


427723 


A1355260 


Hs.279789 


441683 


BE5G4214 


Hs.102946


447335 


BE617695 


Hs_2a6192 


428259 


AA424793 


Hs.24144 


433104 


ALO43002 


Hs.128246 


420041 


AH005142 


Hs.94592


432432 


AA541323 


Hs.115831


452335 


AW188944 


Hs.61272 


412673 


AL042957 


Hs.31B4S 


430335 


D80007 


HS.23&499 


419904 


AA974411 


Hs.18672 


442402 


NM_000954


Hs.8272 


433759 


AA6BOC03 


Hs,1Q9363 


434064 


AL049045 


Hs.180758 


401785 






418407 


AL044618 


Hs.8492B 


419647 


AA348947 


Hs.91816 


425907 


AA365752 


Hs.155965


452017 


AF109302 


Hs.27495 


424434 


H87028 


Hs.146861 


400303 


AA242758 


Hs.79136 


414600 


AA743331 


Hs.272572 


413992 


W26276 


Hs.136075


415071 


AKO02197 


Hs.284270 


451230 


BE546208 


Hs.25090 


441790 


AW294909 


Hs.132208 


441124 


T97717 


Hs.119563


438160 


AA779332 


Hs.122671 


433254 


D85782 


Hs.3229 


434851 


AA806164 


Hs.116502


420606 


BE548277 


Hs.103104 


426535 


AU077012 


Hs^88582 


411667 


BE160196 




454056 


AI273419 


Hs.135146 


435614 


RD9718 


Hs.20403


446595 


T5744B 


Hs.15467 


435677 


AA694142 


Hs.293726 


447894 


AW204253 


Hs.21912 


452682 


AA456193 


Hs.9071 


418372 


AA311833 


H6.8431B 


447231 


AKD01293 


Hs.330208 


41620B 


AW2D6248 


Hs.111092 



ESTs 8.8

endothelin receptor type B 8.8

hypothetical protein FLJ23056 8.8

adenylyl cyclase-associated protein 2 8.8

ESTs, Weakly similar to granule cell mar 8.8

mc&ft 3; DKFZP566B0846 protein 8.8

Human clone 23589 mRNA sequence 8.8

ESTs, Weakly similar to ALUA_HUMAN !!!! 8.8

ESTs 8.8

ESTs 8.8

A kinase (PRKA) anchor protein (gravin) 8.8

transcription factor AP-2 gamma (activat 8.8

protein kinase C, iota 8.8

ESTs 8.8
gbjw28c12jc1 NCI_CGAP_Co18 Homo sapiens 8.8

CDA14 8.8

hypothetical protein FLJ20039 8.8

ESTs, Weakly similar to I38022 hypotheti 8.8

ESTs 8.8

ESTs, Moderately similar to ALU4_HUMAN A 8.8

ESTs, Weakly similar to neuronal thread 8.8

homeo box B13 8.8

phosphatidylinositol glycan, class H 8.8

ESTs 8.8

DKFZP566F084 protein 8.8

aconitase 1, soluble 8.8

neurocalcin delta 8.8

Homo sapiens, clone IMAGE:3GM51 9, mRNA, 8.8

hypothetical protein PP1044 8.7

Homo sapiens, Similar to zinc finger pro 8.7

putative ribonuclease III 8.7

cryptochrome 1 (photolyase-like) 8.7

ESTs 8.7

uronyl 2-sulfotransferase 8.7

HBFIgeneproiiuct 8.7

gb:di27e02.y 1 Morton Fetal Cochlea Homo 8.7

hypothetical protein FLJ21940 8.7

ESTs, Weakly similar to B34087 hypotheti 8.7

propionyl Coenzyme A carboxylase, beta p 8.7

SEC22, vesicle trafficking protein (S. c 8.7

Human clone 23948 mRNA sequence 8.7

histone deacetylase 3 8.7

ESTs 8.7

hypothetical protein FLJ20940 8.7

ESTs 8.7

ESTs, Moderately similar to unnamed prot 8.7

kWho 8.7

ESTs 8.7

ESTs 8.7

ESTs 8.7

KlAA0;85 protein 8.7

ESTs 8.6

prostaglandin D2 synthase (21kD, brain) 8.6

Homo sapiens cDNA: FLJ23603 fis, clone L 8.6

hypothetical protein PRO0082 8.6

NM_002275*:Homo sapiens keratin 15 (KRT1 8.6

nuclear transcription factor Y, beta 8.6

hypothetical protein 8.6

ESTs 8.6

prostate cancer associated protein 7 8.6

hypothetical protein FLJ20580 8.6

LIV-1 protein, estrogen regulated 8.6

hemoglobin, alpha 2 8.6

RNA, U2 small nuclear 8.6

Homo sapiens cDNA FLJ11335 fis, clone PL 8.6

hypothetical protein FLJ20272 8.6

ESTs 8.6

ESTs 8.6

ESTs 8.6

cysteine dioxygenase, type I 8.6

ESTs 8.6

ESTs 8.6

ESTs, Weakly similar to ubiquitous TPR m 8.6
gb:QV1-HTO4miO200-059-hO3 HT0413 Homo 8.5

hypothetical protein FLJ13984 8.5

ESTs 8.5

hypothetical protein FLJ20725 8.5

ESTs, Weakly similar to TSGA_RAT TESTIS 8.5

ESTs 8.5

progesterone membrane binding protein 8.5

replication protein A1 (70kD) 8.5

crystallin, zeta (quinone reductase)-lik 8.5

hypothetical protein FLJ22332 8.5
434747


AA837085 


Hs.220585


4I593B 


BE383507 


Hs.78921 


450813 


A1739625 


Hs.203376 


413956 


AJ821351 


Hs.193133


408699 


AA056614 


Hs.106200 


44QB40 


AW629666 


Hs.203713 


407B91 


AA466620 


Hs.41135 


411019 


AW993097 


Hs.48617 


432620 


AA777749 


H^597B 


424511 


BE300512 


Hs.193557 


40B170 


AW204516 


Hs.31835 


452670 


AF068227 


Hs.30213 


450516 


AA902656 


Hs.21943 


417560 


U73336 


Hs.82283 


4305O6 


BE266026 


rfe.31476 


436953 


AW959074 


Hs.2364B 


418120 


AA213437 


Hs.192249 


419436 


AA991639 


HsJ42413 


432600 


AI821085 




446474 


Af792014 


Hs.13809


434384 


AA631910 


Hs.162849 


452234 


AW084176 


Hs.223296 


411562 


AL05O2O1 


Hs.70769 


421B22 


AB037748 


Hs.105204


431160 


AW971174 


Hs.324330


449567 


A1990790 


Hs.1BB614 


411Q6B 


3E247593 


Hs.145053 


452185 


AA12D761 


Hs.28307 


437972 


AK002015 


Hs.5887 


423393 


AA383024 


Hs.201603 


425592 


D90041 


Hs.155956


438440 


AAB07228 


Hs.225161 


431952 


AL049365 


Hs.272251 


422295 


AF051151 


Hs.114408


440145 


AW014231 


Hs.90790 


425242 


W3635 


Hs.155287 


432769 


AA6208U 


Hs.144959


433615 


AA7329B2 


Hs.269607


427229 


AI799751 


Hs.5635 


446947 


AF146747 


Hs.232165 


403149 






436G79 


AI1274A3 


Hs.t204oT 


421991 


NMJJ14918 


HsJ1C438 


436476 


AA326108 


Hs.33829 


434980 


AW770553 


Hs.14553


407385 


AA6101SD 


Hs.272072 


4S3121 


AI968264 


Hs.282266 


420630 


AL133101 


Hs.99508


426101 


AL0499B7 


Hs.166361 


419517 


AF0S2107 


Hs.90797 


425707 


AF116402 


Hs.11713 


440080 


AW051597 


Hs.143707 


422049 


W25760 


Hs.77631 


430154 


AW563058 


Hs.234725 


432695 


D634B0 


H6^78634 


407300 


AA102616 




406964 


AF201468 


Hs.49349 


41B051 


AW192635 


Hs.19479 


412125 


Y17114 


Hs.73393 


441523 


AW514263 


Hs.301771 


417958 


AA7673B2 


Hs.193417 


452040 


AW973242 


Hs.293690 


430460 


NMJW4673 


Hs.241519 


407802 


DB4145 


Hs.39913


420556 


AA278300 


Hs.124292 


414138 


AA135884 


Hs.3772 


4302)09 


AU20659 


Hs.5111 


408267 


AW3B0525 


Hs.32675 


429692 


N48422 


Hs.9977 


408108 


AI580492 


Hs.42743 


4QB41S 


AW9638S7 


Hs.44743 


430334 


AI824719 


Hs.326700


422083 


NMJX31141 


Hs.111256



ESTs 8.5

A kinase (PRKA) anchor protein 1 8.5

ESTs 8.5

ESTs, Weakly similar to ALU7_HUMAN ALU S 8.5

ESTs, Moderately similar to ZN91_HUMAN Z 8.5

ESTs, Weakly similar to S64054 hypotheti 8.5

endomucin-2 8.5

Homo sapiens cDNA FLJ12540 fis, clone NT 8.5

LIM domain only 7 8.5

ESTs, Moderately similar to ALU7_HUMAN A 8.5

ESTs 8.5

ceroid-lipofuscinosis, neuronal 5 8.5

NIF3 (Ngg1 interacting factor 3, S.pombe 8.5

5-methyltetrahydrofolate-homocysteine me 8.5

Homo sapiens cDNA FLJ13372 fis, clone TH 8.4

Homo sapiens cDNA FLJ13097 fis, clone NT 8.4

ESTs 8.4

hypothetical protein DKFZp434K1421 8.4

gb:ns95a12.yS NCI_CGAP_Pr3 Homo sapiens 8.4

hypothetical protein FLJ10648 8.4

ESTs 8.4

ESTs, Weakly similar to I38022 hypotheti 8.4

hypothetical protein DKFZp586E1923 8.4

KIAA1327 protein 8.4

ESTs 8.4

ESTs 8.4

ESTs 8.4

WW domain binding protein 4 (formin bind 8.3

RNA binding motif protein 7 8.3
Homo sapiens mRNA; cDNA DKFZp434O0917 (f 8.3

N-acetyltransferase 1 (arylamine N-acety 8.3

ESTs 8.3
Homo sapiens mRNA; cDNA DKFZp586W1418 (f 8.3

toll-like receptor 5 8.3

Homo sapiens cDNA: FLJ22930 fis, clone K 8.3

KIAA0010 gene product 8.3

ESTs 8.3

ESTs, Weakly similar toALUUttlMAN ALU S 8.3

ESTs 8.3

polycythemia rubra vera 1; cell surface 8.3

NrVL00l45ftHomo sapiens four and a half L 8.3

ESTs, Weakly similar to unnamed protein 8.3

KIAA0990 protein 8.3

bHLH protein DEC2 8.3

sterol O-acyltransferase (acyl-Coenzyme 8.3

ESTs, Weakly similar to I38022 hypotheti 8.3

ESTs 8.3
Homo sapiens mRNA; cDNA DKFZp434O0921 (f 8.3
Homo sapiens mRNA; cDNA DKFZp564F112 (fr 8.2

Homo sapiens clone 23520 mRNA sequence 8.2

E74-like factor 5 (ets domain transcript 8.2

ESTs 8.2

glycine cleavage system protein H (amino 8.2

serine (or cysteine) proteinase inhibito 8.2

KIAA0146 protein 8.2

gb:zn43e07.s1 Stratagene HeLa cell s3 93 8.2

beta-site APP-cleaving enzyme 8.2

ESTs 8.2

eyes absent (Drosophila) homolog 4 8.2

ESTs, Weakly similar to ALUF_HUMAN !!!! 8.2

ESTs 8.2

ESTs, Weakly similar to I38022 hypotheti 8.2

angiopoietin-like 1 8.2

novel RGD-containing protein 8.2

Homo sapiens cDNA: FLJ23123 fis, clone I 8.2

ESTs 8.2

aryl hydrocarbon receptor nuclear transl 8.2

tubulin-specific chaperone e 8.2

ESTs, Weakly similar to B34087 hypotheti 8.2

hypothetical protein 8.2

KIAA1435 protein 8.2

ESTs 8.2

arachidonate 15-lipoxygenase, second typ 8.2



TABLE 59B 



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey 
409386 



CAT Number Accessions 

11 2523 J AA0712G? T65940 TB4515 AA071334 






WO 03/042661 



PCT/US02/36810 



409752  115301_1    AW963990 AA078156 AW749482 AA077468 BE151571 AA376917
411479  1247077_1   AW848047 AW848202 AW848631 AW848142 AW848702 AW848121 AW848632 AW648140 AW848571 AW846009 AW848067 AW848069 AW848905 AW848214
411667  1253334_1   BE160198 AWS35896 T11520 AW935530 AW856073 AW861034
416913  163001_1    AW934714 BE161007 BE162500 AW749902 AW749664 BE162498 BE161005 M190449 AW513465 BE161006 BE162499
418866  179788_1    T65754 AA229857 AA229658
418676  179960_1    AA740616 AA654654 AA229923
419536  185688_1    AA603305 AA244095 AA244163
419544  185760_2    AI909154 AA526337 AA244193 AI909153
423500  232161_1    AA331156 AA331157 AA331155
426413  2666HL1     AA377823 AW954494 AI022688
429*53  300543_1    AA8B476S AW974271 AA592975 AA447312
430846  324621_1    AW021726 AA487752 AA488085
431121  328275_1    AW971157 AA492575 AA492520
432189  342819_1    AA527941 AI810608 AI620190 AA635266
432600  350959_1    AI821085 AW973464 AA554802 AI821831 AA657433 AA640756 AA650339
434415  365931_1    BE177494 AW276909 AA632849
434579  38916_1     T55958 T57205 AF147346
439518  47334_1     W76326 AF086341 W72300
443161  561305_1    AJ03B316 AI344531 AI261653
447974  745643_1    R76886 AI453674 R77049



TABLE 59C



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
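
As a minimal sketch of how the Nt_position column could be read programmatically, the following Python function splits a field such as 151830-152104,152649-152744 into (start, end) pairs, one per predicted exon. The function names and the treatment of both coordinates as inclusive are illustrative assumptions and are not stated in the source.

```python
def parse_nt_position(field):
    """Split an Nt_position field into a list of (start, end) integer pairs.

    Assumption (not from the source): exons are comma-separated and each
    exon is written as start-end with start <= end.
    """
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"exon start exceeds end: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive coordinates (hypothetical choice)."""
    return sum(end - start + 1 for start, end in exons)


exons = parse_nt_position("151830-152104,152649-152744")
length = total_exon_length(exons)
```

Fields that were damaged in extraction will raise an error rather than parse silently, which is the intended behaviour for a sketch like this.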



Pkey    Ref      Strand  Nt_position
400860  9757499  Minus   151830-152104,152649-152744
400880  9931121  Plus    29235-29336\363S3-36S&0
401197  9719705  Plus    176341-176452
401424  8176894  Plus    24223-24428
401519  6649315  Plus    157315-157950
401558  7139678  Plus    1035KM04090
401747  9789672  Minus   118596-118816,119119-1192^1195^^
                         131932,132451-132575,133580-134011
401785  7249190  Minus   165776-165996 ( 168mi6&14,16&WM66569,167tm
402076  8117410  Plus    128316-128627
402612  6010110  Plus    25026-25091,25844-25920
402855  9662953  Minus   59763-59909
403047  3540153  Minus   59793-59968
403137  9211494  Minus   92349-92572,92958-93084,93579-93712,93949-94^
403149  9799833  Plus    25034-25185
403362  6571772  Plus    64099-64260
404210  5806246  Plus    169926-170121
404571  7249169  Minus   112450-112648
404641  9786810  Minus   32247-32362
404642  9796810  Plus    102999-103145
405523  8454643  Plus    114550-114686,1172^117407,119490*119^



Table 60A lists about 1168 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03
GeneChip array such that the ratio of "average" prostate cancer to "average" normal adult tissues was greater than or equal to 3.0. The "average" prostate cancer level was set to the
75th percentile amongst prostate cancers. The "average" normal adult tissue level was set to the 85th percentile amongst non-malignant tissues. In order to remove gene-specific
background levels of non-specific hybridization, the 10th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator before the
ratio was evaluated.
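
The selection rule described above can be sketched in Python as follows. This is an illustration under stated assumptions, not the procedure actually used: the source does not say how percentiles were interpolated or how a non-positive denominator was handled, so the linear-interpolation `percentile` helper, the `None` return value, and all function names here are hypothetical.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    The interpolation scheme is an assumption; the source does not specify one.
    """
    xs = sorted(values)
    if not xs:
        raise ValueError("no expression values supplied")
    pos = (len(xs) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(xs) - 1)
    return xs[lo] + (xs[hi] - xs[lo]) * (pos - lo)


def tumor_to_normal_ratio(tumor, normal, tumor_pct=75, normal_pct=85, background_pct=10):
    """Background-subtracted ratio of 'average' tumor to 'average' normal level."""
    # Background: 10th percentile amongst the non-malignant tissues.
    background = percentile(normal, background_pct)
    numerator = percentile(tumor, tumor_pct) - background
    denominator = percentile(normal, normal_pct) - background
    if denominator <= 0:
        return None  # hypothetical handling; the source does not cover this case
    return numerator / denominator


# Made-up expression values for a single probeset.
tumor_levels = [20, 40, 60, 92, 100]
normal_levels = list(range(21))  # 0, 1, ..., 20
ratio = tumor_to_normal_ratio(tumor_levels, normal_levels)
selected = ratio is not None and ratio >= 3.0
```

With these made-up values the 75th percentile of the tumor levels is 92, the 85th and 10th percentiles of the normal levels are 17 and 2, and the ratio is (92 - 2) / (17 - 2) = 6.0, so the probeset would pass the 3.0 cutoff.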

TABLE 60A: ABOUT 1168 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of prostate tumor to normal adult body tissue



Pkey ExAccn UnigeneID Unigene Title R1
426747 AA535210 Hs.171995 kallikrein 3, (prostate specific antigen 56.6
419526 AI821895 Hs.193481 ESTs 55.2
420154 AI093155 Hs.95420 JM27 protein 44.0
432441 AW292425 Hs.163484 ESTs 41.9
414569 AF109298 Hs.118258 prostate cancer associated protein 1 39.9
431616 AA508552 Hs.195839 ESTs, Weakly similar to I38022 hypotheti 37.9
400299 X07730 Hs.171995 kallikrein 3, (prostate specific antigen 36.0
446057 AI420227 Hs.149358 ESTs, Weakly similar to A45010 X-linked 32.9
425075 AA506324 Hs.1852 acid phosphatase, prostate 31.1
400298 AA032279 Hs.61635 six transmembrane epithelial antigen of 30.0
439176 AI446444 Hs.190394 ESTs, Weakly similar to B28096 line-1 pr 29.8
400302 N48058 folate hydrolase (prostate-specific memb 28.9
431579 AW971082 Hs.2228as ESTs, Weakly similar to TRHY_HUMAN TRICH 27.9
432101 AI918950 Hs.123642 EphA3 26.3
408369 R38438 Hs.182575 solute carrier family 15 (H77? transport 26.2
453096 AW294831 Hs.11325 ESTs 25.7
400267 S39329 Hs.181350 kallikrein 2, prostatic 25.5
409252 AK000631 Hs.62256 hypothetical protein FLJ20624 24.6
428819 AU 35623 Hs.193914 KIAA0575 gene product 24.5
401424 NM_001172:Homo sapiens arginase, type II 24.5
453370 AI470523 Hs.139336 ATP-binding cassette, sub-family C (CFTR 22.4
400292 AA250737 Hs.72472 BMP-R1B 22.3
407202 N58172 Hs.109370 ESTs 22.3
415989 AI267700 ESTs 20.2
407709 M456135 Hs^3023 ESTs 20.0
407168 R45175 Hs.117183 ESTs 19.6
433444 AW975324 Hs.129816 ESTs 19.4
409731 AA1259a5 Hs.56145 thymosin, beta, identified in neuroblast 19.1
428335 AA503115 Hs.183752 microseminoprotein, beta- 18.7
410929 H47233 Hs.30843 ESTs 18.0
424099 AF071202 Hs.139336 ATP-binding cassette, sub-family C (CFTR 17.9
418961 AW967646 Hs.23023 ESTs 17.7
452792 AB037765 Hs.30652 KIAA1344 protein 17.5
403047 NM_005656*:Homo sapiens transmembrane pr 17.3
433466 AA508353 Hs.105314 relaxin 1 (H1) 17.3
418396 AI765805 Hs.26691 ESTs 17.1
433847 AA603367 Hs.222294 ESTs 16.9
432240 AI694767 Hs.129179 Homo sapiens cDNA FLJ13581 fis, clone PL 16J
43WB7 D87742 Hs.241552 KIAA0268 protein 16.5
440260 Al972aS7 Hs.7130 copine IV 16.0
429170 NM_001394 Hs.2359 dual specificity phosphatase 4 15.7
429220 AW207206 ESTs 15.7
416370 N90470 Hs.203697 ESTs, Weakly similar to I38022 hypotheti 15.3
418819 AA228776 Hs.191721 ESTs 14.8
421513 X00949 Hs.105314 relaxin 1 (H1) 14.9
429918 AW873986 Hs.119383 ESTs 14.5
420092 AAS14043 Hs.88045 ESTs 14.5
432473 AI202703 Hs.152414 ESTs 14.4
450593 AW450481 Hs.203965 ESTs 14.4
431545 AI834273 Hs.9711 novel protein 14.4
432574 AA641092 Hs.257339 ESTs, Weakly similar to I38022 hypotheti 14.3
430187 A F79990 9 Hs.158989 ESTs 14.3
441690 R81733 Hs.33106 ESTs 14.3
408621 AI970672 Hs.46638 chromosome 11 open reading frame 8 14.2
448999 AF179274 Hs.22791 transmembrane protein with EGF-like and 13.7
450642 R39773 Hs.7130 copine IV 13.5
430376 AW292053 Hs.12532 chromosome 1 open reading frame 21 13.4
432985 AA650114 Hs.325198 ESTs 13.4
410330 AW023630 Hs.159425 ESTs 13.4
434666 AF151103 Hs.112259 T cell receptor gamma locus 13.1
400297 AI127076 Hs.306201 hypothetical protein DKFZp56401276 13.0
423073 BE252922 Hs.123119 MAD (mothers against decapentaplegic, Dr 12.8
431474 AL133990 Hs.190642 ESTs 12.6
418829 AA516531 Hs.55999 NK homeobox (Drosophila), family 3, A 12.5
45386* AW2G838 HSv30120 ESTs, Weakly similar to NUCL_HUMAN NUCLE 12.5
434217 AW014795 Hs.23349 ESTs 12.3
418432 M141EB Hs.85112 insulin-like growth factor 1 (somatomedi 12.3
428396 AI249388 Hs.98566 ESTs 12.2
421059 AI654133 Hs.30212 thyroid receptor interacting protein 15 12.2
447313 U92981 Hs.16081 Homo sapiens clone DT1P1B6 mRNA, CAG rep 12.2
443960 AI093577 Hs.265416 hypothetical protein FLJ21986 12.1
448290 AK002107 Hs.20843 Homo sapiens cDNA FLJ11245 fis, clone PL 12.1
436032 AA150797 Hs.109276 latexin protein 12.0
432527 AW975028 Hs.102754 ESTs 12.0
434792 AA649253 Hs.132458 ESTs 12.0
420424 AB033036 KsJ7594 KIAA1210 protein 1U
440529 AW207640 Hs.16478 Homo sapiens cDNA: FLJ21718 fis, clone C 11.9
424846 AU077324 Hs.1832 neuropeptide Y 11.9
448519 AW175665 HS.27BS95 Homo sapiens prostein mRNA, complete cds MM
418339 AA639902 Hs.104215 ESTs, Moderately similar to SPCN_HUMAN S 11.8
432600 AI821085 gb:ns95a12.y5 NCI_CGAP_Pr3 Homo sapiens 11.8
413597 AW302885 Hs.117183 ESTs 11.8
446100 AW967109 Hs.13804 hypothetical protein AM62023.2 11.7
431448 AL137517 Hs.306201 hypothetical protein DKFZp56401278 11.6
424701 NM_005923 Hs.151988 mitogen-activated protein kinase kinase 11.6
432435 BE218866 Hs.282070 ESTs 11£
429290 AF203032 Hs.198760 neurofilament, heavy polypeptide (200kD) 11.5
410467 AF102546 Hs.63931 dachshund (Drosophila) homolog 11.5
439444 AI277652 Hs.54578 ESTs, Weakly similar to I38022 hypotheti 11.5
450382 AA397658 Hs.60257 Homo sapiens cDNA FLJ13598 fis, clone PL 11.5
452340 NM_002202 Hs.505 ISL1 transcription factor, LIM/homeodoma 11.1
417332 AW972717 Hs.288462 hypothetical protein FLJ21511 11.0
407021 U52077 gb:Human mariner1 transposase gene, comp 11.0
419563 AA526235 Hs.193162 Homo sapiens cDNA FLJ11983 fis, clone HE 10.8
432729 AK000292 Hs.130732 hypothetical protein FLJ20285 10.6
400301 X03635 Hs.1657 estrogen receptor 1 10.5
419536 AA603305 gb:np12d1 \jsi NCI_CGAP_Pr3 Homo sapiens 10.5
445472 AB006631 Hs.12784 Homo sapiens mRNA for KIAA0293 gene, par 10.5
415539 AI733881 Hs.72472 BMP-R1B 10.4
419264 AA877104 Hs.293672 ESTs, Weakly similar to ALUB_HUMAN !!!! 10.4
424432 AB037821 Hs.146858 protocadherin 10 10.3
439518 W76326 gb:zd60d04.r1 Soares_fetal_heart_NbHH19W 10.3
434036 AI659131 Hs.197733 hypothetical protein MGC2849 10.3
415263 AA948033 Hs.130853 ESTs 10.2
431217 NM_013427 Hs.250830 Rho GTPase activating protein 6 10.2
444805 AB007899 Hs.12017 homolog of yeast ubiquitin-protein ligas 10.1
437252 AI433633 Hs.164159 ESTs, Weakly similar to ALU1_HUMAN ALU S 10.1
444190 AI878918 Hs.10526 cysteine and glycine-rich protein 2 10.0
418293 AI224463 Hs.16063 hypothetical protein FLJ21877 10.0
432244 AI669973 Hs^00574 ESTs 10.0
451640 AA195601 Hs.26771 Human DNA sequence from clone 747H23 on 9.9
460164 AJ23S923 Hs.63931 ESTs 9.8
447072 D61594 Hs.17279 tyrosylprotein sulfotransferase 1 9.8
415079 R43179 Hs.22895 hypothetical protein FLJ23548 9.7
409799 D11928 Hs.76845 phosphoserine phosphatase 9.6
433234 AB040928 Hs.65366 KIAA1495 protein 9.5
400296 AA305627 Hs.139336 ATP-binding cassette, sub-family C (CFTR 9.5
422728 AW937826 Hs.103262 ESTs, Weakly similar to ZN91_HUMAN ZINC 9.5
426154 NM_001851 Hs.154650 collagen, type IX, alpha 1 9.4
433927 AI557019 Hs.116467 small nuclear protein PRAC 9.4
444670 H58373 Hs.332938 hypothetical protein MGC5370 9.4
449625 NM_014253 odz (odd Oz/ten-m, Drosophila) homolog 1 9.4
450325 AI935962 Hs.26289 ESTs 9.4
427761 AA412205 Hs.140996 ESTs 9.4
453930 AA419466 Hs.33727 hypothetical protein FLJ10903 9.4
416795 AI497778 Hs.20509 HBV pX associated protein-8 9.3
422805 AA4359a9 Hs.121017 H2A histone family, member A 9.3
418848 AI820961 Hs.193465 ESTs 9.2
408430 879676 Hs.44926 dipeptidyl peptidase IV (CD26, adenosine 9.2
415788 AW628686 HsJ8851 KIAA0217 protein 9.2
404641 NM_021965*:Homo sapiens phosphoglucomuta 9.1
432437 W07088 Hs.293685 ESTs 9.1
436396 AI683487 Hs.152213 wingless-type MMTV integration site fami 9.1
436962 AW377314 Hs.5364 DKFZP564I052 protein 9.1
410765 AI694972 Hs.66180 nucleosome assembly protein 1-like 2 9.0
428342 AI739163 Homo sapiens cDNA FLJ13458 fis, clone PL 9.0
452281 T93500 Hs.28782 Homo sapiens cDNA FLJ11041 fis, clone PL 9.0
433043 W57554 Hs.125019 lymphoid nuclear protein (LAF4) mRNA 8.8
442082 R41823 Hs.7413 ESTs 8.8
436671 AW137159 Hs.146151 ESTs 8.7
415786 AW419196 Hs.257924 hypothetical protein FLJ13782 8.7
440774 AI420611 Hs.153934 ESTs 8.7
420120 AL049610 Hs£5243 transcription elongation factor A (SII)- 8.7
427139 N77624 Hs.173717 phosphatidic acid phosphatase type 2B 8.7
430722 AW968543 Hs.203270 ESTs, Weakly similar to ALU1_HUMAN ALU S 8.6
422487 AJ010901 Hs.198267 mucin 4, tracheobronchial 8.6
443646 AI065198 Hs.164226 ESTs 8.6
414565 AA502972 Hs.183390 hypothetical protein FLJ13590 8.5
428728 NM_016825 Hs.191381 hypothetical protein 8.5
434926 BE543259 HSv502S2 mitochondrial ribosomal protein L32 8.5
437162 AW005505 Hs.5464 thyroid hormone receptor coactivating pr 8.5
437666 M156781 metallothionein 1E (functional) 8.5
453005 AI352575 Hs.303171 ESTs 8.4
418723 AA504428 Hs.10467 Homo sapiens, clone IMAGE:3954132, mRNA, 8.3
421040 AA715026 Hs.135260 ESTs 8.3
414212 AA136569 Hs.10846 KIAA0187 gene product 8.3
417916 NM_006416 Hs.82921 solute carrier family 35 (CMP-sialic aci 8.2
440749 W22335 Hs.7392 hypothetical protein MGC3199 8.2
412715 NM_000947 Hs.74519 primase, polypeptide 2A (58kD) 8.2
426829 R14050 Hs.194051 Homo sapiens mRNA; cDNA DKFZp566B213 (fr 8.2
447156 AW274731 Hs.157920 ESTs 8.1
426330 L22524 Hs.2256 matrix metalloproteinase 7 (matrilysin, 8.1
417412 X16696 Hs.82112 interleukin 1 receptor, type I 8.0
424692 AA429834 Hs.151791 KIAA0092 gene product 8.0
435981 H74319 Hs.188620 ESTs 7.9
442081 AA401863 Hs.22380 ESTs 7.9
434988 AI418055 Hs.161160 ESTs 7.9
407910 AA650274 Hs.41296 fibronectin leucine rich transmembrane p 7.8
420345 AW295230 Hs.25231 ESTs 7.8
412324 AW978439 Hs£9504 ESTs 7.8
420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 7.8
406374 AW025430 Hs.155591 forkhead box F1 7.8
435604 AA625279 Hs.26892 uncharacterized bone marrow protein BM04 7.8
453160 AI263307 Hs.239884 H2B histone family, member L 7.8
424051 AL110203 Hs.138411 Homo sapiens mRNA; cDNA DKFZp586J1922 (f 7.7
416182 NM_004354 Hs.7S069 cyclin G2 7.7
452110 T47667 He^8005 Homo sapiens cDNA FLJ11309 fis, clone PL 7.6
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R52795 


Hs.25954 


427003 


U194B7 


Hs.2090 


443180 


R15875 


HS.25B576 


454119 


BE549773 


H6.4051O 


418593 


AI75087B 


Hs.87409 


4265B1 


AB040955 


HS.135B9D 


417683 


AW566Q08 


Hs.239154 


439569 


AW6D2166 


Hs.222399 


41*6140 


A1916035 


HS.30119B 


451230 


BE546206 


Bs.26090 


400294 


N9579S 


Hs.276695 


436673 


N23874 


Hs.50477 


419743 


AW408762 


Hs.5957 


428600 


AW663261 


Hs.242413 


410252 


AWB21182 


Hs.61418 


424649 


BE242035 


Hs. 151461 


428216 


AA424266 


Hs.123642 


43415B 


T86534 


Hs.14372 


413249 


AF167160 


Hs.75251 


430261 


AA305127 


HS.237225 


429597 


NMJ003816 


Hs.2442 


423556 


AW976434 


Hs.3623 


433556 


W56321 


Hs. 11 1460 


420871 


AA702972 


Hs.65300 


416278 


AI0BB4B9 


Ha.83937 


456516 


BE172704 


Hs.222746 


417511 


AL049176 


Hs,B2223 


411667 


BE160198 




414422 


AA147224 


Hs.249195 


407116 


AA156790 


Hs.262036 


421566 


NM_000399 


Hs.1395 


416564 


AA631143 


Hs^78695 


437872 


AKD02015 


Hs£887 


446148 


NMJJ16578 


rte,205C9 


449523 


NRUBD579 


Hs.S4443 


431657 


A1345227 


Hs.105448 


457726 


AW974611 




415910 


U20350 


H5J8913 


421977 


W94137 


Hs_1 10165 


432161 


AK00O4C0 


Hs.341181 


4328B7 


A1926047 


Rs,162859 


444931 


AV65206G 


Hs.75113 


421823 


N40850 


Hs.28625 


426981 


AL044675 


Hs.173081 


4251 70 


AU077315 


Hs.154970 


411479 


AW84A047 




420954 


AA282074 


Hs.237323 


433492 


AF085310 


Hs.103159 


449919 


A1874685 


Ha.200141 


452221 


C21322 


HS.2B6057 


431555 


A1B15470 


Hs.260024 


441111 


AI80G8B7 


Hs>126594 


457498 


A1732230 


Hs.191737 


415293 


R49452 


Hs.103541 


429163 


AA8S4766 




43390B 


AW29B141 


Hs.157975 


446896 


T15767 


Hs.22452 


431770 


BE2218B0 


H3.26B555 


416575 


AA225313 


H&222866 


443745 


AB039670 


Hs.9728 


427958 


AA4180DD 


K3.9B280 


447605 


AW627932 


Hs.302421 


440995 


T57773 


Hs.10263 


422173 


BE38582a 


Hs.250B19 


431992 


NNL002742 


Hs*2891 


437052 


AA851587 


Hs.120591 


444030 


AW021254 


Hs.135055 


416836 


D54745 


H9.BQ247 


447033 


AB57412 


Hs.157601 


428927 


AA441837 


Hs.90250 


432189 


AA527941 




448779 


BE042877 


Hs.177135 


430598 


AK001764 


Hs^47112 


411630 


1)42349 


Hs.71119 


4326B2 


AI37640O 


Ks,15958B 


441499 


AW29B235 


Hs.101689 


441676 


9E5642Q6 


HS.49BB9 


421077 


AKO00061 


H8.101590 


452784 


B6463857 


Hs.151258 


452055 


AI377431 


Hs.141693 


431576 


M76665 


Hs.275215 


418994 


AA29652Q 


Hs.69546 


452277 


AU049013 


Hs.28783 



interleukin 13 receptor, alpha 2 7.6
prostaglandin E receptor 2 (subtype EP2) 7.5
claudin 12 7.5
uncoupling protein 4 7.5
thrombospondin 1 7.5
KIAA1523 protein 7.5
ankyrin repeat, family A (RFXANK-like), 7.5
CEGP1 protein 7.4
roundabout (axon guidance receptor, Dros 7.4
hypothetical protein FLJ20272 7.4
Homo sapiens prostein mRNA, complete cds 7.4
RAB27A, member RAS oncogene family 7.4
Homo sapiens clone 24416 mRNA sequence 7.3
hypothetical protein DKFZp434K1421 7.3
microfibrillar-associated protein 1 7.3
embryonic ectoderm development 7.3
EphA3 7.3
ESTs 7.3
DEAD/H (Asp-Glu-Ala-Asp/His) box binding 7.3
hypothetical protein HT023 7.3
a disintegrin and metalloproteinase doma 7.3
hypothetical protein FLJ11220 7.2
calcium/calmodulin-dependent protein kin 7.2
ESTs 7.2
hypothetical protein 7.1
KIAA1610 protein 7.1
chordin-like 7.1
gb:QV1-WM13-01(J2t)(W59-h03 KT0413 Homo 7.1
homeo box A13 7.1
ESTs, Weakly similar to 2223_HUMAN ZINC 7.1
early growth response 2 (Krox-20 (Drosop 7.0
Homo sapiens prostein mRNA, complete cds 7.0
RNA binding motif protein 7 7.0
HBV pX associated protein-8 7.0
chemokine (C-C motif) receptor 5 6.9
ESTs, Weakly similar to B34087 hypotheti 6.9
gb:EST388916 MAGE resequences, MAGN Homo 6.9
chemokine (C-X3-C) receptor 1 6.9
ribosomal protein L26 homolog 6.9
ESTs, Weakly similar to envelope [H.sapi 6.9
ESTs 6.8
general transcription factor IIIA 6.8
ESTs 6.8
KIAA0530 protein 6.8
transcription factor CP2 6.8
gb:IL3-CT0214-291299-052-A12 CT0214 Homo 6.8
N-acetylglucosamine-phosphate mutase 6.8
ESTs 6.8
ESTs 6.8
hypothetical protein FLJ22242 6.8
Cdc42 effector protein 3 6.7
ESTs 6.7
ESTs 6.7
ESTs 6.7
gb:am20a1 Hsl Soares_NFL_T_GBC_S1 Homo s 6.7
ESTs 6.7
Homo sapiens mRNA for KIAA1737 protein, 6.7
5'-3' exoribonuclease 2 6.7
ESTs, Weakly similar to TRHY_HUMAN TRICH 6.6
ALEX1 protein 6.6
potassium intermediate/small conductance 6.6
9^rr£n4 6.6
ESTs 6.6
phorbolin-like protein MDS019 6.6
protein kinase C, mu 6.6
ESTs 6.6
ESTs 6.6
crutfecysioMnm 6.6
ESTs 6.5
ESTs 6.5
gb:nh30c04.s1 NCI_CGAP_Pr3 Homo sapiens 6.5
ESTs 6.4
hypothetical protein FLJ10902 6.4
Putative prostate cancer tumor suppresso 6.4
ESTs 6.4
ESTs 6.4
ESTs 6.4
hypothetical protein 6.4
hypothetical protein FLJ21062 6.4
hypothetical protein MGC1 0G5B 6.4
hydroxysteroid (11-beta) dehydrogenase 1 6.3
selectin E (endothelial adhesion molecul 6.3
KIAA1223 protein 6.3
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452242 
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410141 


R07775 


Hs.287657 


41B019 


R68911 
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450B13 


AI739625 


Hs.203376 


419239 


AA4681B3 


Ks.184596 


411096 


U8C034 


Hs^85B3 


4516B4 


AF216751 


Hs-26813 


427674 


NM_Q03528 
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407275 
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41795B 
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Hs.193417 


418407 
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435176 
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Hs.189413 



ESTs
Homo sapiens thymic stromal lymphopoieti
ESTs
ESTs, Weakly similar to sltefnalhre), sp
hypothetical protein FLJ13197
Human clone 23948 mRNA sequence
ESTs
hypothetical protein FLJ00052
ESTs
ESTs
ESTs
Homo sapiens cDNA: FLJ21122 fis, clone C
ESTs
protein kinase C, iota
ESTs
ESTs, Moderately similar to ALU'/^HUMAN a
ESTs
KIAA1435 protein
presenilin 2 (Alzheimer disease 4)
ESTs
zinc finger protein 7 (KOX 4, clone HF.1
ESTs
ESTs
leptin receptor
Homo sapiens mRNA; cDNA DKFZp727C191 (fr
eukaryotic translation initiation factor
ESTs
ESTs
Homo sapiens mRNA; cDNA DKFZp434P228 (fr
polycythemia rubra vera 1; cell surface
hypothetical protein FLJ20544
ESTs

solute carrier family, member 4
Homo sapiens mRNA full length insert cDN
ESTs
ESTs
ESTs
ESTs
CGI-147 protein
Novel human gene mapping to chromosome 20
folate hydrolase (prostate-specific memb
ESTs, Weakly similar to TS6A RAT TESTIS
ESTs
WM_02249dHomo sapiens hypothetical pro
hypothetical protein FLJ22313
Bell (S. cerevisiae) homolog
C8001409^l|7441226|p1r||S312l2colagB
ESTs
NM_021965*:Homo sapiens phosphoglucomuta
ESTs, Weakly similar to ALU7_HUMAN ALU S
ESTs, Weakly similar to ALU7_HUMAN ALU S
cadherin-like protein VR20
Homo sapiens cDNA FLJ12168 fis, clone MA
ESTs
gb:RC1-DT0001-031299-011-a11 DT0001 Homo
ESTs
ESTs
ESTs
sterol-C5-desaturase (fungal ERG3, delta
DEAD/H (Asp-Glu-Ala-Asp/His) box polypep
Homo sapiens clone HB-2 mRNA sequence
Homo sapiens cDNA FLJ12292 fis, clone MA
hypothetical protein
hypothetical protein FLJ10583
ESTs
ESTs
ESTs, Moderately similar to Z195_HUMAN Z
ESTs
Homo sapiens clone TCCCTA00151 mRNA sequ
ESTs, Weakly similar to JG1087 RNA heli
glycosyltransferase
Homo sapiens cDNA: FLJ21291 fis, clone C
ESTs
ESTs
Homo sapiens cDNA: FLJ23241 fis, clone C
mitochondrial intermediate peptidase
Homo sapiens cDNA FLJ13752 fis, clone PL 4.4
ESTs, Weakly similar to PC4259 ferritin 4.4
uncharacterized hypothalamus protein HCD 4.4
Homo sapiens cDNA FLJ13097 fis, clone NT 4.4
ESTs, Weakly similar to I38022 hypotheti 4.4
ESTs 4.4
ESTs 4.4
ESTs 4.4
Homo sapiens mRNA; cDNA DKFZp564K143 (fr 4.4
diubiquitin 4.4
ESTs 4.4
nuclear receptor subfamily 4, group A, m 4.4
ESTs 4.4
ESTs, Weakly similar to O4HUD1 debrisoqu 4.4
Homo sapiens cDNA FLJ13090 fis, clone NT 4.4
ESTs 4.4
REV3 (yeast homolog)-like, catalytic sub 4.4
KIAA1573 protein 4.4
lipopolysaccharide specific response 5-1 4.4
ESTs 4.4
serum/glucocorticoid regulated kinase-li 4.3
Homo sapiens mRNA; cDNA DKFZp761K2024 (f 4.3
exostoses (multiple)-like 2 4.3
Homo sapiens mRNA; cDNA DKFZp586Ftfl22 (f 4.3
ESTs 4.3
collagen, type XIII, alpha 1 4.3
hypothetical protein MGC4643 4.3
Homo sapiens, clone IMAGE:3606519, mRNA, 4.3
hypothetical protein dJ551D2.5 4.3
KIAA0241 protein 4.3
chromosome 18 open reading frame 1 4.3
gb:wd73f12.x1 NCI_CGAP_Lu24 Homo sapiens 4.3
hypothetical protein FLJ14007 4.3
transmembrane, prostate androgen induced 4.3
ESTs, Weakly similar to T23762 hypotheti 4.3
ESTs 4.3
hypothetical protein FLJ20136 4.3
coenzyme Q, 7 (rat, yeast) homolog 4.3
hypothetical protein MGC13105 4.3
x001 protein 4.3
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431533 
414312 
438379 
41192B 
414680 
423104 
440B40 
449802 
434B74 
431429 
413492 
413929 
435846 
441523 
460546 
414222 
41644S 
409151 
412707 
442991 
447439 
449885 
4072D4 
450377 
419647 
442049 
430669 
424181 
443S34 
426215 
422634 
445B95 
447669 
407193 
447752 
402555 
443161 
415827 
408161 
418365 
429525 
435401 
446657 
421141 
430335 
435020 
445309 
443547 
410763 
410592 
423693 
428634 
430253 
425211 
443273 
428055 
425707 
451294 
430519 
441765 
431117 
446354 
451468 
409705 
447082 
418594 
426501 
436797 
416288 
419865 
425920 
416858 
452367 
433209 
428B01 
419629 
451061 
420544 
400695 
417173 
419985 
429340 
433759 



AL042613 

AA155694 

N23018 

AA8B8624 

AA743331 

AJ005273 

AW629666 

AW901804 

N62448 

AF072813 

D87470 

U90268 

AA7O0870 

AW514263 

AAD10200 

AL135173 

AL0430O4 

AA306105 

AW206373 

BE281238 

M3135G5 

AW296669 

R41933 

AB033091 

AA348947 

AA310393 

AW969657 

AL039462 

H73972 

N77630 

NH4JM6010 

D29954 

AL049985 

H91679 

M73700 

AID36316 

H17462 

AW952912 

AW014345 

N92540 

R44477 

A1335191 

AW117261 

D8D007 

AW505076 

AL157474 

AW271273 

AF279145 



AA32979S 

AAB11845 

AKD01514 

M18667 

AI042063 

AA420584 

AF115402 

A1457338 

AF129534 

R53790 

AF003522 

AW449850 



8E15B773 

T85314 

A1732D83 

AW043782 

M731491 

H51299 

NMU007020 

AL049S77 

U03272 

U71207 

AB0409Q7 

AW277121 

AB020695 

AW291487 

AA677577 

U61397 
H66373 
N35938 
AA6BO003 



Hs.262476 
Hs.191050 
Hs.171391 
Hs.197289 

Hs.1 23547 

Hs.23984 
Hs.293970 

Hs.75400 

Hs.93810 

Hs.14304 

Hs.301771 

Hs.176551 

HS.87B 

Hs.79337 

Hs.50785 

Hs.1 6443 

Ks.8886 

Hs.145020 

Hs.66095 

Hs.1 40237 

Hs.74313 

Ks.91B16 

Ks.190044 

Hs.291029 

Hs.142517 

Ks.134450 

Hs.13895 

Ks.118821 

HS.13421 

Hs.19180 

Hs.105938 



Hs.23079 



Hs.161690 

H350S353 

Hs.1 0056 

Hs.260702 

Ks.125914 

Hs.239499 

Hs.301855 

Hs.12504 

Hs.23767 

Hs.agea 

H843569 

Hs.1098 

Hs.106290 

Rs.236844 

Hs,1867 

Ks.132156 

Hs.101760 

Hs.1 1713 

Hs.29834 

rts.49210 

Hs.23294 

Hs.250500 

Hs.346335 

Hs.293663 

Hs.213207 

H3.187619 
Hs.293616 
Hs.334477 

hte.93502 
Hs.1 62209 
HS.7B432 
Hs.29279 
Hs.276436 
Hs.254881 
Hs.91662 
Hs.213659 
Hs.98732 

Hs.B1424 
Hs.5856 
Hs.199429 
Hs.109363 



S-adenosylmethionine decarboxylase 1 4.3
ESTs 4.3
C-terminal binding protein 2 4.3
rab3 GTPase-activating protein, non-cata 4.3
hemoglobin, alpha 2 4.3
antigenic determinant of recA protein (m 4.3
ESTs, Weakly similar to S64054 hypotheti 4.3
hypothetical protein FLJ20147 4.3
methylmalonate-semialdehyde dehydrogenas 4.2
reticulon 3 4.2
KIAA0280 protein 4.2
cerebral cavernous malformations 1 4.2
ESTs 4.2
ESTs, Weakly similar to ALUF_HUMAN !!!! 4.2
ESTs 4.2
sorbitol dehydrogenase 4.2
KIAA0135 protein 4.2
SEC22, vesicle trafficking protein (S. c 4.2
Homo sapiens cDNA: FLJ21721 fis, clone C 4.2
hypothetical protein FLJ20424 4.2
ESTs, Weakly similar to KIAA1205 protein 4.2
ESTs 4.2
ESTs, Weakly similar to ALU1_HUMAN ALU S 4.2
KIAA1285 protein 4.2
hypothetical protein 4.2
ESTs 4.2
ESTs 4.2
Homo sapiens mRNA; cDNA DKFZp434P0810 (f 4.1
ESTs 4.1
Homo sapiens cDNA FLJ11654 fis, clone HE 4.1
CGI-62 protein 4.1
KIAA0056 protein 4.1
Homo sapiens mRNA; cDNA DKFZp564E122 (fr 4.1
gb:7iK)4a07.s1 Soares fetal liver spleen 4.1
lactotransferrin 4.1
NM_001839*:Homo sapiens calponin 3, acid 4.1
gb:otfJ8c08.x1 Soares_total_fetus_Nb2HF8_ 4.1
ESTs 4.1
hypothetical protein MGC3032 4.1
ESTs 4.1
ectonucleoside triphosphate diphosphohyd 4.1
hypothetical protein FLJ14621 4.1
ESTs, Weakly similar to 2109260A B cell 4.1
ESTs 4.1
KIAA0185 protein 4.1
DiGeorge syndrome critical region gene 6 4.1
likely ortholog of mouse Arkadia 4.1
hypothetical protein FLJ12666 4.1
hypothetical protein FLJ21776 4.1
ESTs 4.1
DKFZp434J1613 protein 4.1
Kelch motif containing protein 4.1
hypothetical protein FLJ10652 4.1
progastricsin (pepsinogen C) 4.0
ESTs 4.0
ESTs 4.0
E74-like factor 5 (ets domain transcript 4.0
ESTs 4.0
F-box only protein 4 4.0
hypothetical protein FLJ14393 4.0
delta (Drosophila)-like 1 4.0
ESTs 4.0
ESTs, Moderately similar to I38022 hypot 4.0
ESTs 4.0
thioredoxin-like 4.0
ESTs 4.0
ESTs 4.0
hypothetical protein MGC14879 4.0
gb:yp07o06.s1 Soares breast 3NbHBst Homo 4.0
U1-snRNP binding protein homolog (70kD) 4.0
claudin 8 4.0
fibrillin 2 (congenital contractural ara 4.0
eyes absent (Drosophila) homolog 2 4.0
KIAA1474 protein 4.0
ESTs 4.0
KIAA0888 protein 4.0
ESTs, Weakly similar to KIAA1357 protei 3.9
Homo sapiens Chromosome 16 BAC clone CIT 3.9
C110Q25l4*:giI112a0151|ptl|Ea27S6beta- 3.9
ubiquitin-like 1 (sentrin) 3.9
ESTs, Highly similar to dA393J16l3 [Hsa 3.9
Homo sapiens mRNA; cDNA DKFZp434M22T6 (f 3.9
Homo sapiens cDNA: FLJ23GG3 fis, clone L 3.9
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428862 


NMJDC0346 


Hs.2316 


413450 


Z99716 


Hs.75372 


427615 


BE410107 


Hs.179617 


450649 


NM_001429 


Hs.25272 


44579B 


NMJM2421 


Hs.13321 


4161S8 


H27332 


Hs.99598 


431147 


AI767751 


HS.203DG 


439192 


AW970536 


Hs.105413 


446157 


BE270828 


Hs.131740 


45t90C 


AB0231S9 


HS.27207 


42OB02 


U22376 


Hs.1334 


4260B3 


AW962712 


Hs.126712 


442320 


AI287817 


Hs.t29635 


447541 


AKO002fi3 


Hs.18800 


444635 


796667 


Hs.17B77 


432103 


T15803 


KS.27245S 


412576 


AA447718 


Hs.107057 


423952 


AW877787 


Hs.136102 


437916 


BE566249 


HS.20S99 


442242 


AV647908 


Hs.90424 


447280 


BE617907 


Hs.97635 


421750 


AK000768 


Hs.107672 


450580 


N4O0S7 




432359 


AAD76049 


Hs.274415 


429323 


NMJX31649 


Hs.2391 


407332 


AUDI 565 


Hs.200113 


426413 


AA377823 




412652 


A1B01777 


Ks.260024 


426226 


AA769M5 




437816 


AI823445 


Hs.280699 


444S34 


AW271626 


Rs.42294 


459241 


AA032276 


Hs.99010 


408242 


AA251594 


Hs.439t3 


443484 


AJ09145S 


Hs.134559 


4220 S3 


ML0C1141 


Hs.111256 


428043 


T92248 


Hs.2240 


407192 


AAB092OO 




421590 


AF0M715 


Hs.105940 


453020 


AL162039 


Hs.31422 


437372 


AA32396B 


Hs.283531 


432273 


AL137508 


Hs. 274256 


407917 


□63139 


Hs.41587 


408547 


AA574291 


Hs,57fl37 


427635 


BE3979B8 


Hb.179962 


4309BB 


AW972030 




452234 


AW084176 


Ks.223296 


453696 


AA037615 


Hs.42746 


40B875 


NMJJ15434 


Hs.48604 


431197 


AL036596 


Hs.250745 


434072 


H70854 


Hs.283059 


430138 


R98299 


H3.177502 


447906 


ALO50062 


Hs.19999 


415992 


CO5037 


Hs.145807 


435655 


AW105663 


Hb.6947 


430607 


AW973521 


Ks.247324 


43939B 


AA2B4267 


Hfe.221504 


448515 


H6844? 


Rs,l352B 


445242 


BE1E6475 


H8.21108 


418576 


AW968159 


Hs.269104 


436024 


AIB00041 


Hs.190555 


407783 


AW996872 


HS.172D2B 


436207 


AA334774 


Hs.12845 


442B61 


AA243837 


Hs.577B7 


448207 


AM75490 


H8.170577 


450628 


AW382884 


Hs,204715 


432606 


NR1.002104 


HS.3Q66 


438613 


C05969 


Hs,243l22 


4430(31 


AW134G96 


Hs.49418 


4476C8 


AW205042 


Hs.18955 


400312 


AF263613 


Hs.44198 


412777 


AI335773 


H8270123 


424602 


AK002055 


Hs.151046 


424B82 


AI379461 


Hs.153636 


425898 


AA365649 


Ks269478 


443323 


BE560621 


Rs.9222 


452129 


AW291379 


H6212827 


453927 


AAQ82465 


Ha.1 25031 


432336 


NMJK>2759 


Hs 274382 


425988 


BE045S97 


BE .274454 


443123 


AA094538 


Hs.272808 


445467 


AI239832 


Hs.15617 


423855 


AA331761 


Hs.254659 


403790 







SKY {sex detemrtning region YVbox 9 (ca 

N-acdylgalactasBmiradase, atpha- 

CGl-82 protein 

E1A bindmg protein p30O 

rearranged L-myc fusion sequence 

hypotiebcsl protefo MGCS33B 

ESTs 

ESTs 

Homo sapiens cDNA: FLJ22562 fis, clone H
KIAA0982 protein

v-myb avian myeloblastosis viral oncogen
ESTs, Weakly similar to AF191020 1 E2IG5
ESTs

hypothetical protein FLJ20281
ESTs 

protein phosphatase 3 (formerly 2B), cat
ESTs 

KIAA0853 protein 

hypothetical protein FLJ23142

Homo sapiens cDNA: FLJ23286 fis, clone H

ESTs

hypothetical protein FLJ20761
ESTs

Homo sapiens cDNA FLJ10229 fis, clone HE

apical protein, Xenopus laevis-like

Homo sapiens cDNA FLJ11379 fis, clone HE

gb:EST90605 Synovial sarcoma Homo sapien 

ESTs 

gb:DeB0h07^1 NCLCGAP.GCBI Homo sapiens 

ESTs 

ESTs 

ESTs, Moderately similar to T14342 NSD1 

PIGF1 gene product

ESTs 

arachidonate 15-lipoxygenase, second typ
uteroglobin

gb:afl2eQ2.s1 Soares_testis_NHT Homo sap

jerky (mouse) homolog-like

Homo sapiens mRNA; cDNA DKFZp434M229 (fr

hypothetical protein DKFZp547G183

hypothetical protein FLJ23563

RAD50 (S. cerevisiae) homolog

ESTs

tumor protein p53-binding protein
gb:EST384925 MAGE resequences, MAGL Homo
ESTs, Weakly similar to I38022 hypotheti
ESTs 

DKFZP434B18B protein 

polymerase (RNA) III (DNA directed) (62k

Homo sapiens PRO1082 mRNA, complete cds

ESTs 

DKFZP566K023 protein 
hypothetical protein FLJ13593
HSPC069 protein

mitochondrial ribosomal protein S14
ESTs

hypothetical protein FLJ14054

ESTs, Weakly similar to ALU1_HUMAN ALU S

epithelial calcium channel 2, CaT-like A

ESTs

a disintegrin and metalloproteinase doma

hypothetical protein MGC13159

ESTs 

ESTs 

ESTs 

granzyme K (serine protease, granzyme 3;
hypothetical protein FLJ13057 similar to
ESTs

Homo sapiens cDNA FU20W fis, clone KA



ESTs 

hypothetical protein FLJ11193

far upstream element (FUSE) binding prot

ESTs, Weakly similar to PC4259 ferritin

estrogen receptor binding site associate 

ESTs 

choline/ethanolaminephosphotransferase
protein kinase, interferon-inducible dou
ESTs, Weakly similar to I38022 hypotheti
putative transcription regulation nuclea
ESTs, Weakly similar to ALU4_HUMAN ALU S
ESTs

NM_001334*:Homo sapiens cathepsin O (CTS
434470


AA63481B 


HS.29B138 


ESTs 


3.7 


452032 


BE244O05 


Hs.27610 


retinoic acid- and interferon-inducible


3.7 


420026 


A1831190 


Hs.166676 


ESTs 


3.7 


421B38 


AW8B10B9 


Hs.108806 


Homo sapiens mRNA; cDNA DKFZp556M0M7 (f


3.7 


449916 


T60525 


Hs.299221


pyruvate dehydrogenase kinase, isoenzyme


3.7 


462270 


AW975014 


Hs.26 


ferrochelatase (protoporphyria)


3.7 


445941 


AJ267371 


Hs.172635


ESTs 


3.6 


416882 


AI633044 




tryptophanyl tRNA synthetase 2 (mitochon


3.6 


412533 


AA679863 


Hs.69606 


ESTs 


3.6 


431740 


N75450 


Hs.183412


ESTs, Moderately similar to AF116721 67


3.6 


442710 


AI015631 


Hs.23210


ESTs


3.6 


448212 


AI475858 




gb:lc87d07.x1 NCI_CGAP_CLL1 Homo sapiens


3.6 


413431 


AW246428 


Hs.75355


ubiquitin-conjugating enzyme E2N (homolo


3.6 


453D38 


AW292415 


Hs.20509 


HBV pX associated protein-8


3.6 


427661 


AA410292 


Hs.104761 


ESTs 


3.6 


420923 


AF097021 


Hs.273321


differentially expressed in hematopoieti


3.6 


414844 


AA29B874 


Hs.77494 


deoxyguanosine kinase


3.6 


441866 


BE464341 


Hs.21201 


nerin 3; DKFZP566B0B46 protein 


3.6 


40BB15 


AAS33930 


Hs^8B036 


tRNA isopentenylpyrophosphate transfera


3.6 


423482 


BE280172 


Hs.129228


galactokinase 2


3.6 


424677 


U09414 


Hs.151689 


zinc finger protein 137 (clone pHZ-30)


3.6 


430160 


AW968210 


Hs.293957


ESTs, Weakly similar to ALUC_HUMAN !!!!


3.6 


433672 


BE2B1165 


Hs.288033


TLS-associated serine-arginine protein 1


3.6 


433887 


AW204232 


Hs.279522 


ESTs 


3.6 


452670 


AF058227 


Hs.30213 


ceroid-lipofuscinosis, neuronal 5


3.6 




H92738 


Hs_7SB11 


N-acylsphingosine amidohydrolase (acid c


3.6 


432728 


NM„006979 


HSJ7B721 


HLA class II region expressed gene KE4


3.6


431724 


AAS14535 


Hs.283704 


ESTs 


3.6 


435703 


AW630133 


Hs.B3313 


GKDQ3 protein 


3.6 


445674 


BE410347 


Hs.13063 


transcription factor CA150 


3.6 


4141 7B 


AW957372 


Hs.46791 


ESTs, Weakly similar to I38022 hypotheti


3.6 


416111 


AA033813 


Hs.79018 


chromatin assembly factor 1, subunit A (


3.6 


432*40 


AK001403 


Hs.279521 


hypothetical protein FLJ20530


3.6 


439726 


AW449893 


Hs^93707 


ESTs, Weakly similar to I38598 zinc fing


3.6 


445704 


AM93742 


Hs,1677Q0 


ESTs, Moderately similar to I38022 hypot


3.6 


449115 


AW959952 


Hs.3752B 


ESTs, Weakly similar to AF090944 1 PRO06


3.6 


439379 


AA835002 


Hs.125611 


ESTs 


3.6 


414646 


AA353776 


Hs.901


CD48 antigen (B-cell membrane protein)


3.6


434579 


T5555B 




gb:yb35f05.r1 Stratagene fetal spleen (9


3.6 


451357 


AA923729 


HsJ26322 


cell cycle related kinase


3.6 


420255 


AA258124 


H&293878 


ESTs, Moderately similar to ZN91_HUMAN Z


3.6 


431263 


AW129203 


Hs,322915 


ESTs 


3.6 


431952 


Z70696 


Hs.272240 


Homo sapiens cDNA FLJ11086 fis, clone PL


3.6


439092 


AA830149 




gb:oc44fOBji1 NCI_CGAP_GCB1 Homo sapiens


3.6 


411562 


AL050201 


Hs.70769 


hypothetical protein DKFZp5B6E1H23


3.5 


417771 


AAB0469B 


Hs.82547 


retinoic acid receptor responder (tazaro


3.5


419440


AB020689


Hs.90419


KIAA0882 protein


3.5


417333 


AL157545 


Hs.173179 


bromodomain and PHD finger containing, 3


3.5 


419241 


AA523939 


Hs.165259


ESTs 


3.5 


4107B2 


AF226053 


Hs.66170 


HSKM-B protein 


3.5 


413384 


NM_000401


Hs.75334 


exostoses (multiple) 2 


3.5 


424081 


NM_006413


Hs.139120


ribonuclease P (30kD)


3.5 


4280C4 


AA449583 


Hs.161393


glutamate-cysteine ligase, catalytic sub




420390 


AA330047 


Hs.191187 


ESTs 


3.5 


418662 


T25853 


Hs.7538 


ESTs 


3.5 


433260 


AA5B1404 


Hs.289037 


Homo sapiens cDNA FLJ14135 fis, clone MA


3.5 


4332B5 


AW97S944 


Hs.237398 


ESTs 


3.5 


421991 


NM_014918


Hs.1lQ48ft 


KIAA0990 protein 


3.5 


413950 


AA249096 


HsJ2793 


ESTs 


3.5 


43131 B 


AA502700 


Hs^93147 


ESTs, Moderately similar to A46010 X-lin


3.5 


400289 


X07820 


Hs.2258 


matrix metalloproteinase 10 (stromelysin


3.5 


411149 


N68715 


Hs.269128 


ESTs 


3.5 


417601 


NM_014735


Hs.82292


KIAA0215 gene product


3.5 


41B334 


AA319233 


Hs.5521 


ESTs 


3.5 


422583 


AA41O506 


Hs.27973 


KIAA0874 protein


3.5 


425717 


X07282 


Hs.171495 


retinoic acid receptor, beta


3.5


425856 


AA36490B 


Hs.98927


hypothetical protein FLJ13993


3.5


434569 


A1311295 


H3.34447B 


KIAA0196 gene product 


3.5 


419436 


AA991639 


Hs.242413 


hypothetical protein DKFZp434K1421


3.5 


437296 


AA350994 


Hs.20281 


KIAA1700 


3.5 


432697 


AW97505Q 


Hs,293B92 


ESTs, Weakly similar to ALU4_HUMAN ALU S


3.5 


439605 


Af 08 6431 


Hs.223B0 


ESTs 


3.5 


442145 


AI022650 


Hs.8117 


erbb2-interacting protein ERBIN


3.5


432589 


AU35725 


Hs.131708 


ESTs 


3.5 


425910 


AAB30797 


Hs.184760


CCAAT-box-binding transcription factor


3.5


412095 


AI824707 


Hs.5921


Homo sapiens cDNA: FLJ21592 fis, clone C


3.5 


421129 


BE439899 


Hs.89271 


ESTs 


3.5 


429433 


AA452899 


H3.2135B6 


ESTs, Weakly similar to KIAA1353 protein


3.5 


434839 


A1743069 


Hs.1 34735 


ESTs 


3.5


436166 


A1391470 


Hs.1 58816 


ESTs 


3.5 


448966 


AW372914 


Hs.86149 


phosphoinositol 3-phosphate-binding prot


3.5 


449318 


AW236021 


Hs.78531


Homo sapiens, Similar to RIKEN cDNA 5730


3.5


453311 


AW104911 


Hs.126707 


hypothetical protein FLJ11457


3.5 
456497
41378G 
430935 
439904 
449909 
445S17 
419594 
417251 
423545 
438527 
453916 
430200 
4371 B7 
423645 
432370 



404571 
419727 
436374 
452721 
427472 
418245 
402031 
407137 
449603 
425592 
436605 
437367 
407908 
430144 
416292 
423800 



429084 
432886 
408267 
406832 
422564 
424842 
429984 
435256 
436137 
440348 
442910 
443242 
445469 
44S517 
425354 
433037 
423044 
434126 
449845 



440370 
438825 
446874 
408101 
419131 
423178 
433764 
440653 
411558 
452615 
421234 
424075 
412045 
420041 
426174 
427083 
441021 



441124 
444169 
421476 
422165 



450747 
401197 
433404 
422546 
443834 
414341 



AW967955 Hs.123648 ESTs, Weakly similar to AF108460 1 ubinu 3.5

AW613780 Hs.13500 ESTs 3.5

AW072316 zinc finger protein 131 (clone pHZ-10) 3.5

AWB92676 gb:CM3-NN0HM-28TJ30O.131-c12 NN0004 Homo 3.5

AA004681 Hs.59432 ESTs 3.5

NM_003642 Hs.13340 histone acetyltransferase 1 3.5

AA013051 Hs.91417 topoisomerase (DNA) II binding protein 3.5

AW015242 Hs.99488 ESTs, Weakly similar to YKS4_YEAST HYPOT 3.5

AP000692 Hs.129781 chromosome 21 open reading frame 5 3.5

AJ969251 Hs.115325 RAB7, member RAS oncogene family-like 1 3.5

AW974874 Hs.75212 ornithine decarboxylase 1 3.5

BE613337 Hs.234896 geminin 3.4

AL080208 Hs.306325 Homo sapiens mRNA; cDNA DKFZp586C1523 (f 3.4

AI215632 Hs.147487 ESTs 3.4

AA308334 Hs.274424 N-acetylneuraminic acid phosphate syntha 3.4

AA6574S4 gb:rrt66f04.s1 NCLCGAPJ^ Homo sapiens 3.4

NM_015902*:Homo sapiens progestin induce 3.4

AA227609 Hs.94834 ESTs 3.4

AA400709 Hs.96716 ESTs, Weakly similar to T17210 hypotheti 3.4

AJ269529 Hs.301&71 solute carrier family 37 (glycerol-3-pho 3.4

AA522539 Hs.131250 transposon-derived Buster3 transposase-l 3.4

AA088767 Hs.83883 transmembrane, prostate androgen induced 3.4

ENSP00000251 OSFiPlasma membrane calcium 3.4 

T97307 gb:ye53h05.s1 Soares fetal liver spleen 3.4

AI665662 Hs.197698 ESTs 3.4

D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 3.4

AA73f533 Hs^7075l ESTs 3.4

AA749316 Hs.271879 ESTs, Moderately similar to ALU1_HUMAN A 3.4

BE37975B Hs.110853 uncharacterized hematopoietic stem/proge 3.4

AI732722 Hs.187694 ERGL protein; ERGIC-53-like protein 3.4

AA179233 Hs.42390 nasopharyngeal carcinoma susceptibility 3.4

AA331153 gb:EST35034 Embryo, 6 week, subtracted ( 3.4

AA6B9526 Hs.344249 steroid dehydrogenase homolog 3.4

AJ001443 Hs.195614 splicing factor 3b, subunit 3, 130kD 3.4

BE15902B Hs.279704 chromatin accessibility complex 1 3.4

AW380525 Hs.343564 tubulin-specific chaperone a 3.4

AW06S6S0 Hs.63428 ESTs, Weakly similar to Z195_HUMAN ZINC 3.4

AI148006 Hs.222120 ESTs 3.4

AA034127 Hs.153437 signal transducing adaptor molecule (SH3 3.4

AL050102 Hs.227209 hypothetical protein FLJ21617 3.4

AF193766 Hs.13872 cytokine-like protein C17 3.4

AI056769 Hs.133512 ESTs 3.4

AW015802 Hs.47023 ESTs 3.4

AI365130 Hs.11307 ESTs, Weakly similar to T19326 hypotheti 3.4

BE243910 Hs.9082 nucleoporin p54 3.4

AW298370 Hs.153714 complement-c1q tumor necrosis factor-rel 3.4

AW500105 Hs.23643 serine/threonine protein kinase MASK 3.4

U62027 Hs.155935 complement component 3a receptor 1 3.4

NM_014156 Hs.279938 HSPC067 protein 3.4

AA320829 Hs.97266 protocadherin 18 3.4

W93170 Hs.284164 protein x 0004 3.4

AW9711B3 Hs.9603 DnaJ (Hsp40) homolog, subfamily C, membe 3.4

AW952412 HS.G5874 ESTs, Weakly similar to A40348 Efev/Sex- 3.4

AA8B40D0 Hs.8173 hypothetical protein FLJ10803 3.4

BE327427 Hs.79953 ESTs 3.4

AW96B3W HSJ6156 ESTs 3.4

AW988504 Hs.123073 CDC2-related protein kinase 7 3.4

AA4062B3 Hs.41167 ESTs 3.4

AI033140 Hs.124983 Homo sapiens mRNA; cDNA DKFZp564C142 (fr 3.4

AW753676 Hs.39982 ESTs 3.4

H29142 Hs.143032 ESTs, Weakly similar to neuronal thread 3.4

AA102570 Hs.70725 gamma-aminobutyric acid (GABA) A recepto 3.3

AA418841 obawOlalUISoafesJihHMPicSIHbmosapI 3.3

AA907153 Hs.190080 ESTs 3.3

AI807320 Hs.227630 RE1-silencing transcription factor 3.3

AAD99B02 Hs.83883 transmembrane, prostate androgen induced 3.3

AB005142 Hs.94592 Klotho 3.3

AA5479S9 HS.115B3B ESTs 3.3

NM_005363 Hs.173497 Sec23 (S. cerevisiae) homolog B 3.3

AW57B716 Hs.7644 H1 histone family, member 2 3.3

AW197887 HSL253353 ESTs 3.3

T97717 Hs.119563 ESTs 3.3

AV64B170 Hs^8756 ESTs 3.3

AW953805 Hs^1BB7 ESTs 3.3

AUJ41199 Hs.14B1 histidine decarboxylase 3.3

BE160198 gb:QV1-mD4?34)1(J2O0-059-hO3HT0413rlomo 3.3

At064821 H6.31B635 ESTs, Highly similar to 1818357A EWS gen 3.3

ENSP00000229263*:HSPC213. 3.3

T32982 Hs.102720 ESTs 3.3

AB007969 Hs.301478 KIAA0500 protein 3.3

N20517 Hs.194397 leptin receptor 3.3

D80004 Hs.75909 KIAA0182 protein 3.3

NM_005824 Hs.155545 37 kDa leucine-rich repeat (LRR) protein 3.3
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4144D7 
420324 
423453 
444170 
445474 
450582 
451246 
434194 
444324 
427581 
429258 
400880 
445525 
451 968 
411190 
410660 
420493 
423740 
428523 
42B743 
432363 
435102 
438147 
44580B 
453058 
453286 
453802 
422017 
411373 
435114 
443674 
427528 



421654 
409542 
410119 
417379 
423201 
442787 
407930 
423427 
424903 
426775 
432378 
43B875 



453124 
417560 
433002 
430929 
437444 
446207 
407870 
429377 
426030 
420583 
4252G4 
442562 
418871 
421029 
452997 
435035 
408412 
421684 
421687 
432409 
434378 
409401 
420195 
41Q264 
410296 
41383B 
414342 
414709 
425475 
437814 
449897 
452295 
437517 
432960 
452499 
427299 



AA147025

AR63474 

AW460737 

AW613879 

AI240014 

AI339732 

AW189232 

AF119847 

AI301330

NM_014788

M44B765 

BE149B66 

H56196 

AA305342 

AI06111B 

AI635113 

Y07701 

AW974540 

AL080060

AA534489 

AWB99053 

AW250553 

AV655234 

AWG12293 

AA034319 

AL134767 

NM_G03B77 

8E32B276 

AA775483 

AIOB1330 

AU077143 

AI034361 

A1B36310 

AW1632B7 

AA503020

F07841 

AA 1963 DO 

NM_000163

W93048 

AA045847 

AL137612 

T26477 

AA384564 

AJ493046 

AA827640 

AI933416 

AI139058 

U73338 

AF046730

AA489166 

H46008

A1346930 

AB032990 

AA813192 

BE243933 

H77859 

AA353953 

BE379584 

NM_ooieoa 

AW0577B2 

N64777 

BE568487 

AW193Q33 

BE281591 

AW1S1450 

AA80653S 

AA631739 

A12Q1895 

N44348 

AK001853

AW630675 

AV6S1185 

AA742181 

M704703 

W56339 

AJ0B8192 

AW819642 

BE379936 

AI927675 

AW150945 

AK000101 

AA830210 



Hs.76704 
HSS6744 
Hs.128791 
Hs.1 02408 
Hs.259558 

HsJ9140 
Hs.2B3940 
Hs.143638
Hs.1797113 



Hs.14831

Hs.26409 

Hs.69171 

Hs.65328 

Hs.270366 

Hs.293007 

Hs.98626 

Hs.301549 

Hs.76917 



Hs.288684 
Hs.29041 

Hs.110776

Hs.8B61 

HsJ2B8936 

Hs.t450Q8 

Hs.179565 

Hs.135150 

Hs.28310 

Hs.106469 

Hs.36563 

Hs.13926

Hs.125180 
Hs.250723 
Hs.168361 
H5.2B5B48 
H8-22BB3 

Hs.146133

HtU8S059 

Hs.189674 

Hs.125790 

Hs.82283 

Hs579909 

Hs.156933 

Hs.31518

Hs.149728

Hs.40719 

Hs,2Q05S8 

Hs.108642 

Hs.65450 

Hs.20369

Hs.1209 

Hs,293D53 

Hs.44656 

HS/4766B 

lfc.124436 

Hs.105768 

Hs.109201

Hs.130732 

Hs.335440 

Hs.161309 

Hs.26243 

Hb.615Q8 

Hs.271946

Hs.76574 

H&.7S912 

Hs.77031 

Hs.107057

Hs.135474 

Hs.24135 

Hs.28866 

Hs.244482 

Hs.144758

Hs.29700 

Hs.214283 



ESTs 3.3 

prostate androgen-regulated transcript 1 3.3

CGI-09 protein 3.3

ESTs 3.3

ESTs 3.3

G-rich RNA sequence binding factor 1 3.3

cutaneous T-cell lymphoma tumor antigen 3.3

Homo sapiens PRO1550 mRNA, partial cds 3.3

ESTs 3.3

KIAA0129 gene product 3.3

gb:zx10e09.r1 Soares_total_fetus_Nb2HF8_ 3.3

NM_000611*:Homo sapiens CD59 antigen p18 3.3

Homo sapiens, Similar to zinc finger pro 3.3

Homo sapiens mRNA; cDNA DKFZp547K204 (fr 3.3

protein kinase C-like 2 3.3

Fanconi anemia, complementation group F 3.3

ESTs, Weakly similar to I78885 serine/th 3.3

aminopeptidase puromycin sensitive 3.3

ESTs 3.3

Homo sapiens mRNA; cDNA DKFZp564H172 (fr 3.3

gb:m76g11.s1 NCI_CGAP_Co3 Homo sapiens 3.3

F-box only protein 8 3.3

H^KbHidmgfactor-2 3.3

ESTs, Moderately similar to PC4259 ferri 3.3

Homo sapiens cDNA FLJ11750 fis, clone HE 3.3

Homo sapiens cDNA FLJ14177 fis, clone NT 3.3

gb:DKFZp547GlOS0_n 547 (synonym: hfbrl) 3.3

STAT induced STAT inhibitor-2 3.3

ESTs 3.3

mitochondrial ribosomal protein L9 3.3

ESTs 3.3

minichromosome maintenance deficient (S. 3.3

lung type-I cell membrane-associated gly 3.3

ESTs 3.3

suppressor of var1 (S. cerevisiae) 3-like 3.3

hypothetical protein FLJ22418 3.3

ESTs 3.3

gb:zp99b10.s1 Stratagene muscle 937209 H 3.3

growth hormone receptor 3.3

hypothetical protein MGC2747 3.3

Homo sapiens cDNA FLJ12807 fis, clone NT 3.3

KIAA1454 protein 3.3

ESTs, Weakly similar to I38022 hypotheti 3.3

ESTs 3.3

ESTs 3.3

ESTs 3.3

ESTs 3.3

leucine-rich repeat-containing 2 3.3

5-methyltetrahydrofolate-homocysteine me 3.2

cyclmTt 3.2 

ESTs 3.2 

ESTs 3.2 

ESTs 3.2 

hypothetical protein KIAA1164 3.2

KIAA0547 gene product 3.2

zinc finger protein 22 (KOX 15) 3.2

roScutorM 3.2

ESTs, Weakly similar to gonadotropin ind 3.2

dolichyl-diphosphooligosaccharide-protei 3.2

acyl-Coenzyme A dehydrogenase, long chai 3.2

ESTs 3.2

ESTs 3.2

x 006 protein 3.2

ESTs 3.2

hypothetical protein FLJ10511 3.2

CGI-86 protein 3.2

KIAA1575 protein 3.2

EST 3.2 

proteasome (prosome, macropain) subunit, 3.2

Homo sapiens cDNA FLJ11177 fis, clone PL 3.2

Homo sapiens cDNA FLJ10991 fis, clone PL 3.2

ESTs 3.2

mitochondrial ribosomal protein L19 3.2

KIAA0257 protein 3.2

Sp2 transcription factor 3.2

ESTs 3.2

ESTs, Weakly similar to DDX9_HUMAN ATP-D 3.2

transmembrane protein vezatin; hypotheti 3.2

programmed cell death 10 3.2

Homo sapiens, clone IMAGE361 171ft mRNA, 3.2

ESTs 3.2

hypothetical protein FU200M 3.2

ESTs, Moderately similar to ALU1_HUMAN A 3.2
A A 07 J 4 *YT

AAy/4127 


Hs.129777 


ESTs 


3.2 


MC7AOQ 


alRcnmc 


Lin 11?<70 


cryptic gene


3.2 


415691 


AWbtwy/y 




bbiS 




433B52 




Ha. l<OP.ciJ 


ESTs 


"K 0 


439735 




Ur U^DiC 

ns.i*i/iwo 


hypothetical protein


•4 "5 
j7 




AA/C-CTIft 
AA^CM iU 




CO*fj> lAArwJUI.i nimiku In. A JI7£01 Q . P n« 

bb is, weawysimitef 10 A4robV tl-ceB gr 


■> 1 


427715 


BF245374 


no. low-to 


KIAA11R1 nmfoln. 
rvAM \ 10 | proi«f> 


o,i 


408699 


AAl/uDO I'* 




CQTn Mnrfapat aIu eVrJIor In. TMQ1 U'l I LA A M 7 

tb is, MoaerBuH/ similar io iNy i w humaiN £ 




415715 


F 30364 




Cola 


o.Z 


428172 


mtf/ IGUf 




cois 


J./ 




RD rots 


nS. IZDZof 


zinc finger protein ZNF140-like protein






mwouy&o 


lit. 1 ^ ryj G 1 


ESTs 


3.2 


1 


AinnTticQ 


nS./yUoo 


ESTs 


3^ 




AVA/OTI'OTk 




CDTV. 

bblS 


3.2 


ATM *>R 






Homo sapiens cDNA FLJ12300 fis, clone MA


3.2 




NM_UU03f 7 


ris«i7lyiij 


sema domain, immunoglobulin domain (ig), 


3^ 








Target Exon 


3.2 


434747 


rVujOl UtKJ 




to IS 


3.2 


414591 


AlflRftJOn 




rpT. IflnnnJInlif olmllw In. A> 1 IO Ul Ik JA kl Al 1 1 O 

fcb 1 S. VlwaKiy Similar i0 ALUoJiUMAN ALU b 


3.2 


4/caJ!)1 


AKUUIaJb 




Homo sapiens cDNA FLJ10674 fis, clone NT


3.2


426171 


All 2e 506 


Ha.6557 


zinc finger protein 161 


3.2 








NM_001397:Homo sapiens endothelin conver


3.2 


4l£b3U 


AA7COCO 


Hs.2bB273 


hypothetical protein FLJ13346


3.2




AJU7U33U 


Li- ico4>t(i 


HOfTlO Sapiens CUNA: rUZIDDOira, ooiifl c 


3.2 


439221 


AA7^71flfi 


He MOOA 


CCTrv nlnrlaralnV. flmlLir In. I7QAQ£ nrvrln 

Lbi s, Mooficaiaiy similar to if dooo senn 


3.2 




MI UJ JjJJ 


Ur T17& 


KAiJi i;b, pomDej nomaog 


34? 


4*KMiO 


1199/ * 




Homo sapiens clone 24407 mRNA sequence


3.2 


44656B 


AA149121 


Hs.71947 


ESTs 


3.2 


420131 


FU32d6 


Hs.95262 


nuclear factor related to kappa B bindin


3.1 


4907(10 




nS.d4r4Ua 


hypothetical protein MGC13102


3.1 


<tUralJ 




KS.4lM» 


phosphatidic acid phosphatase type 2A


3.1 


415009 


C75253 


H5.22D95D 


ESTs 


3.1 


AnfiR?7 
4U00jii 




ns.lbJ/cU 


ESTs 


3.1 


438656 


aua*i4 4/ fti 

AW014493 


HS.12S727 


ESTs


3.1 


419675 


AAHS341U 


Hs.93557 


proenkephalin


3.1 


'KXnDU 






solute carrier family 30 (zinc transport


3.1 


418701 


nAO 1H3'KJ 


tic: Qrl^n 


CCT» WanUu .Imllv In &| ) l/H LN IMAM llll 

bo is, weaxjy similar baluu^uman us 


3.1 


4iJ!W/1 


kill fH«\Of> 


HS. 1 54424 


deiodinase, iodothyronine, type II


3.1 




raja/nu 


his. a toy 


Homo sapiens, Similar to RIKEN cDNA 5830


3.1




AA-<VIQ%43 
AAOOa^yfO 


VXtr 104 4C 

na.zillQ 


CDC14 (cell division cycle 14, S. cerevi


3.1 




rtAW//ID 


rlS.17oDQ4 


ESTs 


3.1 


422662 


BEZ74778 


Hs.1 19007 


RAB4, member RAS oncogene family 


3.1 


449543 


ArU 70632 


HsJ23729 


Homo sapiens clone 24405 mRNA sequence


3.1 


437083 


AWUB2597 


k_l_ O X jfDjTM> 

H 3.244 B62 


ESTs 


3.1 


4*vtoqa 


Al5"4R77R 
nlOJDnlcD 




guanine nucleotide binding protein 4


3.1


41 fJBU 




nS.JAflJoo 


ESTs 


3,1 


419865 


H16382 


Ha .7025 8 


ESTs 


3.1 


4300(39 


BE253Q12 


Hs. 153400 


ESTs, Weakly similar to ALU1_HUMAN ALU S


3.1


UU1GO 


At flAtyiJO 
AUU4?44< 


Lie 0^jl70il 


U__-. ,__»__ _r>|j . _•-«_» a r^isw—*— rn imm /*_ 

namo sapiens mRNA; cCNA OKrZp5546Uo3 rjr 


11 


440140 


Al^JiTcao 
ALJtJ/O JO 


U- 4j|DA7D 


ESTs 


11 


426547 


AAZ43464 


HSJ234101 


pre-B-cell leukemia transcription factor


3.1 


418555 


AM-17215 


HSj87159 


hypothetical protein FLJ12577


3.1 


453496 


AA442103 


H&330B4 


solute carrier family 2 (facilitated glu


3.1 


4^0 47 




rlS.l 03941 


VIA A-|ajq 

NAA1U40 protein 


3.1 


43103 6 


aiaHoiaqq 
AYV19ZQad 


H8.290855 


ESTs 


3,1 


4477-iD 


ALldTBJo 


rt3.I93G9 


matrilin 2


3.1 


415068 


Z19448 


Ha. 131887 


ESTs, Weakly similar to T24396 hypotheti


3.1


4W4J3 


A lb/ /43a 


Llr> QOftTfi 


guanine nucleotide binding protein 4


3.1


44J301 


AILKXiUiO 


L|_ 4 J (Kioc 


ESTs 


3.1 




AW IO/D.X} 


nS.n^loa 


hypothetical protein FLJ13749


3.1 


4143 7jC 


AA14do54 




gb3o65a02.r1 Slratagene pancreas {93720 


3.1 


4Z/4Z1 


AA4lK4i4 




coatomer protein complex, subunit beta


3.1 






JJ. 4CTT7 


ESTs 


3.1 






Up 

rtS.lTOi4 


tolS 


11 


4*»/r 


AVVU07UB0 


Uu nn jinn 

nSr284192 


ESTs 


3,1 


433730 




Hs.3542 


hypothetical protein FLJ11273


3.1 


436894 


HB0696 


Hs 2 3331 3 


ESTs 


3.1


437756 


AA767537 


Hs. 197096 


ESTs 


3.1


4doff79 


AW976218 


Hs-32565 


ESTs 


3.1 


441791 


AW372449 


Hs.61271 


hypothetical protein FLJ21159


3.1 


457716 


F18572 


Uc 90Q7A 


ESTs, Weakly similar to ALU4_HUMAN ALU S


3.1


452017 


AF1C9302 


Hs.27495 


prostate cancer associated protein 7


3.1


419030 


T79957 


Hs. 188456 


ESTs 


3.1


430308 


AI343469 


KS.1276B5 


KIAA1527 protein 


3.1


433571 


AA7B525S 


Hs.135191 


ESTs, Weakly similar to unnamed protein


3.1


421645 


AF189723 


Hs.10677B 


ATPase, Ca transporting, type 2C, member


3.1


428B7B 


AA436884 


Hs.48926 


ESTs 


3.1


432205 


A16065B3 


Hs.125291 


ESTs 


3.1 


440093 


A1080053 


Hs£909 


DKFZP564G202 protein 


3.1 


428259 


AA424793 


Hs^4144 


ESTs 


3.1 


434614 


AJ249502 


Ha^96B9 


ESTs 


3.1 


420330 


AA640891 


rls.102406 


ESTs 


3.1 


433323 


AA805132 


Ms.159t42 


ESTs 


3.1 






WO 03/042661 



PCT/US02/3681© 






439544 


W26354 


HS.2B89I 


41 Aioo 


A|o«9496 


Hs.108932




AF129755


HS.B6474 


400684 






436354 


AJH7SZ52 


Hs5151 






Vls.2055 






Hs.64794 




N59650 


HsJ27252 




AAZ2S19B 






AKIOOlUO 


HS.25615D 




AA037675 


Hs.152675




AA813587 


H&.18B173 


4D8705 


AAJ12135 


Hs.46967 


4ou2oO 


AA36125B 


Hs*23786B 


^ Jem < ck 


T53519 


H&334692 






Hs.1 3220a 




AA114016 


Hs.75746 




map mo rri7 

AW29B597 


Hs.61884 


42278ft 


AL117362


Hs.77196 


439703 


AF06653a 


Hs.196245 


AAAiDti 

4444 B9 


An 51 010 


Hs.157774


453876 


AW964440 


Hs.19025 


44U193 


AW902312 


Hs.7037 


4336 B0 


A1805366 


Hs.199945


4<yjoo3 


M5B524 


H 3.2025 


447595 


AW37913C 


Hs.16953


4jflDU44 


a Acnunn 
AAaUZ49u 


Hs.170290


431962 


ALQ49385 


Hs.272251 


445084 


H36914 


Hs 25084a 


452737 


AKC01680 


Hs*3048B 


458229 


AI929602 


Hs.177 


435712 


AA694607 


H&.176956 


40737B 


AA299264 


Hs£7776 


414993 


AWB19403 


Hs.77724 


419469 


AW291128 


H&276422 


421986 


AIIMrnini 

AW4504B1 


Hs.161333


432833 


N51075 


Hs/110028 


445210 


H09323 


Hs.27133 


447620 


AW290951 


Hs.224965 


449375 


R07114 


Hs.271224 


426695 


AI355647 


Hs.169999


443967 


AW294013 


Hs200942 


441224 


AU076964 


Hs.7753 


448822 


BE149B45 


Hs.289036 


435698 


H72286 


Hs.128387


441889 


A109O455 


Hs.26B371 


411704 


AM99220 


Hs.71573 


415954 


AA171B50 


Hs.42251 


419544 


A1909154 




420077 


AW512260 


Hs.87767 


443475 


A1066470 


Hs.132609 


447231 


AK001293 


Hs.330206 


438436 


AA607168 


Hs.271552 


427671 


AW9924D5 


Hs.59622 


443280 


AA29S6B8 


Hs.24183 


448264- 


A1478933 


Hs.188260 


42QS73 


AW601325 


Hs,337757 


453543 


D25215 


Hs.35804 




AA242758 


Hs.79136 


445943 


AW89B533 


Hs.181574 


432200 


AW327432 


HS.255B43 


431707 


R21326 


Hs.267905 


432675 


AI791855 


Hs.105B84 








44B073 


W19769 


Hs.336635 


419713 


AW96B05B 


H8.923B1 


447966 


AA340605 


Hs. 105667 


423553 


AL122D55 


HS.129B36 


418304 


AA215702 




417206 


AA291183 


Hs,81648 


410011 


AB02OG41 


Hs.57656 


411B50 


AK002033 


Hs.727B2 


436966 


AF085888 


H 5 , 269307 


445921 


AW015211 


Hs.146161 


447124 


AW976438 


Hs.17428 


452953 


AI932B84 


Hs.271741 



hypothetical protein FLJ11360; artemis p

ESTs

ESTs

NM_002425:Homo sapiens matrix metallopro
RAN binding protein 7
ubiquitin-activating enzyme E1 (A1S9T an
zinc finger protein 183 (RING finger, C3
ESTs

gb:nc26a07.s1 NCI_CGAP_Pr1 Homo sapiens
Homo sapiens, Similar to RIKEN cDNA 2610
ESTs

Homo sapiens cDNA FLJ12187 fis, clone MA

HSPC034 protein

interleukin 7 receptor

hypothetical protein MGC14141

ESTs

aldehyde dehydrogenase 1 family, member

Homo sapiens, clone IMAGE:4296026, mRNA,

Human DNA sequence from clone RP5-876B10

ESTs 

ESTs 

DC32 

Homo sapiens clone 24923 mRNA sequence
ESTs

transforming growth factor, beta 3
phosphodiesterase 9A
ESTs

Homo sapiens mRNA; cDNA DKFZp5&6M1418 (f
hypothetical protein Flj1476t
DKFZP434F091 protein
phosphatidylinositol glycan, class H
ESTs

ESTs, Moderately similar to I38022 hypot

KIAA0586 gene product

DKFZP586G1122 protein

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

purinergic receptor (family A group 5)
ESTs 



hypothetical protein MGC4126 
ESTs 

hypothetical protein FLJ20274
hypothetical protein FLJ10074
ESTs 

gkQE/-BT2O(M>1O499-O07 BT200 Homo sapten 

ESTs 

ESTs 

crystallin, zeta (quinone reductase)-lik
ESTs

Homo sapiens, clone IMAGED 507281, mRNA,

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp566M063 (fr

hect domain and RLD 3

UV-1 protein, estrogen regulated 

ESTs

ESTs 

hypothetical protein FLJ10422
ESTs 

DKFZP434G032 protein 
Homo sapiens, clone IMAGE:4179482, mRNA
nudix (nucleoside diphosphate linked moi
ESTs, Weakly similar to Homolog of rat Z
KIAA1028 protein

gbxr97g10.rl NCLCGAP_GC8t Homo sapiens 

hypothetical protein FLJ11021 similar to

PFTAIRE protein kinase 1

hypothetical protein FLJ11171

ESTs 

ESTs 

RBP1-like protein

ESTs, Weakly similar to A46010 X-linked



3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1
3.1 
3.0
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 

3.0

3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 

3.0

3.0 
3.0 
3.0 
3.0 

3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0 






TABLE 60B 



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey 

411479 

411667 
414372 
414680 

415989 
416286 
4168B2 

416913 
417379 
41B304 
418647 
419868 
419536 
419544 
423412 
423800 
426226 
426413 
426503 
426775 

426991 
428342 

429163 
429220 
429258 
430935 
43096B 
431429 



432093 
432125 
432169 
432340 
432363 
432600 
43261D 

434579 
434966 
435023 
437666 



438147 
439092 
439518 
439904 
440B40 

441102 
442562 



443161 
445808 
447082 
448212 
449625 



450582 
452260 



452815 
453802 
455100 



CAT Number 

1247077.1 

1253334J 
143909.1 
147525J 

1 56454.1 

1585983J 

162718J 

163001.1 
167238J 
173658.2 
177521.1 
179788J 
1&56S5J 
185760.2 
228001.1 
23216U 
262918.1 
266650.1 
2682B3_1 
271GB3J 

27415.1 
290035.2 

300543J 
301384.1 
301917J 
325772.1 
326269.1 
33313.1 



341283.1 
341776.1 
342819.1 
345248.1 
345469J 
350959.1 
354375J 

38910.1 
396604.1 
39B093J 
44433.2 



45074.1 



,1 

47334 1 
479942.1 
50357.2 

509604.1 
54500.2 



561305.1 
65133 1 
707248.1 
7S5099J 
8113.1 



450560 83929 J 



83933J 
9074.1 



92338 _2 



9 3255 J 

981589.1 

1253334.1 



Accessions 

AWB48047 AW848202 AW848631 AW8481 42 AWB48702 AW848121 AW848632 AW6481 40 AW848571 AWB48009 AW848067 AW848069 
AW848905AW848214 

BB6019BAW935898T11520AW935930AW656073AWB61034 
AAW3554 AW7S3140 AA213770 AWS70B65 AA5S9075 M492132 

AA743331 AA837386 AWB64540 AA775711 AAI50965BE465475AA968994AA886905 AH 41054 AW194991 AA2521 47 AA31 9500 All 842B3 
AA708749 AA644620 AAB52769 AA242975 AA151D74T19B90 

A126770Q AI720344 AA191424 A1Q23543 A1469633 AA 172056 AW958465 AA172236 AW953397 AA355086 
H51299 H44619 H46391 RBC024 H51892 T72744 

AI633044 AW016212 AW241143 AA769058 R43272 AW06895B AA21091B AA293774 AI748815 AI763294 A13331 14 AI277384 A10B8297 AI46&477 

AI824624 AW169606 AJ631751 240749 A1984673 A1671 316 M189D24 AW23541 2 BE178426 R24677 R4O635H05K30 R40597 

AW934714 BE161007 BE1625Q0 AW749302 AW749B64 BE162498 BE1G1005 AA190449 AW513465 BE161006 BE162499 

AA1S6390 AA507837 AA1 9646B 

AA215702 AA368006 AA215703 BE066555 BE006B76 

AA226198 AA226513 AA3B3773 

T65754 AA229857 AA2295S8 

AA603305 M244095 AA244183 

AI909154 AA526337 AA244193 A1909153 

AF109300 A1299378 AJ2Q2654 

AA331156 AA331157 AA331155 

AA769045 AA372590 AW953533 

AA377823 AW954494 AI022G8B 

AA380153 AA380233 AW963529 

AA384564 AW9EG475 H021 21 N41 297 06321 3 AABB688B AI92241 4 AWG44240 AW19SB08 AI076736 AAS99294 AI954433 AW1 17617 A1640323 
H98134 

AK001 536 Mi91092 AW51 0354 AI554256 AL353958 AA134266 

Ar739l6B AA42B249 AI193636 AW50519B AW977291 AAB245B3 AA8B3419 AA724079 AI015524 A1377728 AW293682 AI92B140 M73143B 

A1092404 At085630 AA731340 

AA884766 AW974271 AA592975 AA447312 

AW2072Q6 AW341 473 AA448195 AI951341 

AA448765 C04967 C03045 AA658293 

AW072916 AH 84913 AA4B91 95 AW466994 AW469044 N59350 AI819642 AI280239 A1220572 AA789302 AI47361 1 AW841126 D60937 
AW972830 AA527647 AA488B20 AA570382 

AF072813 AF119297 AA3628B5 AF059524 NM.006054 AA157365 AW163823 AA05514B AA227062 AA41fl0S7 AA227076 AA078753 AA233594 

D5B629 AA232373 AA233S77 T3S956 BE618D35 AA3S4497 AA359082T32O10 AA134519 BE2999Q1 BE26809S BE396826 AA324258 AL120308 

AA187561 AA3116B0 

H28383 AW972670 H28359 AA525B08 

AW972667 AA526539 AI057032 AW1 67B42 

AA527941 A16106O8 AI620190AA635266 

AA534222 AA632632 TB1234 

AA534459 AW970240 AW970323 

AI8210B5 AW973464AA554B02AI821831 AA657438 AA640756 AA650339 

AA863400 AI991439 AW016017 AW0147O4 AI367512 R17550 AA744752 R46187 AW471324 AI126B70 AA628033 AI276287 A1094253 AI285003 
AH47163 A1911443 AW512612 AA972102 AA999975 AI684428 AI335035 D63102 AI524234 AI539156 AA565542 
T55958 T57205AF147346 
AA6S7494 A1582663 AI581 639 

AI692552 AI393343 A1800510 AJ37771 1 F24263 AA661876 

M156781 AW293839 U52W4 AA024953M776445 BE073977 AW4449Q4 AW602574 BE164040 BE164012 BE163972 BE163974 BE163992 

AAB374B1 AW468444 BE1B5091 AW46BGG2 AA6B7333AABlia30 AASStfiCS A1866686 A1572124AA043777 AA040926 D2O160 AJ536733 

AA812469AW874142AJ47166 

AW2S0553 L07876 Z36B43 R30B93 A1190097 AW96S317 

AA830149 AW978407 M85983 AW503637 

W76326AF08B341 W72300 

AWB92676 AAB53B77 D44747 

AW629666 AW959831 AW205739 BE620243 AA412367 AW300025 AW051920 AI288591 AW2361 14 Al 302852 A1038548 AA534496 A17972Q7 
AA921877 

AA973S0S A1299SB8 AA917D19 R6323S T90771 

BE379584 R3421 1 BE544768 AW973709 AI653056 AJ653173 A1286043 A1656750 H741 80 A1492830 A1376090 AI4721B4 D59940 AW170056 

A10B2443 AW021 142 A1167921 AI348677 AE78577 AWJ 30886 AA76161 7 AJS98203 M1 15535 Ai 264790 R34328 D69939 AW205O74 AA554902 

D62102AJ0007 

AW8B316 A1344631 AI261653 

AV655234 AW966332 AA340239 

TB5314 A1360684 T85S2BT91254 

AI475858AW969013 

NM-014253 AF1D0772 BED8B769 AL02271 8 BE161779 AW863569 BE161 640 AL039060 BE168542 AW2955S4 AA323193 AA235370 AW779760 
N48674 A1375997 R45432 D59344 AI203107 F07491 R35360 R25094 A1913631 AI498402 T61382 AI016320 M45528T61415 AA331486 
N40087 H12925 AA460779 AAQ96372BE1E0847 AWB16B90 AWB16B93 AVV816577 AW8J6891 AW816882 AWB16B6B AW8168B9 AW81694Q 
AW61 6892 AW816941 AW B1 6578 AA0291B3T83320 HB6850 AAC 10295 BE160823 Z25353 AA150883 AW8 67764 AWO23806 AWO22096 
AA164518 M730973 W0M1 7 W65303 

AI339732AA010300 AW515041 AA768334 N29860 AA425B74AA425118 AA8S5829 AW936B78 

AA453208 NM004251 U44103 AK71547 N57463 AW0D7521 AK79232TB2809 R67109 AW796099 AI833027 AT7653S5 AI763029 AI758228 
BE326331 AI934576 AI922378 AW276431 AI718466 N3656S AA904753 BE464245 AI338752 A1659875 AW272338 AI423136 A1069270 AI160904 
AA664354 BE1 

A1B31594 AW970667 AW027959 AH29B00 A1927949 A1650270 A1S2510S AW514661 AI70B393 AL13B076 BE180S10 AI92B721 Ai 399955 AA7 491 39 
A1B62160 AW87401 1 A1242763 AA262795 AA039864 H73499 AI093249 BE245661 AI816834 N25206 AA8283D1 AI084565 A1302816 AA026905 
AA77255 

AA418841 AI452657 A1768B76 AA023973 BE179B73C00215 AA41BS30 
AU 34757 AW079131 

BE16019B AW935898Tt1520AW93593OAWB56073 AW861034 

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457728 



AW9748U AA651634 AA650072 



TABLE 60C



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Pkey


Ref 


Strand 


Nt_position 


400664 


B11B49S 


Plus 


13558-13721,13942-14090,14554-14679 


400695 


7249150 


Plus 


160456-160567,164757.164673 


400880 


9931121 


Plus 


29235-29336,36363-36580 


401197 


9719705 


Plus 


176341-176452 


401424 


B176894 


Plus


24223-24428 


401451 


8634068 


Minus 


119926-121272 


401519 


6649315 


Plus 


157315-157950 


401558 


7139678 


Plus 


103610-104090 


402031 


7656761 


Plus 


33080-33263,33939-34094,36103-36507 


402002 


3287156 


Minus 


53242-53432 


402655 


9682953 


Minus 


59763-59909 


403046 


3540153 


Minus 


55707-55859,56369-66511 


403047 


3540153 


Minus 


5979^59938 


403790 


8084957 


Minus 


67826*7947,89835-90002 
112450-112648 


404571 


7249169 


Minus 


404632 


9796666 


Plus 


4509M5229 


404641 


97S6B10 


Minus


32247-32362 


404642 


9796810 


Plus 


102939-103145 


405523 


9454643 


Plus 


114550-114688,117265-117407,119490-^9599,123237-123395,131 140-131217 


405558 


1621110 


Plus 


450*4644,5983-6083 
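
The Nt_position entries in the table above are comma-separated start-end pairs, one pair per predicted exon. The following is a minimal sketch of how such a field might be read into numeric ranges. The function names are hypothetical, and the length calculation assumes inclusive coordinates, which the legend does not state. Entries damaged during text extraction (for example, a pair without a hyphen) raise a ValueError rather than being guessed at.

```python
def parse_nt_position(field):
    """Split an Nt_position field into a list of (start, end) integer pairs."""
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue
        # Exactly one hyphen is expected; anything else raises ValueError.
        start_text, end_text = part.split("-")
        exons.append((int(start_text), int(end_text)))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(abs(end - start) + 1 for start, end in exons)


# Field taken from the row for Pkey 400664 in the table above.
exons = parse_nt_position("13558-13721,13942-14090,14554-14679")
print(exons)
print(total_exon_length(exons))
```

Raising on malformed input keeps extraction errors visible instead of silently producing wrong coordinates.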



Table 61A lists about 440 genes up-regulated in prostate cancer compared to normal prostate and normal adult tissues. These were selected from 59680 probesets on the
Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" prostate cancer to "average" normal prostate tissue was greater than or equal to 2.0 and the ratio to normal adult
tissues was greater than or equal to 2.0. The "average" prostate cancer level was set to the 75th percentile amongst prostate cancers. The "average" normal prostate tissue level was
set to the 95th percentile amongst prostate tissues. The "average" normal adult tissue level was set to the 85th percentile amongst non-malignant tissues. In order to remove
gene-specific background levels of non-specific hybridization, the 10th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator
before the ratio was evaluated.
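
The selection rule described above can be sketched in a few lines. This is an illustration under stated assumptions, not the procedure actually used: the function names, the linear-interpolation percentile, and the synthetic expression values are all hypothetical, and the text does not say whether normal prostate samples are part of the non-malignant pool used for the 85th and 10th percentiles.

```python
def percentile(values, q):
    """Percentile with linear interpolation between order statistics.

    The interpolation rule is an assumption; the text only names the
    percentile levels, not how they were computed.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("need at least one expression value")
    position = (len(ordered) - 1) * q / 100.0
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (position - lower)


def expression_ratios(tumor, normal_prostate, normal_body):
    """Return (R1, R2) for one probeset after background subtraction."""
    tumor_level = percentile(tumor, 75)               # "average" prostate cancer
    prostate_level = percentile(normal_prostate, 95)  # "average" normal prostate
    body_level = percentile(normal_body, 85)          # "average" normal adult tissue
    background = percentile(normal_body, 10)          # non-specific hybridization
    numerator = tumor_level - background
    r1 = numerator / (body_level - background)        # tumor vs. normal adult tissues
    r2 = numerator / (prostate_level - background)    # tumor vs. normal prostate
    return r1, r2


def is_selected(r1, r2, threshold=2.0):
    """Both ratios must reach the threshold for the probeset to be listed."""
    return r1 >= threshold and r2 >= threshold


# Synthetic expression values, for illustration only.
tumor = [10 + 3 * i for i in range(101)]
normal_prostate = [0.5 * i for i in range(101)]
normal_body = list(range(101))
r1, r2 = expression_ratios(tumor, normal_prostate, normal_body)
print(r1, r2, is_selected(r1, r2))
```

Subtracting the same background value from numerator and denominator matters most for weakly expressed probesets, where an uncorrected ratio would be pulled toward 1. A denominator that is zero after subtraction is not addressed in the text; in this sketch it raises ZeroDivisionError.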

TABLE 61A: ABOUT 440 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE AND NORMAL ADULT TISSUES



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of prostate tumor to normal adult body tissues

R2: Ratio of prostate tumor to normal prostate tissue



Pkey 


ExAccn 


UnigeneID


Unigene Title


R1 


R2 


444151 


AW972917 


Hs.128749


alpha-methylacyl-CoA racemase


42.2 


42.2 


421566 


NM_000399


Hs.1395


early growth response 2 (Krox-20 (Drosop


3.2 


26.5 


420729 


AW954B97 


Hs.290825 


ESTs 


3.7 


15.8 


401197 






ENSP0D0Q0229263VHSPC213. 


3.0 


12.6


450096 


A16820B8 


Hs.79375 


holocarboxylase synthetase (biotin-[prop


124 


12.4 


449156


AF103907


Hs.171353


prostate cancer antigen 3, non-coding DD 


123 


12.3


443271 


BE5BB568 


Hs.195704 


ESTs 


11.6 


11.6 


434076 


AW880709 


Ks.283683 


chromosome 8 open reading frame 4 


21 


11.2 


431117 


AF003522


Hs.250500


delta (Drosophila)-like 1


9.4


9.4


417315 


AI080042


Hs.180450


ribosomal protein S24


2.3


9.0


416182 


NM_004354


Hs.79069


cyclin G2


8.4 


8.4 


421913 


Ar334355 


Hs.109439 


osteoglycin (osteoinductive factor, mime


2.2 


8.4 


434217 


AW014795 


Ha53349 


ESTs 


8.3 


8.3 


425782 


U6846B 


Hs.159525 


cell growth regulatory with EF-hand doma


8.3 


83 


442501 


AA316267 


Hs.23128 


ESTs 


2.0 


B.3 


429384 


AL050102 


Hs.227209 


hypothetical protein FLJ21517


5.1 


8.1 


449048 


Z4S051 


Hs.22920


similar to S68401 (cattle) glucose induc


3.1 


8.0 


417407 


AA923278 


Hs.290905 


ESTs, Weakly similar to protease [H.sapi


34.0 


7.9 


420120 


AL049610 


Hs.95243 


transcription elongation factor A (SII)-


7.5 


7.5 


419078 


M93119 


Hs.89584 


insulinoma-associated 1


7.4 


7.4 


425905 


AB032959 


HS.3185B4 


novel C3HC4 type Zinc finger (ring finge
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2.6 


2.6 


422728 


AW937826 


Hs.103262 


ESTs, Weakly similar to ZN91J1UMAN 3NC 


9.0 


26 


453070 


AKC01465 


Hs31575 


SEC63, endoplasmic reticulum transbcon 


22 


26 


400627 


T64904 


Hs.163760 


ESTs 


&7 


26 


407151 


H2583G 


Hs.301527 


ESTs, Moderately similar to unknown pis 


24 


2.6 


425815 


R94023 


Hs.945B0 


ESTs, Moderately similar to 138022 hypot 


26 


2.6 


433805 


M706010 


Hs/1 12742 


ESTs 


26 


26 


431474 


AU 33990 


Hs.190642 


ESTs 


9.3 


25 


436204 


AK002014 


Hs.47546 


Homo sapa'ans cDNA FU1145B fis, cbne HE 


22 


25 


439328 


W07411 


Hs.1 18212 


ESTs, Moderately similar to AUU3_r1UMAN A 


13 


25 


449115 


AW959552 


Hs.37528 


ESTs. Weakly slmBar to AF090944 1 PRQOB 


23 


25 


437531 


A1400752 


Hs.1 12259 


T call receptor gamma locus 


26 


25 


445424 


AB028945 


Hs.12696 


coriactin SH3 domain-binding protein 


3,4 


25 


403047 






NMJ05655^Homo sapiens transmembrane pr 


21.1 


25 


4161SB 


H27332 


Hs.99898 


hypothetical prateri MGC5338 


25 


25 


417576 


AA339449 


Hs.822B5 


phosphoribosylglycinarnale formytlransfer 


25 


25 


430607 


AW973521 


Hs.247324 


mftochondrial ribosomal protein S14 


25 


25 


436401 


AI0B7958 


HsJZ9088 


ESTs 


25 


25 


450654 


AJ245587 


Hs.25275 


Kmppel-type zinc finger protein 


25 


25 


426216 


N77630 


Hs.13895 


Homo sapiens cDNA FUH654 fis. cbne HE 


3.0 


25 


431325 


AW026751 


Hs.5794 


ESTs, Weakly similar to 21092B0A a cell 


11-9 


2.5 


410102 


AW246508 


Hs.279727 


Homo sapiens cDNA FU 14035 lis. done HE 


25 


2.5 
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10 




446791 


AJ632278 


Hs.1 95922 


ESTs 


2.5 


2.5 


418004 


U37519 


Hs.87539 


aldehyde dehydrogenase 3 family, member 


2.5 


25 


419791 


AI579909 


Hs.105104 


ESTs 


27 


2.5 


447210 


AF035269 


Hs.17752 


pnosphalidylserinB-specTIic phcsphoDpaa 


3.0 


2.5 


425294 


AF033827 


Hs.155553 


HNK-1 siifdransfersse 


21 


25 


452741 


BE392914 


HsJ0503 


Homo sapiens cDNA FU11344 fis. clone PL 


25 


25 


447702 


AI420256 


f At* A 1-J A 

Hs.161271 


ESTs 


Z1 


24 


412043 


BE156622 


Hs.333375 


Homo sapiens clone TA40 untranslated tjiRN 


24 


24 


421413 


AI82B128 


Hs.57637 


ESTs, Weakly similar to A49364 59 prolei 


2.4 


24 


429128 


AA446869 


Hs.1 19316 


ESTs 


2.0 


2.4 




AA44B765 




gbzxirje09.rl SoaiesJctatJetus_Nb2HF8_ 


2.4 


24 


435102 


AWB99053 


M5.76917 


r-box only protein 5 


24 


2/4 


456161 


BE264645 


Hs^82Q93 


hypothetical protein FU21916 


3.1 


2.4 


407021 


U52077 




gb:Humai\ marinerl transposase gene, comp 


7.3 


24 


437517 


AI927675 


Hs.244482 


Homo sapiens, ctone IMAGE:361 1719. mRMA. 


Z1 


24 


408267 


AW3B0525 


Hs .343564 


tubulin-specific chaperone e 


4.3 


24 


447713 


AI420733 


Hs.207063 


ESTs 


2,4 


24 


456848 


AL1 21057 


He.296406 


WAA0685gene product 


2,2 


2A 


437617 


AI02B701 


Hs.5716 


KSAA0310 gene product 


2.6 


2.4 


432378 


AM93046 


Hs.146133 


ESTs 


5.2 


2.4 


443180 


R15675 


Hs.258576 


cteudln 12 


10.1 


24 


445929 


AIOB9660 


Ha323401 


dpy-30-!lke protein 


2.2 


2.4 


424701 


NM_005923 


Ks.151988 


rn'rtog en-acS vated protein kinase kinase 


23 


24 


445255 


NWJ)14841 


Hs.1 2477 


synaptosomat-essociated protein, 91 kDa 


25 


24 


409731 


AA 125985 


Hs.56145 


thymosin, beta, identified in neuroblast 


27.5 


24 


421709 


AA159394 


Hs. 107056 


CEO-6 protein 


2.3 


24 


412707 


AW206373 


Hs.1 6443 


Homo sapiens cDNA: FU21721 fls» clone C 


4.7 


24 


4188B2 


NM.004996 


Hs.89433 


ATP-hinding cassette, sub-family C (CFTR 


24 


24 


451071 


AW13B807 


Rs.204939 


ESTs, Kfghly similar to A54677 homeolic 


24 


24 


420344 


BE463721 


Hs.97101 


putative G protein-coupled receptor 


3l1 


2.3 


433037 


NM_014159 


Hs.27S938 


HSPC067 protein 


24 


23 


418329 


AW247430 


HS.841S2 


cystBth'onirie-beta-syiithase 


. 4.1 


23 


419544 


A1909154 




gb:aV-BT20O-O1O499-0O7 BT200 Komoseplen 


2.3 


23 


4Z4602 


AK002D55 


Hs.151045 


hypothetical protein FU11 193 


2.1 


23 


450693 


AW450461 


Hs^03965 


ESTs 


23 


23 


462397 


A1680772 


Hs.306094 


trinucleotide repeat containing 12 


2.3 


23 


41204$ 


AA099802 


HB.B3863 


transmembrane, prostate androgen Induced 


4.1 


23 


428180 


A11 29767 


Hs,182o74 


guanine nucleotide binding protean (G pr 


ZD 


23 


419438 


AA406400 


Hs.12482 


glyce ron b phosphate O-acyHransferase 


27 


23 


443194 


A1954958 


Hs.279009 


matrix Gta protein 


2,1 


23 


414869 


AA157291 


HsJ21479 


ublnudein 1 


25 


2.3 


407300 


AA1 0261 6 


RsJ 20769 


gbzn43s07.s1 Stratagena Hela cell s3 93 


23 


2.3 


436396 


A1683487 


Hs.1 5221 3 


w*Eigless-type MMTV integration site faml 


2.3 


2.3 


439221 


AA737105 


Hs.32250 


ESTs, Moderately slmBar to 178886 serin 


Z3 


23 


453602 


AL1 34757 




gb:DKFZp547G10901r1 547 (synonym; Wbr1) 


4.1 


23 


421059 


A1654133 


Hs.30212 


thyroid receptor interacting protein 15 


7.S 


23 


426413 


AA377823 




gb:EST90805 Synovial sarcoma Homo saplen 


22 


23 


431147 


AI767751 


Hs.20300 


ESTs 


S.8 


2.3 


442677 


AJ557914 




peroxisomal acyl-CoA thioastarsse 


25 


2.3 


446091 


AW022192 


Hs.200197 


ESTs 


4.0 


23 


431563 


AKG7643 


Hs.120912 


ESTs 


Z2 


23 


439478 


AF049460 


fls.6574 


deformed epidermal autoregulatory factor 
hepaifi (transmembrane protease, serine 1 
C2002829:gl|4507689)ref|NP_00329d.1| tra 


2.1 


23 


413049 
4056Q5 


NM_002151 


Ks.823 


3.5 
23 


23 
23 


407917 


U63139 


Hs.415B7 


RAD50 (S. caraviEiaa) homotag 


23 


2.3 


450378 


AW249181 


Hs.19354 


ESTs, Weakly similar to T19673 hypotheti 


23 


2.3 


429303 


AW137635 


Hs.44238 


ESTs, Weakly similar to S65657 alpha- 1 C- 


23 


23 


417248 


AA329449 


Hs.247302 


twisted gastrulatton 


23 


22 


429277 


AW452016 


Hs.127863 


ESTs, Weakly slmBar to SFR4JWMAN SPUC 


2.6 


22 


431555 


AI815470 


Ks^60024 


Odc42 effector protein 3 


4.5 


22 


440099 


AL080058 


Hs.6909 


DKFZP564G202 protein 


29 


22 


416653 


AA76B553 


Hs.193145 


metallothtoneln IE (runcBonat} 


ZA 


22 


431542 


HG3010 


Hb.5740 


ESTs 


7.0 


22 


401556 






EKSPQ000D220478*^ECRETOGRANlN 111. 


22 


22 


4182B1 


UG9550 


Hs.1 154 


ovfdudai glycoprotein t, 120kD (much 9 


22 


22 


422295 


AF0S1151 


HM 14408 


toMIke receptor 5 


2.2 


22 


432453 


AJ885537 


Hs.27172 


ESTs, Moderately smlbrto PC4259 fern 


22 


22 


447499 


AW262580 


Hs.1 47674 


pratocadheriii beta 16 


2.2 


2.2 


452994 


AW962597 


Hs.31305 


K1AA1547 protein 


4.0 


22 


450325 


AI935962 


Hs.26289 


ESTs 


11.9 


22 


422390 


AW450893 


Hs.1 21830 


ESTs, Weakly simitar to T426B2 hypotheti 


2.1 


22 


448789 


BE539108 


Hs.22051 


hypothetical protein MGC15548 


26 


22 


436679 


AJ127483 


Hs.128451 


ESTs, Weakly similar to unnamed proteb 


11.8 


22 


425263 


NM_001197 


Ha.155419 


8Cl2-)ntaradlng Wlfer {apoptosis-induc 


3.2 


2.2 


451939 


U80456 


Hs.27311 


sngJe-mbded (DiosopftHa) be mo tog 2 


13,6 


22 


410252 


AW821182 


Hs.61418 


mccrofibrllar-associBled protein 1 


4.2 


22 


439699 


AF06G534 


HS.1B7561 


ESTs. Moderately slmBar to ALU^HUMAN A 


4.8 


22 


422576 


BE548555 


Hfl.118554 


CGI-B3 protein 


21 


22 


433293 


AF007835 


Hs.32417 


hypothetical protein MGC4309 


11.0 


22 


410733 


D842B4 


Hs.66052 


CD33an5gan (p45) 


3.1 


22 


421379 


Y15221 


Ha.1039B2 


small Inducible cytokine subfamfly B (Cy 


21 


22 


442995 


AA532511 


Hs.288455 


Homo sapiens cONA: FU23270 lis, clone C 


22 


2.2 


408633 


AW612232 


Hs.254835 


ESTs 


3.0 


22 
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Hs.29663B 


prostate arneranDaoon J actor 


4,4 


22 


A17AA A 


A\J\tmcviJi 
ftWS/3J24 


Hs.129816 


ESTs 


11.7 


2.2 


446354 


AW449650 


Hs.346335 


ESTs 


2,6 


22 


4 JDOQd 


a Cfi*7X flAa 
AhUf oUuy 




gb:Hon» sapJene full length Insert cDMA 


5.9 


22 


425628 


NMJJU4476 


Hs.1915 


folate hydrolase (piostate-specffic tnemb 


12.8 


22 








mza lusiona Tanwy, memwr L 


1 D 
J.D 


22 


A A {HAG 






norno sap ions clhma. tljzzsju tvs, ciohb i\ 


A tl 


22 


ARARAT 






NM_02l9G5*:Honuo sapiens phosphoglucomuta 


2.2 


22 


Av/Zbi 


AA411175 


ii j ijrtrtf, 
Hs.141939 


ESTs. Moderate/ similar to S65657 alpha 


2.2 


22 


4UoUbB 






C2DD2uD4:gi|73O3957[Qb(AAF590o0.1i [AE00 


3.1 


2.1 


415162 




Hs.78051 


IraiEcripbon factar21 


2.1 


2.1 


423349 


AFD10259 


H3.1 27428 


homeobox A9 


5.9 


21 


424534 


Do7bo2 


Hg. 150275 


WAA0241 protein 


2,3 


21 


407756 


AA1 1G021 


his 38260 


ubtquldn specific protease 18 


2.5 


21 


440658 


H29142 


Hs, 143 032 


ESTs, Weakly similar to neuronal thread 


2.5 


21 


416547 


H62914 


H $.266946 


hols, Weakly similar to PC4259 ferritin 


21 


21 








nypothBuraJ protein FUJI 037 7 


Z1 


21 


44/EoJ 


MM /Vljf7QG 


HS.ib7H2 


ihioredoxtn-liVe, 32kD 


2.1 


21 


4483 bo 


AW372914 


Hs.86149 


phosphoEiosttol 3-phosphat&bindinj) prot 


2.1 


2.1 


449897 


AW619642 


Hs.24135 


transmembrane pnotdn vezatlti;hypolheH 


2.1 


21 


452959 


A19334T6 


HS.1B9574 


ESTb 


Z1 


21 


427089 




HS. 173497 


Sec23 (S.cerBvisiaeJ bomotog B 


28 


21 


408291 




Hsv44131 


NAAC974 proteh 


21 


21 


418727 


AA2276Q9 


H&.94834 


ESTs 


21 


21 


440491 


R35252 


HS.13055& 


ESTs, Weakly similar to 21D9260A B eel 


2.1 


21 


433466 


AA50B353 


Hs.105314 


TaaMn 1 (H1) 


17,3 


21 


4JZoyD 




nS.z7oo34 


i a xni lie ». 

KIAA0146 protein 


2.3 


2.1 


427982 


NM_0161S6 


Hs,181326 


KIAA1073 protein 


3.2 


2.1 


445573 


AA250970 


Hs. 251946 


pofy(A)-Wnding protein, cylopiasmtc 1-1 


25 


2.1 


434569 


AI311295 


Hs.344478 


Kl AA0196 gene product 


21 


21 


441791 


AW37Z449 


1 l_ »4 «VTJt 

Hs.61271 


hypothetical protein FU21159 


2t 


21 


438520 


AA706319 


Hs,9B416 


ESTs 


27 


21 


414269 


AA298489 


Hs.303171 


olfactory receptor, family 51, subfamily 


17.3 


21 


410660 


AI061118 


Ks.65328 


Fence ni anemia, camp tementa lion group F 


3.5 


21 


455660 


AA909249 


He. 112282 


solute carrier family 30 (zinc transport 


7.4 


21 


450861 


AI52389B 


Hs.17617 


ESTs 


24 


2.1 


419193 


029643 


HS.347B9 


oblchyl^rphosrrfKWigosaccnHrxJe-prtrla 


21 


21 


AVX7VA 






Homo sapiens cDHA FU 14913 fis t clone PL 


3.2 


21 


41620B 


AW291168 


Hs41295 


ESTs, WeaWy simlTar to MUC2_HUMAN MUCIN 


6.4 


21 


407182 


AA3T2551 


Hs. 230157 


ESTs 


21 


21 


421689 


N 87820 


Hs. 106826 


KIAA1696 protein 


21 


21 


432833 


W51075 


Hs,1 10028 


ESTs 


21 


2.1 


4251 70 


AU077315 


Hs.154970 


transcription factor CP2 


26 


20 


436278 


GE396290 


HsJSQ97 


synaptogyrin 2 


28 


20 


438719 


AA357129 


HS.239625 


Integral membrane protein 2B 


23 


20 


4*51/28 


NNLsn6979 


Hs.278721 


HLA class II region expressed gene KE4 


3.6 


20 


410910 


J02931 


Ha.62192 


coagulallon factor lit (thromboplastin, 


20 


20 


429716 


R25685 


Hs.211933 


collagen, type Xttl, alpha 1 


3.6 


20 


424081 


Nl^.006413 


Hs, 1391 20 


ribonuc!easeP(30kD) 


20 


20 


433800 


A1034361 


Hs.135150 


lung type-1 cell membrane-associated ply 


20 


2.0 


413627 


BE182082 


Hs^46973 


ESTs 


3^ 


20 


414133 


AW02218B 


Hs.41167 


ESTs 


23 


20 


411096 


U 60034 


Ks.68583 


mitochondrial Intermediate peptidase 


14,7 


20 


450244 


AA007534 


Hs.1 25062 


ESTs 


ai 


20 


4ZZ49J 


AW4/4IOO 


li_ nj-fnio 


nypotneucal protein FU 13158 


26 


2.0 


40B219 


AA393383 


Hs.133331 


ESTs 


21 


20 


419986 


AI345455 


Hs.78915 


GA-binding protekt transcription factor, 


20 


20 


4ZlS*w 


AF052217 


ti&A 10099 


core-binding factor, runt domain, alpha 


20 


20 


434128 


W93170 


Hs>2B4164 


protein jc 0004 


27 


20 


434503 


T96231 


HS. 17762 


ESTs 


20 


20 


443292 


AK00D213 


Hs.9195 


hypothetical protein 


20 


20 


443555 


N71710 


Hs,2l398 


ESTs, Moderately amilarto A Chafn A, H 


20 


20 


448172 


1475276 


Hs.135904 


ESTs 


6.0 


2.0 


40<UJa 






ESTs 


20 


20 


40029O 


H18836 


Hs.31608 


hvrwlhaticfll nmtpin PI 190(141 




?n 

iv 


449625 


NMJM253 




odk (odd Oz/ten-m, DrosophiEa) homolog 1 


10.1 


20 


441406 


245957 


Hs.7837 


phosphoprctein cegutated by mJtogenic pa 


23 


20 


451369 


AA017321 


Hs.269691 


ESTs 


21 


20 


434011 


AW953437 


H&5486 


clone FIB5214 


21 


20 


408B06 


AW847B14 


H&2S9006 


Homo sapnns cDNA: F1J21532 fia, cbne C 


5,4 


20 


433388 


AI432672 


HS.2B8539 


hypothetical protein FU 22 191 


27 


20 


432653 


N6209S 


Hs^93185 


ESTs, WeaWy similar to JC7328 amino ad 


5.5 


20 


446783 


AW138343 


Hs.141867 


ESTs 


4.4 


20 


437323 


AA371145 


Hs.194397 


leptfri leoeplor 


2.5 


20 


410076 


T053B7 


Hs.7991 


ESTs 


2^ 


20 


421501 


M29971 


H9.1384 


C^S-methyfgoanlne-DNA meiriyl transferase 


21 


2.0 


419733 


AW362955 




Homo sapiens cDNA FU1441 5 fis, dona HE 


25 


2.0 



TABLE 61B 



WO 03/042661 



PCT/US02/36810



Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number

Accession: Genbank accession numbers 



Pkey 


CAT Number 


414660 


147525 J 


416882 


162718_1 


416913 


163QDt 1 


419544 


185760^2 


419733 


1875B9_1 


424036 


2346 0_1 


426413 


266650 1 


429258 


301917.1 


431676 
432689 


336411 1 
35275.1 


433S69 


46651 1 


43951a 


47334J 


442677 


548626J 


445 BOB 


65133J 


449625 


8113.1 


450580 


83929.1 


453B02 


9B1589.1 



Accession 

M743331 AAB37388 AW664540 AA775711 M15D965 BE465475 AA96B994 AA8B6905 A1141054 AW194991 AA252147 AA3195Q0 AH842BB 
AA703749 M644620 AA652769 AA242975AA151074 T19B90 

At533D44 AW016212 AW2411 43 AA76905B R43272 AW068958 AA21 091 8 AA293774 A1748B15 AJ763294AI3331l4A!2773B4A«)aB297 AM68477 
A1B24624 AWl BS6Q6 AI631751 Z40749 AI984673 A167131 6 M189024 AW235412 BE1784 

AW934714 BE161007 BE162500 AW749902 AW749864 BE162498 BE161Q05 AA190449 AW513465 BE161006 BE162499 
AJ9Q9154 AA526337 AA244193 AI909153 

AW362956 H59486 AI040665 WB0959 W94209 H27231 TB4625 H75715 W04957 W63676 AA659693 AA51 4302 W63789 BE04641 2 T91396 
AI951 970 AW044233 N2D01B AWS63548TS01 14 AI139947 AA809643 AA846232 AA581966 AA7B9GD2 

AA771J588 H15373 AW161070 BE3D4523 BE378517 AA9B930O AA904029 BE254211 AA44914B AI268420 A130D495 AI215637 AJ300494 AJ26855 \ 
AA928971 AA179427 AA947684BE393792 H96018 A1B85781 AJ16B567A129055B C15404 
AA377823 AW954494 A10226BB 
AA4487B5 C04967 C03045 AA65B233 
AI6B5464 AW971 336 AA513587 AA525142 

AB01B320 H56457 AA247916 N83488 NB7920 AA095663 T19858 AL134279 AA094167 AI673378 A1000340 R47500 W16595 AW152297 AJ625937 

AA002027 AI814B51 AA902666 Af039729 AW975063 BE302243 A1240793 AA1 93203 N5581 

AF075009R63109R63068 

W76326AF066341W72300 

AI557914 W81031 AW473764 AJ814081 W81068 AW182826 AW173295 A1376594 A122050O BE257195 BE2464BBR55637 C20788 A10144O7 
A*24B353 AW028015 
AV666234 AW966332 AA340239 

NMJM253 AF100772 BE0B8769 AL022718 BE161779 AW863569 BE161640 AL039D60 BE16B542 AW295554 AA323193 AA235370 AW779760 
N48674AI375997 R45432 D59344AI2D3107F07491 R35360 R25094AI913631 AI49B402T613 

N40087 H12925 AA460779 AA09B372 BE16084? AW816B90 AWB16B93 AWB16577 AYV816891 AWB16882 AWB16B68 AW81G8B9 AW816940 

AWB16B92 AWB16941 AW816578 AAQ29tB3T8332Q H86850 AA010295 BE1 6QB23 Z25353 AA150983 AW8 

AL134757AW079131 



TABLE 61C



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Pkey 


Ref 


Strand 


Nt_position


401197 


9719706 


Plus


176341-176452 


401519 


6649315 


Plus 


157315-157950 


40155B 


7139675 


Plus 


103510-104090 


402802 


3287158 


Minus 


53242-63432 


403047 


3540153 


Minus 


59793-59958 


404641 


9796B10 


Minus 


32247-32382 


405685 


4508129 


Minus 


37956-38097 


406068 


9114084 


Plus 


382-543 



Table 62A lists about 600 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip
array such that the ratio of "average" prostate cancer to "average" normal adult tissues was greater than or equal to 3.0. The "average" prostate cancer level was set to the 75th
percentile amongst prostate cancers. The "average" normal prostate tissue level was set to the 95th percentile amongst normal prostate tissues. In order to remove gene-specific
background levels of non-specific hybridization, the 10th percentile value amongst non-malignant body tissues was subtracted from both the numerator and the denominator before the
ratio was evaluated.
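
Once a background-corrected ratio is available for each probeset, the listing step described above reduces to a threshold filter followed by a descending sort, which is the order in which the table rows appear. The sketch below is illustrative only: the function name is hypothetical, the two six-digit Pkeys and their ratios are taken from the first rows of Table 61A, and the "demo" entries are invented to show the cut-off.

```python
def select_upregulated(ratios, threshold=3.0):
    """Keep probesets whose ratio meets the threshold, highest ratio first."""
    hits = [(pkey, ratio) for pkey, ratio in ratios.items() if ratio >= threshold]
    hits.sort(key=lambda item: item[1], reverse=True)
    return hits


# Pkey -> ratio of prostate tumor to normal tissue.
ratios = {
    "444151": 42.2,    # alpha-methylacyl-CoA racemase
    "420729": 15.8,    # ESTs
    "demo_low": 2.9,   # hypothetical probeset just below the cut-off
    "demo_edge": 3.0,  # hypothetical probeset exactly at the cut-off
}
for pkey, ratio in select_upregulated(ratios):
    print(pkey, ratio)
```

The comparison is inclusive, matching the "greater than or equal to" wording of the selection criterion.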

TABLE 62A: ABOUT 500 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE TISSUES 

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of prostate tumor to normal prostate tissue




Pkey 


ExAccn 


UnigeneID


Unigene Title


R1


444151 


AW972917 


Hs.126749 


alpha-methylacyl-CoA racemase


42.2


421565 


NM 000399 


Hs.1395 


early growth response 2 (Krox-20 (Drosop 


26.5


427274 


NM_005211 


Hs.174142 


colony stimutefirig factor 1 receptor, to 


1&0 


420729 


AW964B97 


Hs.290825 


ESTS 


15.8 


412326 


R07566 


Hs.73Bl7 


small Inducible cytokine A3 (homologous 


147 


4151 B8 


BE157260 


HS.79D70 


v-myc avfen mye^ocytomatosis viral oncog 


116 


417696 


BE241624 


H&A2401 


CO 69 antigen (pBO, earfy T-cell acttvaU 


13.1 


424247 


X1400B 


Hs.234734 


lysozyme (renal amyloidosis) 


12.6 


401197 






HNSPOC000223263*:IISPC213. 


12.6 


426793 


X898B7 


Hs.172350 


HIR (hlsteriB call cyde regulation defec 


12.6 


450096 


A16B206B 


Hs.79375 


nriDcarboxytase synthetase (WoUn-Iprop 


12.4 


449156 


AF103907 


Ks.171353 


prostate cancer antigen 3, non-coding DD 


1Z3 


443271 


BE56B568 


Hs.t9S704 


ESTs 


11.6 


434078 


AW8B0709 


Hs.283683 


chromosome 8 open reading frame 4 


11.2 


423396 


AI3B2555 


Hs.127950 


tromodomaln-cofrtaJnlng 1 


11.2 
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40603a Y14443 zinc finger protein 200 1U 

426559 AB001914 Hs.170414 paired basic amino acid cleaving system 10.6
419553 N34145 Hs.10177 ESTs, Moderately similar to ZN91_HUMAN Z 10.4
418036 Z37976 Hs.83337 latent transforming growth factor beta b 10.3
410023 AB017169 Hs£7929 slit (Drosophila) homolog 3 10.1
421684 BE281591 Hs.106768 hypothetical protein FLJ10511 9.9
426490 NM_001621 Hs.170087 aryl hydrocarbon receptor 9.9
437666 AA156761 metallothionein 1E (functional) 9.5
440274 R24595 Hs.7122 scrapie responsive protein 1 9.4
431117 AF003522 Hs.250500 delta (Drosophila)-like 1 9.4
405141 Y14443 zinc finger protein 200 9.2
448520 AB002357 Hs.21355 doublecortin and CaM kinase-like 1 9.2
417315 AI080042 Hs.180450 ribosomal protein S24 9.0
433332 AI367347 Hs.44898 Homo sapiens clone TCOCTA001 51 mRNA sequ 9.0
456614 AV653110 Hs.106650 hypothetical protein FLJ20533 8.9
427019 AA001732 Hs.173233 hypothetical protein FLJ10970 8.9
432606 NM_002104 Hs.3066 granzyme K (serine protease, granzyme 3; 8.8
417381 AF164142 HB.82G42 solute carrier family 23 (nucleobase tra 8.6
425174 D67450 Hs.154978 KIAA0261 protein 8.6
445701 AF055581 Hs.13131 lymphocyte adaptor protein 8.5
416162 NM_004354 Hs.79069 cyclin G2 8.4
445230 U97018 Hs.1245t echinoderm microtubule-associated protei 8.4
421913 AI934365 Hs.109439 osteoglycin (osteoinductive factor, mime 8.4
434217 AW014795 Hs.23349 ESTs 8.3
417363 AW129357 Hs.329700 ESTs 8.3
425762 U86468 Hs.159525 cell growth regulatory with EF-hand doma 8.3
442501 AA315267 Hs.23126 ESTs 8.3
429503 AA394183 Hs.2fi873 ESTs 8.2
429984 AL050102 Hs.227209 hypothetical protein FLJ21617 8.1
443837 AI984625 Hs.9884 spindle pole body protein 8.0
449046 Z45051 Hs.22920 similar to S68401 (cattle) glucose induc 8.0
444880 AW118683 Hs.154150 ESTs 7.9
417407 AA923276 Hs.290905 ESTs, Weakly similar to protease [H.sapi 7.9
436962 AW377314 Hs£364 DKFZP5S4I052 protein 7.9
450313 AI038989 Hs.332633 Bardet-Biedl syndrome 2 7.6
420120 AL049610 Hs.95243 transcription elongation factor A (SII)- 7.5
436613 C05589 Hs.243122 hypothetical protein FLJ13057 similar to 7.4
419078 M93119 Hs.89584 insulinoma-associated 1 7.4
425905 AB032959 Hs.318584 novel C3HC4 type Zinc finger (ring finge 7.3
451900 AB023199 Hs.27207 KIAA0982 protein 7.3
447769 AWS73704 Hs.320831 Homo sapiens cDNA FLJ14597 fis, clone NT 7.3
452436 BEG77546 Hs.31447 ESTs, Moderately similar to A46010 X-lto 7.3
452359 BE167229 Hs.29206 hypothetical protein MGC14376 7.2
42501a BE24S277 Hs.154196 E4F transcription factor 1 7.2
447737 AK000543 Hs.19404 DKFZP5B4LQ862 protein 7.2
424897 D63216 Hs.153684 frizzled-related protein 7.1
433867 AK000596 Hs.3618 hippocalcin-like 1 7.1
427308 D26067 Hs.174905 KIAA0033 protein 7.1
432359 AA078049 Hs.274415 Homo sapiens cDNA FLJ10229 fis, clone HE 7.0
452221 C21322 Hs.288057 hypothetical protein FLJ22242 7.0
416051 AA835868 Hs.25253 mannosidase, alpha, class 1A, member 1 6.9
421181 NM_005574 Hs.184585 LIM domain only 2 (rhombotin-like 1) 6.9
446076 AJ133123 Hs.20196 adenylate cyclase 9 6.9
426759 AI590401 Hs.21213 ESTs 6.8
416292 AA179233 Hs.42390 nasopharyngeal carcinoma susceptibility 6.8
417302 BE245812 Hs.8941 ESTs 6.8
417022 NM_014737 Hs.80305 Ras association (RalGDS/AF-6) domain fam 6.7
451957 AI796320 Hs.10299 Homo sapiens cDNA FLJ13545 fis, clone PL 6.7
452859 AI300555 Hs.288158 hypothetical protein FLJ23591 6.7
413431 AW246428 Hs.75355 ubiquitin-conjugating enzyme E2N (homolo 6.7
408405 AK001332 Hs.44672 hypothetical protein FLJ10470 6.7
421552 AF026692 Hs.105700 secreted frizzled-related protein 4 6.6
426108 AA822037 Hs.165468 programmed cell death 5 6.6
429490 AI971131 HsJ23689 ESTs, Weakly similar to ALU7_HUMAN ALU S 6.6
454219 X75042 Hs.44313 v-rel avian reticuloendotheliosis viral 6.5
443020 AI350055 9s,lQ6129 ESTs 6.5
415752 BE314624 Hs.78776 putative transmembrane protein 6.4
412462 AI499930 Hs.334685 mitochondrial GTP binding protein 6.4
417880 BE241595 Hs.82848 selectin L (lymphocyte adhesion molecule 6.4
444381 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 6.4
448019 AW947164 Hs.195641 ESTs, Moderately similar to I38022 hypot 6.3
409799 D11928 Hs.76845 phosphoserine phosphatase-like 6.3
424308 AW975531 Hs.154443 minichromosome maintenance deficient (S. 6.3
411373 BE326276 Hs^861 ESTs 6.3
423445 NM_014324 Hs.128749 alpha-methylacyl-CoA racemase 6.2
422583 AA410506 Hs.27973 KIAA0874 protein 6^
418196 AI745649 Hs.26549 KIAA1708 protein 6.1
438209 AL120859 Hs.6111 aryl-hydrocarbon receptor nuclear transl 6.1
437147 AL049964 H$^35B hypothetical protein FLJ20366 6.0
432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 6.0
446696 Tt5767 Hs.22452 Homo sapiens mRNA for KIAA1737 protein, 6.0
408242 AA251594 Hs.43913 PIBF1 gene product 5.9
447935 AW292577 Hs.94445 ESTs 5.9
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449118 


R67477 


Hs.23103 


427472 


AA522539 


Hs. 131250 


431548 


AIB34273 


Hs.9711 


417153 


X57010 


Hs.81343 


410689" 


X91662 


Hs.66744 


414734 


AA151712 


Hs.82572 


429597 


m 003316 


Hs.2442 


419047 


AW952771 


Hs.90043 


435080 


A1831760 


Hs.155111 


445715 


AB01295S 


Hs.13137 


422538 


NM Q0&441 


Hs.1 18131 


431055 


AL1 57645 


Hs.48793 


434640 


NMJH6045 


Hs.3945 


452144 


AA032197 


Hs.102558 


407938 


AA905097 


Hs.65050 


445467 


AI239S32 


Hs.15617 


414812 


X72755 


Hb.77367 


44B552 


AW973653 


Hs.20104 


406672 


M26041 


rls.198253 


416111 


AA0338J3 


Hs.79018 


4S3942 


AW1 90920 


Hs.19928 


416519 


AF0J3168 


Hs.79393 


432485 


N90665 


Hs»276770 


414516 


AI307802 


H&.1 35560 


414343 


AI036166 


Hs.323378 


412490 


AWB03564 


Hs.288850 


41S3B4 


AUQ769Q3 


Ks.79283 


453874 


AW591783 


Hs.3B13t 


458079 


At7S6870 


Hs.54277 


451694 


AF216751 


Hs.26813 


418758 


AW959311 


Ha.172012 


445757 


AW449055 


Hs. 13264 


449907 


AA004B25 


Hs. 103281 


419159 


AW974945 




421470 


R27496 


Hs.1378 


411486 


N85785 


Hs.181165 


429568 


AJ088691 


Hs.208414 


403851 






428839 


AJ767756 


Hs.82302 


417426 


NM_002291 


rte.82124 


424624 


AB 032947 


He 151301 


426350 


Nht.003245 


Hs.2022 


436476 


AA32610B 


Hs 33829 


422746 


NM 004484 


Hs 119651 


437571 


AA760894 


Hs.153023 


407902 


ALU 7474 


Ks.41181 


446965 


AU0387O4 


hte. 1568 27 


447072 


D 51 594 


Ha.17279 


44S969 


AW295142 


Hs 1801 87 


453204 


R10799 


Hs 191990 


452092 


BE245374 


Hs.27842 


438510 


AlO 80220 


Hs.6285 


444917 


R68651 


Me 144997 


414212 


M1 36569 


Hs. 10848 


451593 


AF151879 


Hs.26706 


417318 


AW953937 


H6-240B45 


444172 


BE147740 


HsJ 04558 


433600 


R42B33 


Hs^2232 


434170 


AA826509 


Hs.122329 


424090 




Hs.139262 


408331 


NWL007240 


Hs.44229 


435706 


W31254 


Hs!7045 


417421 


AM 38201 


Hb.82120 


432697 


AF1 55099 


Hs.279780 


433560 


AI925195 


Hs. 130891 


410762 


AF226053 


tfs.66170 


421623 


N4O850 


Hb 28625 


417094 


NM 006695 


Hs.8l182 


425397 


J04oaa 


Hs 15634S 


451244 




nonnduf f 


453370 


A147Q523 


Hs.139396 


424259 


AK001776 


Hs.143954 


434094 


AA305599 


Hs.2382D5 


452323 


W44358 


H5J292812 


437446 


AA7B8946 


Hs.101302 


411352 


NM_002890 


Hs.758 


443062 


N77999 


HS-8963 


425242 


D13535 


Ha.155287 


445800 


M126419 


Hs.32244 


439024 


R9SS96 


Hs^5598 


407235 


020559 


Hs.169407 


430935 


AYV072916 




412953 


Z4S794 


Hs.238809 



Bell (S. ce/evislae} homolog 5.9 

Iranspogon-derived Buster3 transposase-l 5.9 

novel protean 5.B 

collagen, type il t alpha 1 [primary oste 5.8 

twist (DrasopWIa) homolog (scrocephotos 5.8 

ESTs 5.8 

a disntagrin and metaJIoprotenase dcma 5.8 

ESTs 5.8 

hypotheHcaJ protein FU 14428 5.8 

UV radiation resistance associated gene 5.8 

5,1 0-mathenyl tetrahydrofolale synthetase 5.7 

sialyltrartsferase 6 (liacetyilaccsaminid 5.7 

CGM07 protein 57 

Homo sapiens, ckxiB MGG:5352. mRNA, comp 5J 

phospholamban 5.7 

ESTs, WesWy sTmlfcr to AUJ4 _HUMAN ALU S 5.6 

monokine induced by gamma Interferon 5.6 

hypothetical protein FU00052 5.6 

major hislocompatiblJily complex, dass 5.5 

chromaffin assembly factor 1 > subunltA ( 5.5 

hypotheHcaJ protein SP329 5.5 

tuberous sclerosis 1 5.5 

CDW52 antigen (CAMPATH-1 antigen) 5.5 

ESTs. Weakly sBmilar to T43458 hypotheU 5.5 

coated vescte membrane protein 5.4 

Homo sapiens cDNA: FU22528 Rs, clone H 5,4 

satectinPHgand 5.4 

collagen, type XIV, alpha 1 {undulfn} 5.4 

DNA segment on chromosome X (unfque) 992 54 

CDA14 5.4 

hypothetical protein OKFZp434J037 5.4 

KIAA0B56 protein 54 

ESTs 54 

hypothatical protein 5.3 

annexfn A3 53 

euRan/oUe translation etangafao factor 5.3 

Homo sapiens mRNA: cDNA DKFZp5S4O0472 (f 5.3 

C5002l54';gjJ7299016|gb|AAF54217.1I {AE0 a3 

Homo sapiens cDMA FU14814 fe, clone MT 5.3 

larninin. betal 5.3 

Ca2*depeiidenl activator protein for sacr 5.3 

transojutaminase 3 (E polypep^de, prate 5.3 

bHLH protein DEC2 5.3 

gtypicanS 5.2 

ESTa 5.2 

Homo sapiens mRNA; cDNA DKF2^>727C1 91 (fr 5.2 

ESTs, WeaWy sWlar to ALU1JWMAN AlU S 5.2 

^yrosytproteln sutfofransferase 1 5.2 

Homo sapfens cDNA HJ14337 lis, done PL 5.2 

ESTs 5,2 

hypothetical protein FU11210 5.1 

DKFZP586P01 23 protein 5.1 

ESTs &1 

KIAA0187 gene product 51 

CQ-121 protein 51 

ESTs 51 

ESTs, ModBrataly similar to 138022 hypol 51 

ESTs 5.O 

ESTs 50 

WAP essociBted fector-1 50 

dual specificity phosphatase 1 2 5.0 

GLQ04 proJein 5.0 

nuclear receptor subfamily 4> group A, m 5.0 

NY-REM-18antigan 5.O 

hypolheteal protein MGC4400 5.0 

HSKI^B protein 5.0 

ESTa 5.O 

hfetemne N-methyltransferase 4,9 

topolsomerase (DMA) B alpha (1 70kD) 49 

hypothetical! proteTn FU20039 4,9 

ATP-Mnding cassette, sub-family C (CFTR 4.9 

hypottiefcal protein FU10914 4,9 

hypothetical protein PRO2013 4.9 

ESTs, Weakly similar to T3346B hypotheli 49 

ESTs, Moderately similar to CA1C RAT COL 4.9 

RA3 p21 protein activator (GTPase acb'va 4,8 

Homo sapiens mRNA lull length Insert cDN 4.8 

KLAA0010 gene product 48 

Inositol polyphospriatB-4-pho5phatase, ty 48 

ESTs 4.a 

SAC2 (suppressor of actln mutaUons 2, y 48 

zinc finger protein 131 {clono pHZ-10) 47 

ESTs 47 
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413450 


Z99716 


Ks.75372 


421863 


AI952677 


Hs.106972 


437916 


BE566249 


Hs.20999 


451952 


AL120173 


K&.3016S3 


416155 


AI8D7264 


Hs.205442 


451B31 


MMJJ01674 


Hs.460 


414522 


AW518944 


Hs.76325 


427176 


AW381559 


Hs.40334 


441390 


AJ692560 


Hs.131175 


447382 


AVH127790 


Hs.182261 


431676 


AI&S5464 




422241 


Y00062 


Hs.170121 


A52242 


R50956 


Hs.159993 


401519 






414342 


AA7421B1 


Hs.75912 


42&197 


AAGQ4410 


Hs.100009 


429467 


NM_0O4477 


HsJ203772 


414341 


060004 


Hs.75909 


408380 


AF123050 


Hs.44532 


431393 


AW971493 


Hs.134269 


425907 


AA365752 


Hs.155966 


419111 


AA234172 


Hs.137418 


425840 


AW978731 


Hs.301824 


414020 


KM_002954 


Hs.75703 


427078 


AJ676062 


Hs.111902 


419326 


W94915 


Hs.42419 


419465 


AW5Q0239 


Hs.21187 


408705 


AA312135 


Hs.46967 


429250 


H56585 


Hs. 198308 


431585 


BE242B03 


Hs.262823 


449673 


AA002064 


Hs.16920 


438440 


AA8Q7226 


Ha.225161 


427210 


BE3Q6283 


Hs.173987 


428673 


AW601325 


Hs.337757 


456974 


M12529 


Hs.169401 


416814 


AW192307 


Hs.80042 


443123 


AA09453B 


Hs.272803 


436420 


AA443S66 


Hs.31595 


443250 


A1041630 


rte.132107 


411962 


AA099050 




423545 


AP000692 


Hs.129761 


423696 


AA32979S 


Hs.1098 


423641 


AU37256 


Hs,130489 


430259 


BE5501fl2 


Hs.127626 


453787 


AB011792 


HS.35D94 


426328 


AA42608O 


Hs.96489 


4422B1 


N34742 


K3.17Q065 


436446 


AW016S09 


Hs.1 19021 


450704 


H85157 


Hs.40695 


433666 


AWOJ4493 


Ha.126727 


417300 


H6072O 


H3.81B92 


mm 


AUM6642 


Hs.194019 


444212 


AW503976 


Hs.10849 


447188 


H65423 


Hs, 17631 


447658 


AW0B0339 


Hs.80426 


418239 


AA279530 


Ha.83968 


426627 


AW057605 


HS.172665 


432882 


MM.013257 


Hs.279696 


425235 


AA353113 


Hs.112497 


417986 


AA4810O3 


Hs.97128 


43882B 


AL134275 


Hs.6434 


406086 


AW1 57022 


Hs.343551 


410853 


H04588 


Hs.30469 


414002 


NM_M6732 


Hs.75678 


431318 


AA5D2700 


Hs.293147 


406554 


AA636381 


Hs.315111 


416782 


U5035 


Hs.79886 


416701 


AA814846 


Ha.96343 


419063 


AM79560 


Hs.96613 


4299DO 


AA460421 


Hs.30875 


437672 


AW748265 


Hs.5741 


439518 


W76326 




450661 


AW952160 


Hs.146550 


443907 


AU076484 


rfe.9963 


40&151 


AA306105 


Rs.S0785 


430968 


AW972830 




456B04 


AI421645 


Hs.139851 


4142S1 


AI239619 


Ha.13040 


420544 


AA677577 


Hs£fl732 


446430 


AI500642 


Ks^89Q67 


429570 


BE2422S6 


Hsl2441 


444100 


AA383343 


H&22116 


443622 


AI911527 


Hb.11805 



N-acetylgatactosaminidflse F alpha- 

Homo sapiens mRNA; cDNA DKFZp434P22B (fr 

hypothetical prata&i FU23142 

ESTs 

ESTs, WaaWy similar to T34036 hypothec 

activating transcnplion ractor 3 

step U splicing factor SUI7 

ESTs 

ESTs 

ESTs 

gtettaenXxl NCI_CGAP_Pr2B Homo sapiens 
protein tyrosine phosphatase, recaptor t 
gycosyltransferase 

C1 5000476*:fli|12737279|ref|XP_012163.li 
KIAA0257 protein 

acyl-Coenzym© A oxidase 1 , pslmltoyl 
FSHD region gene 1 
KIAAD182 protein 
dluttquihn 

ESTs, Highly similar to cytokine raceplo 

ESTa 

ESTS 

hypothetical protein PR01331 

smafi fridudWe cytokine M (homologous 

ESTs 

ESTs 

Homo sapiens cDMA: FU23068 fls, clone L 

HSP0034 protein 

tryptophan rich basic protein 

hypothetical protein FU 10326 

ESTs 

ESTs 

eukaryotte translation inrfetton factor 

Homo sapiens mRNA; cDNA DKFZp5B6M063 (fr 

apolpoprofein E 

otlir^|^lc:Mm9C3teNAc2-PMdk:hylg| 
pulabva transcription regulation nuclea 
ESTs 
ESTs 

gtezk85d12j1 SoaresjregnanLuterusJ^bH 

chromosome 21 open reading frame 5 

DKFZp434Jl813protefo 

ATPasev amirwphosphollpldtrapsporter-ll 

RalGE F-JIke protein 3, mouse homolog 

extracellular matrix protein 2, female o 

ESTs, Weakly similar to 138022 hypotheH 

Homo sapiens cDNA FU 13492 rTs r clone PL 

ESTs 

ESTs 

ESTs 

KIAAO101 gene product 
sttrac&n 

basement membrane-Induced gene 
hypothetical prolein DKFZp434E2135 
ESTs 

integrln, beta 2 {anffgen C018 (p95). ly 
rnetriyTenetBtrahydrafolBte dehydrogenase 
serontftfucocorfcold regulated Idnase-lf 
Homo sapiens cDNA: FU 22743 fls, done H 
ESTs 

hypothetical protein DKFZp761F2014 
hypothetical protein RJ225Q4 
ESTs 

FBJ murine osteosarcoma viral oncogene h 
E&Ts, Moderately similar to A460lQX-fei 
nuclear receptor co-repressoriHDAC3 comp 
ribose 5-phoEphata isomeiase A (rlbose 5 
ESTs, Weakly simflar fa AUKLHUMAN lilt 
Homo saptens cDNA FU 12292 fis, clone MA 
ESTs 

flavoriemoprotein &57 

gfrzrf60d04r1 SoaresJetaUtearLNbHH19W 
ESTs 

"TYRO protein tyrosine kinase binding pro 
SEC22, vesicle trafficking prolan (S. c 
gb:EST364925 MAGE resequences, MAGI Homo 
caveolin2 

6 protefn-coUpled receptor 86 
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422473 U94780 Hs.1 17242 meningioma expressed antigen 6 (coiled-c 3.2 

407816 AW500857 Hs.40137 anaphase-prcmr^ng complex 1; meioiJc ch 3.2 

„ 468809 AW972512 Hs.20985 sin^assoclated pdypapEde, 30kD 12 

75 453BG4 AW021407 Hs.21068 hypothetical proton 12 

448944 AB014605 Hs52599 atrophin-1 uiteradin9 protein 1; acKvi 3.2 

435129 AI3B1659 Hs5670B6 ESTs 12 

424894 H83520 Hs.1S367B reproduction 8 12 

_ rt 416959 D28459 Hs.80612 lAiqurbrnxir^alrng enzyme E2A (RAD6 h 12 

80 410185 BE294068 Hs.737 Immediate early protein 3.2 

451149 ALD475B6 Hs.1 0283 FWAbTnaTng moW protein 8B 35 

410531 AW752953 gb:QVO-CT0224-261099-035Hg02 CT0224 Homo 35 

411954 AA652523 Hs.269496 EffTs, Weakly rfmitar to I3B022 hypo the b 12 
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423748 AI149048 Hs.3Q211 fyypoihetfcal protein FU22313 3.2 

444439 AI15101O Hs.157774 ESTS 3.2 

453124 All 39058 Hs.1 25790 leucbie-rich repeat-contaWng 2 3.2 

4S2413 AW082633 Hs.212715 ESTs 11 

5 409390 AI927212 Hs.3734 ESTs 3.1 

453078 AF053551 Hs.31584 metaxm2 3.1 

453024 AWB4S787 gb:QV3^T0194-1B1099^04-e01 CT0194 Homo 3.5 

439444 A1277S52 Hs.S4578 ESTs, Weakly sfmalar to (38022 hypoLhatf 3.1 

_ 441124 T97717 Hs.1 19563 ESTs 3.1 

U 416128 AA173632 CDC14 (cell division cyde M. & cere* 3.1 

445101 T75202 Hs.12314 Homo sapiens mRNA; cDNA DKFZp5B6C1 019 (f 3.1 

407949 W21874 Hs^47057 ESTs, WeaWy similar to 21O9260A B cefl 3.1 

413492 D87470 Hs.75400 WAA026G protein 3.1 

421044 AF061871 Ks.101302 Human DNA sequence from dona RP1 -238D1 5 3.1 

5 422603 BE2425B7 Hs.1 18651 rremaiopoieticsly expressed homeobax 3,1 

427399 NMJ014883 Hs.177664 KtAAG9t4gene product 3.1 

434804 AAG49530 Hs.348148 gb:ns44flM«1 NCLCGAP^AJvl Homo sspfens 3.1 

442906 AW2S6888 Hs.170939 ESTs 31 

iri 445540 AI277811 Hs.146291 ESTs 11 

IV 446084 AL13765B Hs,1376B Homo sapiens mRNA; cDNA DKFZp434l1216 (f 3.1 

435293 AI040777 Hs.1 17170 ESTs 3.1 

400750 Target Exon 3.1 

416852 AF2B3776 Hs,80285 Homo sapiens mRNA; cDNA DKFZp586C1723 (f 3.1 

. _ 423566 AW97S434 Hs.3S23 hypothetical protein FU 11 220 3.1 

15 409558 BE253407 Hs.72363 Homo sapiens mRNA for FU00116 protein, 3.1 

400194 NMJ0O3763:Homo sapiens synlaxln 16 (STX1 3.1 

400479 Target Exon 3.1 

402895 ENSP0DM0252117:MydlritrariSCripUonfac 3.1 

403423 Target Exon 3.1 

>U 408161 AW952912 Hs.300383 hypothetical protein MGC3032 3.1 

410886 AW809324 gb:MR4^121-141099410-Gu$_1 ST0121 Ho 3.1 

414922 D00723 Hs.77631 gryrjne cleavage system protein H {amTno 3.1 

420121 AW968271 Hs.191534 ESTs, WeaWy similar to ALUl JtOMAN AUU S 3.1 

430519 AF129534 Hs.49210 F-bc* only protein 4 3.1 

>5 447147 AA910353 Hs.292815 ESTs, Weakly similar to T23482 hypotheH 11 

422461 ALC501G3 Hs.1 17339 DNAX-acttvatbn protein 10 11 

430467 D87742 Hs.241552 WAAQ268 proten 3.1 

409549 ABQ29015 Hs.54888 phosphdrpase C, epslbn 2 3.1 

423554 M9051B Hs.1674 glutairtne^uctose-6-phospha^ 3.1 

W 431286 AW263476 Hs.44268 myelin gone expression factor 2 3.1 

430226 BE245582 Hs2551 adrenergic, beta-2-, receptor, surface 3.1 

408051 A1S23351 Hs.172148 ESTs 3.1 

420090 AA220238 Hs.94986 ribonucteaseP{38kD) 3.1 

400296 AA305B27 Hs.1 39336 ATP-hindlng cassette, sub-famiry C [CFTR 3.1 

+5 414279 AW021691 GCN5 {general control of ammo-ackl syrrt 3.1 

440426 A1159B00 Hs.7181 Homo sapiens cDNA FU136B3 lis, done PL 3,1 

423872 AB020316 Hs.1 34015 uronyt 2-suifblransterasB 3.1 

420154 A1093155 Hs.95420 JM27 protein 3.1 

447560 AF0B5214 Hs.1 8858 phospnoUpas« A2, group IVC (cytoscEc, 3.1 

>0 453293 AA382267 Ha.10653 ESTs 3,1 

434393 AA121098 H&3838 SBnJriHndtlcibld kinase 10 

414135 NM.004419 Ks.2128 dual specificity phosphatase 5 10 

441527 W19504 Ks.7584 solute earner family 21 (organic anion 3.0 

423957 AW976309 Hs.136235 Homo sapiens cDNA FU13542 Ms, done PL 3.0 

!)D 444001 AI095087 Hs.1 52299 ESTs, Moderately similar to S65657 alpha 3.0 

411315 AW838547 gb;PM3-lT0D32-O3OtC0-005^0B LTD032 Homo 10 

414178 AW957372 Hs.46791 ESTs, WeaMy similar to 1 38022 hypothell 10 

441401 AI824338 rls.126B91 ESTs , 10 

457718 F18572 Hs_2297B ESTs, Weekly simlar to ALUOUMAN ALU S 10 

bi) 429500 X78565 Ks.289114 hexabrachlon {tenasdn C, cytctactln) 10 

424240 AB023IB5 Hs.1 43535 ca!ciurru^n>c<liilrhOeperKJent protein km 10 

447476 BE293466 Hs.20B80 ESTs, Weakly similar to 138022 hypothec 10 

419440 ABO206B9 Hs.90419 WAAD882 protein 10 

„ 421786 AI188653 Hs.21351 ESTs 3.0 

55 433285 AW975944 Hs^37396 ESTs 10 

409997 A1906055 Hs.57749 synaptic nuclei expressed gene 2; KHAA10 10 

417874 BE615160 Hs.62629 protein tyrosine phosphatase, nwrecepi 10 

420956 AA351584 Ha.100543 Homo sapiens done 24505 mRNA sequence 10 

445345 AW003850 Hs.12532 chromosome 1 open reading frame 21 10 

70 450571 AI356967 Hs.43086 ESTs, Weafty similar to A46010 XJInked 10 

427315 AA179949 Hs.1 75563 Homo sapiens mRNA; cDMA DKFZp564N0763 (f 10 

434953 BEO49102 Hs.1 799 B2 ESTs, Weakly simlar to TRHYJ-iUMAN TRiCH 10 

436465 AK001455 Hs.5198 Down syndrome critical region gene 2 10 

428130 AW4449B5 Hs.77603 ESTs ^ 10 

O 414839 X63692 Hs.77462 DNA {cytosinB-5-)-m8lhyrlransfBrase 1 10 

445784 A1253155 Hs.146065 ESTs 10 

448677 AI800311 Hs,15B29l ESTs 3.0 

426448 R0BO54 gb:yaBSg07rl Sosfes fatal Ever spJean 3.0 

445921 AW015211 HS.1461B1 ESTs 10 

OU 406673 M34996 Hs.193253 major hfctooompatfblllly complex, class 10 

430387 AW372BB4 Hs240770 nuclear cap binding protein subunit 2, 2 10 

427157 U51166 Hs.173824 mymlne-DWAglycosylase 10 

435907 AW304025 Hs.256397 ESTs 10 
451102 AA015583 Hs.41185 Homo sapiens mRNA; cDNA DKFZp56401 262 (f 3.0
USK ^JS^H 44 tryptophanyl tRNA synthetase 2 (mitochon 3.0
414052 AW578849 Hs.283552 ESTs, Weakly similar to unnamed protein 3.0
450580 N40087 ESTs 3.0
Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pksy 

410531 
410886 
411315 
411982 
414279 



CAT Number Accession 



120720D_1 
1225822J 
1238570 1 
126744 T 
143227.1 



414680 147525,1 



41552B 
416126 
416882 

422673 
426446 



430968 
431304 
431676 
432125 
432363 
437158 

437866 

439047 
439518 
450506 

450580 

452260 

453024 

455473 

456719 



1539409.1 

157163.1 

162718.1 

219674.1 
267323.1 
325772^1 
32S269_1 
331286.1 
336411.1 
341776.1 
345469.1 
433S2J& 

44433JZ 

4681 39_1 
47334,1 
836J 

83929.1 

9074.1 

944876_ J 2B 

1292917J 

222707.1 



AW752953HB8044 BE1 56092 
AW809324 BE144977 8E144956 
AW836547 AW836513 AW836587 
AA099050 AA099526 T47733 

R17236R52580F11642 

M173632 Al 174858 AA581361 AJ700024M173988BE165417AI365964 

N59027 AA314594 N53937 R031 00 
R06054 AA376789 AWS55453 

^^tS! 7 ^mZf^ M ^ U AIB19642 A1280239 AI22TJ572 AA789302 AM7361 1 AWB41126 D60937 

BE157283 BE157287 AA 502438 

At685464 AW971336 AA5135B7 AA52S142 

AW972667 AA526539 AJO57032 AW167842 

AA5344B9AW97O24OAW970323 

^™?J AW ? 93839 U52054 AA024963 AA778446 BE073977 AW444S04 AYV602574 BE1 64040 BE164012 BE163972 BE163974 8E1 63992 
AA837481 AW468444BE1 85091 AW468002 AA687333 AA81 1830 AA581806 AJ866636 A1572124 AA04 Btl t" 5 "^ BEl535f74 SE163992 
AVW79177AA846994AA829672 ^ 
W76328 AFDB6341 W72300 

AW846787 AVU903370 AW903378 AW752032 AW838052AW839051 AW839054 AWB39058 AW903374 AWB39045 AW9 
AWB39043 AVV639049 AW903372 AW846755 AW846767 AW903366 AW846766 BE146826 AW83905S AW8W802 AW 
AW984786ASV984816AW984811 AW984fln7 AW9fWR1Q AUWflA7f«AWQM7Ro l^Kx>A^^ A^o^on .«,™™«. 



5^™£K"E^ AW9B4620AW 
243784 R133B2 AW572911 AA449369 H17037 R19603 A1632555 AW8W030 BE502530 Z25032 AA8D5324 AA449241 AI651825 Al?64RM 
AW1EH318A^267A1053735A1:B37^ 



TABLE 62C 



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
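
As an illustration of how the nucleotide position column can be read, the sketch below parses an entry into exon ranges. It assumes, from the entries visible in this table, that each entry is one or more start-end ranges separated by commas; the function names and the treatment of coordinates as inclusive are assumptions made for the example and are not stated in the source.

```python
def parse_nt_position(field):
    """Parse 'start-end[,start-end...]' into a list of (start, end) integer pairs."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            # An OCR-damaged entry can produce an impossible range; reject it.
            raise ValueError(f"exon start after end: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, treating both coordinates as inclusive (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    single = parse_nt_position("115386-116348")
    multi = parse_nt_position("198991-199168,199316-199548")
    print(single, total_exon_length(single))
    print(multi, total_exon_length(multi))
```

Rejecting ranges whose start exceeds their end is a deliberate choice here: it flags entries that were damaged in transcription instead of silently producing a negative length.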



Pkey    Ref      Strand  Nt_position
400479  6439766  Minus   115386-116348
400658  8118459  Minus   73525-73644
400750  8119067  Plus    198991-199168,199316-199548
401197  9719705  Plus    176341-176452
401519  6649315  Plus    157315-157950
401597  3293210  Plus    65838-66031
401744  2576349  Plus    14535-14751
401807  7331538  Plus    152325-152912
402474  754717S  Minus   53526-53628,55765-55920,57530-57757
402802  3287156  Minus   53242-53432
405695  9967547  Plus    85537-85671,8637^86469
403100  6954402  Minus   13683-13874
403389  9438331  Minus   163415-163634
403423  7105492  Plus    69 34a 69615
403851  7708672  Plus    22733-23007
404390  8887030  Minus   39624-10072
405141  8980911  Plus    99861-100054
405934  6758795  Plus    159913-160605
406038  8389537  Plus    37764-37877
406247  7417725  Minus   46234-46461
Table 63A lists about 366 genes significantly down-regulated in prostate cancer compared to normal prostate. These were selected from 59680 probesets on the Affymetrix/Eos Hu03
GeneChip array such that the ratio of "average" normal prostate to "average" prostate cancer tissues was greater than or equal to 2.5. The "average" normal prostate level was set to
the 75th percentile of normal prostate tissues. The "average" prostate cancer level was set to the 75th percentile amongst tumor samples. In order to remove gene-specific
background levels of nonspecific hybridization, the 10th percentile value amongst all the tissues was subtracted from both the numerator and the denominator before the ratio was
evaluated.
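
The selection rule described above can be sketched as follows. This is a minimal illustration only, not the procedure actually used to generate the table: the function names, the linear-interpolation percentile, the pooling of the normal and tumor samples to stand in for "all the tissues" when no separate list is supplied, and the handling of a non-positive denominator are all assumptions made for the example.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation (assumed method)."""
    xs = sorted(values)
    if not xs:
        raise ValueError("percentile of an empty sequence")
    pos = (len(xs) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(xs) - 1)
    return xs[lo] + (xs[hi] - xs[lo]) * (pos - lo)


def normal_to_tumor_ratio(normal, tumor, all_tissues=None):
    """Background-corrected ratio of 'average' normal to 'average' tumor expression."""
    if all_tissues is None:
        # Assumption: pool the two groups when no wider tissue panel is given.
        all_tissues = list(normal) + list(tumor)
    background = percentile(all_tissues, 10)         # 10th percentile of all tissues
    numerator = percentile(normal, 75) - background  # 'average' normal level
    denominator = percentile(tumor, 75) - background # 'average' tumor level
    if denominator <= 0:
        # Assumption: the source does not say how this case is handled.
        return float("inf") if numerator > 0 else float("nan")
    return numerator / denominator


def is_down_regulated(normal, tumor, all_tissues=None, threshold=2.5):
    """Apply the 'greater than or equal to 2.5' cutoff to one probeset."""
    return normal_to_tumor_ratio(normal, tumor, all_tissues) >= threshold


if __name__ == "__main__":
    normal = [100, 200, 300, 400, 500]  # hypothetical expression values
    tumor = [20, 40, 60, 80, 100]       # hypothetical expression values
    print(round(normal_to_tumor_ratio(normal, tumor), 2),
          is_down_regulated(normal, tumor))
```

Subtracting the same background from numerator and denominator means a probeset with high nonspecific signal in every tissue does not pass the cutoff merely because both levels are large.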

TABLE 63A ABOUT 366 GENES SIGNIFICANTLY DOWN-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal prostate tissue to prostate tumor tissue




Pkey ExAccn UnigeneID Unigene Title R1
425932 M61650 Hs.1968 semenogelin I 89.4
407245 X90566 Hs.172004 35.4
425645 N98529 Hs.158296 Homo sapiens, clone MGC:12401, mRNA, com 32.2
426752 X69490 Hs.172004 titin 29.5
400440 X63957 Hs.83870 nebulin 20.7
412519 AA19S241 Hs.73980 troponin T1, skeletal, slow 13.9
413778 AA090235 Hs.75535 myosin, light polypeptide 2, regulatory, 12.0
420813 X51501 Hs.99949 prolactin-induced protein 11.8
41533? Z44881 Hs.9012 ESTs, Weakly similar to S26650 DNA-bindi 10.2
407013 U35637 gb:Human nebulin mRNA, partial cds 9.1
433331 AI738815 HeJ 17323 ESTs 8.9
432117 AL038195 Hs.2909 protamine 1 8.3
453863 X02544 Hs.572 orosomucoid 1 8.2
431647 AI791314 Hs.31S946 gb:as46g1 2y5 Stratagene lung carcinoma 7.4


408655 T83061 Homo sapiens mRNA for KIAA1727 protein, 7.1
446059 AI459021 Hs.170425 ESTs 6.8
403812 Target Exon 6.4
405001 U58196 Interleukin enhancer binding factor 1 6.2
441490 N46901 Hs.2S6720 ESTs 6.2
435805 AW470260 Hs.48496 ESTs 6.0
401917 AL050149 RAN binding protein 3 5.9
455549 BE065051 gb:RC1-BTO3l3-1105O0^>17-cO4 BT0313 Homo 5.7
450218 R02018 Hs.168640 ankylosis, progressive (mouse) homolog 5.6
404606 Target Exon 5.6
432326 AI280308 Hs.274361 amiloride-sensitive cation channel 2, ne 5.5
453706 AA776881 Hs.168812 Homo sapiens cDNA FLJ14132 fis, clone MA 5.5
440808 AK001339 Hs.7432 hypothetical protein FLJ10477 5.5
458339 AW97GB53 HsJ72a43 ESTs 5.2
454276 AF217525 Hs.49002 Down syndrome cell adhesion molecule 5.2
417032 AA192469 Hs^7183B ESTs 5.2
434352 AF129505 Hs.86492 small muscle protein, X-linked 5.1
439175 AF086021 H^271113 ESTs 5.0


439749 AL389942 Hs.157752 Homo sapiens mRNA full length insert cDN 4.9
417364 N73749 Hs.222475 ESTs 4.9
400631 C11Q00936^l|3746443|gb|AAC63969.1 j (AFO 4.8
416935 AA190712 gb:zp87f0%r1 Stratagene HeLa cell s3 93 4.8
442062 R41823 Hs.7413 ESTs 4.8
452625 AA724771 Hs.61425 ESTs 4.7
440965 AI523846 Hs.169859 ESTs 4.7
456815 NM_013348 Hs.1440lt potassium inwardly-rectifying channel, s 4.7
430611 AA837120 Hs.156481 ESTs 4.6
415981 R35694 gb:yg67b04.r1 Soares infant brain 1NIB H 4.6
441040 AW449762 Hs.178803 ESTs 4.6
442764 AI762254 Hs.131122 ESTs 4.6
411426 BE141714 gb:QV0-HT010l-061099632«O4HTO1O1 Homo 4.6
433061 23 5386 Hs.188757 Homo sapiens, clone MGC:5564, mRNA, comp 4.6
401696 Target Exon 4.6
445323 AW207282 Hs.213049 ESTs 4.5
456895 AI535663 Hs.39379 ESTs 4.5


417729 243796 Hs.6777 ESTs 4.5
431627 AW609720 Hs^65540 HSPC042 protein 4.5
420721 AA927802 Hs.159471 ZAPS protein 4.4
449519 W04244 Hs.49829 ESTs 4.4
442089 AI801500 Hs.128457 ESTs 4.4
436781 AI914535 Hs.221377 ESTs 4.4
402797 Target Exon 4.4
404267 NM_004348*:Homo sapiens runt-related tra 4.3
442931 AI024376 Hs.150473 ESTs 4.3
418626 AW299508 Hs.135230 ESTs 4.3
423772 AA306637 EAP30 subunit of ELL complex 4.3
457136 AA428240 Hs.126083 ESTs 4.3
438404 AW968556 Hs.137240 Homo sapiens mRNA for partial 3'UTR, seq 4.3
458840 AI580296 Hs.174762 ESTs, Weakly similar to KIAA1437 protein 4.3
403649 Target Exon 4.2
435866 AA704S36 Hs.119740 ESTs 4.2
423671 AA331906 gb:EST35805 Embryo, 8 week 1 Homo sapien 4.2
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436190 
400164 
435519 
425034 
41592B 
415/45 



4S3752 
404260 
453412 
401411 
415925 
402B17 
449233 
435457 
444105 
430610 
411320 
409679 
415386 
446175 
459707 
406917 



443313 

452351 

4004S9 

4U823 

408229 

427318 

417620 

404660 

409144 

438524 

459200 

41BB33 

424SB7 

404964 

454549 

426736 

434552 

438040 

443518 

442097 

435186 

429B53 

458521 

453282 

400749 

450295 

421154 

434378 

421342 

406704 

429043 

438211 

40525B 

440207 

445045 

408177 

438535 

405735 

445797 

444266 

442027 

447889 

405152 

402460 

44B516 



AK001059 

AJ21B950 

A1955956 

R46799 

AA167670 

AKJ28767 

AL12QB0Q 

AJ003290 

H0S474 

H24185 

BE048401 

AA6B2421 

AW189097 

AJ821465 

AW636646 

BE250521 

Z430&7 

AL036568 

AA631362 

X659B4 

BE154173 

AI796730 

AA025647 

AW865749 
AW176091 
AF166081 
R02530 

AVW341167 

AA922236 

Y09306 

AW974599 

AA348112 

AW806910 

AA431615 

AA63361B 

A1222422 

TB2009 

AW015799 

AL1 19470 

AB020625 

AW373450 

AKG0QM3 

AI766732 

AA284333 

AA631739 

AA504749 

M21665 

AI824S77 

T08401 

A1371978 
AI6S2676 

L09076 

AI253414 
AS25304 
A1652926 
AW469180 



447600 
4Q60B5 
417959 
418872 
446593 
446516 
415BS6 
410140 
458539 
422399 
4133B1 
429422 
428561 



AA010319 
At420990 

AW977642 

L442B4 

W79572 

AW996692 

H08311 

AL134435 

AT733637 

AW410380 



AKG014W 
AW973652 



Hs.1 25461 
Hs.21417 
Hs.23966 
Hs.21413 
Hs.262603 



Hs.202341 

Hs.196511 
Hs.59126 
Hs.1 66597 
HS.1BB810 



Hs.291 
Hs.120866 



Hs.55513 



Hs.175783 
Hs.191198 

Hs.279714 

Hs,221037 

Hs.30148 

Hs.292776 

Hs.259461 



Hs.1 30722 
Hs.325116 
Ha.121846 
Hs.300700 
Hs.128474 
Hs.145631 
Ha.225949 
Hfc.2BS212 
Hs.32922 

HS.210S26 
Hs.287631 
Hs.335440 

Hs.929 

Hs.145319 

Hs.290856 

Hs.126326 
Hs.147256 



rte,190312 
Hs.128395 
HS.34639B 



Hs.60389 



Hs.291742 

Hs.12915 

H5,13277 

Ns.257557 

Hs.14822 

H&547B37 

Hs.145661 

HS.1160SS 

Hs.202595 
Hs,283105 



gfcHomo sapiens cDMA FU1 01 97 lis, clone 4.2 

Eos Control 4,1 

hypothetical protein FU11S39 4.1 

ESTs 4.1 

ESTs 4.1 

solute carrier family 12, {potassurrvchl 4.1 

ESTs 4.1 

oo:DKFZp762£152_r1 762 {synonym: hmeJ2} 4.1 

Target Exon 4.0 

gbAJ 003290 Selected chromosome 21 cONA 4.0 

ENSP00000247172^YPOTHETICAL 126.2 kDa 4.D 

Homo sapiens cDNA: FU23573 fls, ctone L 4.0 

hypotberjcal protein 4.0 

ESTs 3,9 

ESTs 3.9 

ESTs 3.9 

ESTs, Weatcly sMarto ALU6J1UMAN ALU S 3,9 

gb:PM3-LTGQ32-ra01 0 WXH-aOS LT0032 Homo 3.B 

ras hnmotog gene family, member A 3.B 

gb:HSCt3A121 normafced infant brain cDN 3.8 

glutamyl eminopepfidase [aminopBptidase 3,8 

gb:rip&6b01.s1 NC|_CGAFLThy1 Homo sapiens 3.8 

gb:H.saolen8 nestta gene. 3.8 

gb:PM1-HT0340-201 299-C04-fl2 HT0340 Homo 3.8 

ESTs 3 8 

gb;zeB5d01.r1 Scares JeteUbearUlbHHI 9 W 3^ 

C1000ia585i|6CT9l24|rel|NP_rj32759.1| ne 3.7 

Qb:QV3^N0021-100500-185-c03 SN0021 Homo 3.7 

gbcQVD-BT0107'25089&-007-K)8 BT0107 Homo 3.7 

iJnc transporter 3.7 

ESTs 37 

C9000841*:gl|12654691|grj|AAH01 185.1 [AAHO 3.7 

ESTs 3,7 

ESTs 3.7 

horneodomaln^rrteractlng protein kinase 3 3.7 

ESTs 3.7 

ESTs 3,7 

Target Exon 3L7 

gb:QV4-ST0023-160400-172^04 ST0023 Homo a7 

ESTs 3.7 

Homo sapiens, done MGC!2962, rnRNA, comp 3.6 

ESTs 3.6 

hypothetical protein FU20727 3.6 

ESTs 3,6 

ESTs 3.6 

butyophfim-llke 3 3.8 

hypcthah'cal proton FU11729 3.6 

hypothetic^ protem FU2D036 3.6 

NMJX)3105*:Homo sapiens sorfflirvretated 3.6 

ESTs 3.5 

Homo sapiens cDtf A FU14269 fis, clone PL 3.5 

EST 3.5 

gb^a53108.r1 NCI,.0GAP_GC81 Homo sapiens 3.5 

myosin, heavy polypeptide 7, cardiac mus 3.5 

ESTs 3.5 

ESTs, Moderately similar to AUMJHUMAN A *5 

Target Exon 3.4 

ESTs 3.4 

ESTs 3.4 

Target Exon 3.4 

gftHomo sapiens mRNA fragment 3.4 

BISP000002521fi4':KrAA1578 protein (Fragm 3,4 

gir.aq14f04.x1 Stanley FronWl NS pool 2 3.4 

ESTs 3,4 

ESTs 3-4 

ESTs 3.4 

Target Exon 3.4 

C1001261^l|2695979lemblCAA70854.1| (YO 3.4 

gb£RC1-NhM07M6040CM)11flG9 NN0073 Homo 3.4 

ESTs 3,4 

ESTs 3.4 

Target Exon 3,3 

ESTs 3.3 

ESTs 3.3 

hypothetical protein FU22054 3.3 

ESTs, WeaWy slmlarto 138022 hypothsti 3.3 

ESTs, WeaWy similar to J78B85 seraie/lh 3,3 

neurexin3 3.3 

ESTs 3J 

mesenchymal stem eel protean DSC43 3u3 

gl>:RG1-BTu720-28030CW)1 1-g02 BT072D Homo 3.3 

Homo sapiens cONA FU10632 fis, clone NT 3.3 

ESTs 3,2 
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445609 
449017 
415908 
457733 
449340 
428134 
434407 
429973 
418092 
411356 
443366 
422069 
420778 
403451 
4515S6 
423B37 
422731 
450049 
428656 
401278 
4180B7 
421813 
402490 
448001 
445316 
405150 
413784 
425198 
433224 
421894 



405422 
42DS43 
410527 
409079 
406600 
408025 
426349 
444576 
430661 
441335 
459547 
435713 
407275 
456103 
435200 
449245 
428132 
443454 
423732 
411944 
432718 
435534 
430348 
410289 
453126 
4S9421 
441543 
442252 
419254 
443727 
419696 
453898 
415098 
415131 
405398 
433942 
40M61 
442640 
43B814 
43B361 
422482 
407142 
4543t9 
442907 
456075 



AI830851 

AW0D2425 

H08623 

AW374812 

AW2357B6 

AA421773 

AW815333 

A14233I7 

R45154 

H45377 

AI053501 

AJ010063 

AW970512 

AA059246 

AW937D53 

AL13B411 

A1681234 

AB03779B 



AW293237 
AI219833 

BE165819 
AA352090 
AB040919 
A1416464 
AA236667 



AA276221 
M181339 
WB7707 

AI692784 

A)308855 

AI400974 

AC005551 

BE222470 

R34107 

AAB99313 

A^364186 

Z39430 

AA670310 

A1636539 

AA421765 

AKJ57494 

AF058056 

AW877139 

AA563943 

AA830927 

AA476915 

AW901618 

AA032155 

AA233154 

AI733014 

AI733395 

AMB9453 



452570 
408493 
447556 
451632 



AI205646 

AA826278 

AA805666 

AI43990S 

AA412535 

AW247736 

A1023763 

N73442 

AAB15031 

AW86t293 

BE205854 

AJ88S187 



Hs.200014 

Hs.224142 

Hs.22833 

Hs.291971 

Hs.195359 

H8.1610DB 

HS.1646B0 
Hs.338439 

HS.278B69 
HsJ43603 



Hs.110293 
Hs.275150 

Hs.256509 
Hs.tfiB790 



AA961613 Hs.127838 



Hs.202037 
Hs.166767 



Hs.207664 
H6.128003 
H3.21095B 
Hs-190836 



402422 

435849 BE305242 Hs.16098 



Hs.173344 

Hs.929 

Ha.82065 

Hs.41767 

Hs^01497 

Hs.182045 

Hs.130714 

Hs.150625 

Ha521450 

Hs.1 14071 

HsJ2350 

Hs.145903 

H«L2242S6 

Hs.133421 
Hs.132183 

Hs.244371 

Hs.117306 

Ha.1B9225 

Hs.61935 

Hs.61822 

Hs.12532 

Hs.269715 

Hs.129t24 

H3.222760 

Hs.18459 



AW003512 Hs.232770 



D61119 

AW272166 Hs.123465 



Hs.147220 
HsJ249596 
Hs.146217 
Hs£44476 

Hs.101617 
Hs.79707 

Hb.123598 

Hs.336940 

Hs.46039 

Hb_210710 

HsJJ10625 



ESTs. Weakly similar to AQDIC PROUNE-R 

ESTs 

ESTs 

ESTs 

hypothetical protein MGC10954 
ESTs 

gb:QW3TD215-G6010£W}B3fl0l ST02l5H D ino 

ESTs 

ESTs 

gb.7n99fi03.r1 Soares adult bran N2h5HB5 
ESTs, Moderately similar to 2109260A B c 
titErhCs?> (teletfionin) 

gb:EST3B2593 MAGE resequences, MAGK Homo 

Target Exon 

ESTs 

gb;PM3CfT0037-231299«rt^t1 DTOD37Homo 
gb;DKFZp434A1229_r1 434 (synonym: htes3) 
EST 

KIAA1377 protein 
Taiget Exon 
ESTs 

gtdz49b05.y1 NC!_CGAP^Bm52 Homo sapian 

TaigetExon 

ESTs 

ESTs 

Target Exon 
ESTs 

hypothetical protein FU21213 

K1AA14B6 protein 

ESTs 

ESTs. WeaWy similar to 135022 tiypotheti 
Target Eron 
claucfci 2 

ENSP00000216658':rJyPO7HETICAL 133.5 kDa 
ESTs 

myosin, heavy polypeptide 7. cardiac mus 

Interteuta'n 6 signal transducer (gpl30, 

TargelExon 

PT DO 02 protein 

aitfnytasfarase 1 

ESTs 

Homo sapiens HSPC323 mRNA, partial cds 
ESTs. Weakly sTmflar to anffgan NY-C033 
pregnancy specie beta-1-glycoprotein 1 
ESTs 

gb:qw34h07.x1 Na.CGAPJJWRomo sapiens 

ESTs 

ESTs 

ESTs 

gb:zu24g02jBl Soares_fftHMPu^S1 Homosapi 
ESTs 

solute carrier family 16 (monocarboxylic
gb^V2-PT0010-160400-133-90t PTOH0 Homo 
ESTs 
ESTs 

ESTs, WeaSdy similar to 138022 hypothetf 
Homo sapiens mRNATcONA DKFZp76ll071 (fr 
ESTs 

chromosome 1 open reading frame 21 

ESTs 

ESTs 

ESTs 

ESTs 

gb:HSZ9B362 DKFZpharnyl Homo sapiens cDNA 
arachidonate lipoxygenase 3
gb:HUM056E10B Clontech human fetal brain
gb:HUMl58CtlB Clontech human fetal brain
Target Exon 
ESTs 

Target Exon

ESTs 

ESTs 

Homo sapiens cDNA: FLJ23077 fis, clone L
Bb^57g0fljt1 NCLCGAP_Lym12 Homo sap an 
Bbxt99b10^1 Scares jBstJsJiHT Homo sap 
ESTs, Weekly similar to T32527 hypothatf 
ESTs 

pbyz31h09.r1 SoarBa_multtple_5cterosls_ 
ESTs 

Homo sapiens cDNA: FLJ20861 fis, clone A
phosphoglycerate mutase 2 (muscle)
ESTs, Moderately similar to ALU6_HUMAN A
EST 
426481


AWS63941 




401656 






407269 


AJ245210 




435754 


AA700752 


Hs.1 17341 


433565 


AA599763 


Hs.11 2520 


451004 


AA044967 


425223 


AA352825 


Ha.546065 


405770 




456227 


T84239 


H5.18978B 


454445 


AW749432 




419494 


W01060 


HsJ4382 


427639 


AW444530 


Hs. 105362 


428079 


M421020 


Hs.208919 


406337 




401 804 






4068B1 


D16154 




428986 


AA4428B4 


Hs.201201 


450044 


R60444 


Hs.51891 


403630 




445514 


A1241260 


KG.14B9Q6 


446362 


AW6124M 


Hs.104105 


432492 


AW275110 


Hs.271106 


4308B9 


U22491 


Hs.248117 


434316 


AW411330 


Hs.118796 


413155 


BE067952 




433329 


AF01S041 


Ha.199291 


446523 


NMLG030G3 


Hs.334629 


449923 


BE258051 




453826 


AL138129 




405576 






432789 


D26361 


Hs.3104 


455791 


BED906B9 




449109 


AW270992 


Hs,120949 


436255 


F01143 


Hs.284284 


415984 


R19Q46 


Hs.5010 


402844 






458665 


AA452512 


Hs.76719 


414620 


AA150120 


Hs.244621 


404979 




412318 


AW935911 


Hs.328729 


424361 


AK001551 


Hs.145944 


412542 


Avraeisie 


Hs.95097 


441975 


AW17324B 


Hs^44285 


457021 


AW968334 


Hs.173108 


457B51 


AW450205 


Hs.305890 


439204 


AF067987 


Ks.42696 


415642 


U1S87B 


Hs.336224 


446847 


T51454 


Hs.B2845 


443359 


AI792563 


H&135354 


447336 


AWI39383 


HS245437 


449045 


BE072483 


Hs^78337 


422165 


AL117530 


Hs.1 12822 


436030 


R02287 


Hs.121052 


449569 


AW752437 


Hs.135258 


436092 


AI345595 


Hs. 127383 


415064 


AI733S07 




412908 


AA121913 


H&2938B6 


409583 


AW440117 


Hs.256879 


410463 


BE1 63567 


435383 


R61083 




426629 


A1203933 


Hs.97142 


415831 


H15145 


Hb.30309 


412281 


AIB10064 


Hs.14119 


434898 


AW5O0458 


H&.29956 


422229 


AF134414 


Hs.113271 


447518 


T60061 


458546 


AI215667 


Hs.l7SD44 


43664a 


AA813125 


Hs.146335 


450399 


AW511049 


H&.202DO7 


420833 


R47946 


H3.168732 


453903 


AW298606 


Hs^32777 


443650 


A1S93330 


Hs/151444 


427419 


NM_000200 


Hs.17788a 


423741 


AA330362 




451577 


N69101 


H5.40730 


441358 


AW173212 


Hs.12904t 


402706 




436054 


AI076262 


Hs.119813 


402749 




442472 


AWQ06659 




445762 


A1734002 


He.264590 


441420 


AA932872 



gb:EST37S014 MAGE resequences. MAGH Homo 
Target Exon 

gtcHorno sapiens mRNA for itnmunogtobufin 

ESTs 

ESTs 

gb^f53d09.r1 Soares retina N2b4HR Homo 
gb:EST60B8O Activated T-celfe XX Homo sa 
NM_002362:Hamo sapiens melanoma antigen, 
ESTs 

gb:RC3-BT0386-301299-Of t-*09 BT0386 Homo 
ESTs 

Homo sapiens, clone MGG18257, mRNA, com 
ESTs 

C14000021fli|7242973|dbj|BAAg2547.li {AB 
Target Exon 

gb:Human gene for cytochrome P-450c 11, e 

ESTs 

ESTs 

C300l70B*^I|475a028Iref|NPJ)04360.1| Co 

ESTs 

ESTs 

ESTs 

G protein-coupled receptor 7
annexin A6

gb:CM0-BTO3654)6l299Ll22^09 BT0365 Homo 

numb (Drosophila) homolog-like

sarcolipin

B b:601111O34F1 NIKLMGQJ6 Homo sapiens c 
gb:DKFZp647F152_/1 547 {synonym: hfbrl) 
CX00l454:gl|8393794|renNP_058681.1| myo 
KJM0042 gene product 

9bcRC1-BT0720-280300^11^08 BTO720 Homo 
ESTs, Weakly almlfar to ALU7JHUMAN ALU S 
zinc finger 1111 

gb: ro 21H1.r1 Soares jrfarrt brain 1 NIB H 
C10001 18\c#9951913lief|NPJIB28321J pr 
U6 snRNA-associated Sm-like protein
ribosomal protein S14
Target Exon 

hypothetical protein MGC11082 

Hbmo sapiens CDNA FU1D6B9 fo, done NT 

ESTs 

EST 

Homo sapiens cDNA: FU21697 te, done H 

BCL2-like 1

EST 

transmembrane protein with EGF-like and 
Homo sapiens cDNAt FU21930fls. clone H 
ESTs 
ESTs 

Homo sapiens cDNAFUl 1537 fls, clone HE 

DKF2P434B172 protein 

ESTs 

ESTs, Weakly similar to ALU7_HUMAN ALU S
ESTs 

gb:zo86h09.y5 Stratagane ovarian cancer 
pregnancy-associated plasma protein-E
ESTs 

0b:QV3-HTO4SO-23O2OO-1Ol-bOa HT0460 HomD 

wee1 (S. pombe) homolog

ESTs 

ESTs 

ESTs 

KIAA0460 protein 

ABO blood group (transferase A, alpha 1-

gb^J22TO8.s1 Soares fetal liver spleen 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs

NstBun3 

gb:EST34074 Embryo. 12 week II Homo sapi 

ESTs 

ESTs 

Target Exon 
ESTs 

Target Exon 

gb:MR0^T002f>08119WI04-c03STO02O Homo 
ESTs, Moderately sirniferto ALU5_HUMAN A 
gbxx>57d07j1 NCUCGftP_Lu5 Homo sapiens 
Target Exon
Target Exon


Z6 




Attn 00 tw 


Hs. 346429 


ESTs 


26 


44 i oa i 




Hs. ZD 3625 


ESTS 


26 


411280 


N50617 


Hs. 90506 


small nuclear ribonirdeoproteln polypept 


26 


4205O9 






tumor necrosis factor receptor superfamT 


26 




A A ROACH 1 


L>_ 'Vienna 

lis. 23290a 


ESTs 


26 




BE172762 


HsJ292710 


ESTs, WeaMy similar to ALU5_HUMAN ALU S 


26 


4*lDM/ 


T7B001 


HS. 1 33546 


hypothetical protein RJ21 120 


26 


40171 G 






r k iflnnnjQfw-n7i/n , wjtnfrtkii a A7iicn n /kuki 

\ji™UUyUz.y][HUJ44U^D[AjKM/0 lt)0.1| (MDbf 


1 c 






gbiza62dG4.r1 Soaras fetal Infer spleen 


26 


WOW 


N37059 


HS.30/50 


ESTs, Weakly similar to I3B022 hypothec 


26 


440702 


AA904178 


Hs.146233 


ESTs 


26 


456310 






gb:nc25c0bj1 NCLC6AP_Prt Hot» sapiens 


2.6 






HS.9U/44 


ESTs 


2.6 


400/0/ 






gb:QV4-flT0536-27^299-059-b12 BT0536 Homo 


26 






13-, q>rji£-v«j 


to IS 


26 


408432 


AW4 QCOCO 

AW1 95262 




grocn67c05,x1 NCLCGAP_CML1 Homo sapiens 


26 


J^A07A 
4400/4 


Hkdibbr 1 


Hs. 170058 


ESTs 


26 


441U03 


A ACM 464 O 

AAa loo19 


Hs.1 88025 


ESTs 


26 


455505 


AW97D64Q 


Hs.309071 


ESTs 


26 








gb:DKFZp434F02l3j , 1 434 {synonym: hiesj) 


26 


455048 


AW852749 




gb:PM1^T0247^0100-OOa4irjS CT0247Homo 


2.6 




mm rvMKftd 
NM^U045U4 


Hs.171545 


Hiv-1 Rev binding protein 


26 


444130 


All 25263 


Hs.170410 


ESTs 


2.5 






ns. 1 71937 


steroid dehydrogenase-! Ike 


25 


4Z4£4I 






Homo sapiens pyruvate dehydrogenase klna 


25 


4Q (DUB 




nS./8*lo 


Aib-7d anaidg protein protein 


25 


4W4t/ 


AHU449d4 




histamine H4 receptor 


25 


410443 


BE062906 


Hs. 28338 


KJAA1546 protein 


25 


433Z1U 


AUfDCCEOQ 

AVVobboXJ 




gb:QV4-SN0&24-21 0400-1 8 VblO SN0Q24Homo 


25 




AW977030 


rl5.1o4a&) 


GGi-203 prolan 


25 


441191 


AJ69393Q 


Hs. 148816 


ESTs 


25 


413489 


BE1 4422a 




gb:rW»-HT0165-l402D&fl09-d04 HT0165 Homo 


2.5 




N34740 


Hs.6658 


ESTs 


2,5 






Hs.1252 


apolipoproleln H {oeta-2-glycoprotetn I) 


25 


44761 Q 


AW295286 


Hs 255 534 


ESTs 


25 


45U7Z4 


R55428 




g"bcyj79b05.r1 Scares breast 2NbHBst Homo 


25 


414523 


Al I/YtCC'Sq 
AUUfDboi 


Hs.76353 


serfne (or cysteine) proteinase rihibito 


25 


414776 






nypotneucai protein ru i*t iyo, na/vw 14 


JLo 


447730 


A1421251 


Hs.114085 


Homo sapiens mRNA far KIAA1 755 protein, 


25 


434152 


AI792665 


Hs.291190 


ESTs 


25 


412671 


AW977734 


Hs.37931 


gb:EST38S953 MAGE resequences, MAGO Homo 


2.5 


456401 


W28146 




Sb:43f11 Human retina cONA randomly prim 


25 


404678 






Target Excn 


25 


408520 


AA22S063 


Hs,161614 


ESTs 


25 


411332 


AW837212 




gb:QV2-LT0036-14O300-O81-cO1 LT0038Homo 


25 


448920 


AW406009 


Hs. 22580 


a^ylgVcarone phosphate synthase 


25 


429906 


AUDB0137 


Hs.193743 


ESTs 


25 


433712 


AF090B87 


Hs.306562 


Homo sapiens clone H 00085 


25 


438353 
446224 


BE539951 


HS.30G99B 


Homo aapJens, clone IMAG&3447073, mRNA, 


25 


AW450551 


Hs.13308 


ESTs 


25 



TABLE 63B 



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey 


CAT Number


408229 


1048462J 


408432 


1058667 Jl 


409679 


114787 1 


410483 


1204995 1 


411320 


1238624J 


411332 


1239102^.1 


411356 


1240273J 


411426 


1245515 1 


411829 


1260309 1 


411944 


126648ZJ 


413155 


1351 148.1 


413381 


1365950 1 


413489 


1373392J 


415054 


15182? 1 


415098 


1522174.1 


415131 


15236 B0 J 


415366 


1535560J 


415S81 


1564242^1 


416628 


1604848L1 


416935 


163179.1 


419386 


184356J 


419896 


1BB8662.1 


420778 


196389.1 



Accession 
AW176091 H24234 
AW195262 R27B6B AW611262 

BE250521 AA076837 BE249870 AA984291 AW5D2442 AW501551 AT22149! AA194239 D63046 AA193426 AA773243AA1 93293 

BE163567 BE073669 BE073747 BE073780 BE073739 BE073748 BE163495 AW75017B BE163491 BE073763 BED73671 

AW836646 AW836BB0 AW636610 AWB36636 AW836603 AW836632 

AW837212 AW637406 AW837255 AW837380 AW837213 AW83741 1 AW637418 

H45377H21137AW838640 

BE141 714 AW845993 AW84S989 

AW865749 BE179419 BE179492 

AW677139 AW677135 AW87701B AW991 835 AW87712B AW677106 AW87701 7 AW8771 07 

BE067952 BE067945 BE067942 BE067943 BEG67949 BE067954 BE067944 BE067953 BH067956 BE067946 

BE090690 BE0906B8 BE090681 BE090693 BE090675 

BE144228BE144291 

AI733907 AA159708 A1732614 

D5S6B7 D59694 D59656 D59589 

061119 D81S0BD81734 

Z43087 F0T410H15S06 H5410SR95033 H96000 

R3S694 H12035 R53312 

W03955 H82332 H69247 H72486 

AA1 90712 AA190665 AA252564 

AA236B67 AA237068 AA354236 AW957759 H08961 



AW970512 AA280251 At652287 BE466438 At650725 AA551854 AA281574 AW571481 
421342
421B13 
422731 
423741 
423772 



423B71 
426481 
428132 
431B47 
434407 
43S3S3 
43B190 
438535 
441420 
442472 
445797 
447518 
447600 
448516 
449923 
450724 
451004 
452351 
453412 
453491 
453752 
463326 
454445 
454549 
455046 
455210 
455649 
455737 
455791 
455B87 
456075 
456310 
45640* 



201427.1 
207654_1 
220507J 
231582J 
231BB.1 



232749.1 

267876.1 

2d7430_1 

338402.1 

3B5744.1 

40536QJ 

41555.1 

45346.1 

515748 1 

54337U 

650943^1 

724787.1 

728288J 

766241J 

B1926.1 

B44585J 

B5453J 

91233.1 

966264J 

969172_-1 

97969RJ 

982669.1 

1204469J 

1223789.1 

1250599.1 

1260650J 

1348708J 

1353892.1 

13B5954J 

13B0835J 

147675S 1 

177089.1 

1844549 w 2 



AA504749AA28749B 
BE048255 AA313083 AA258419 
AL138411 AL13B412 AA315860 
AA330362 AW962525 HB7796 

AA306637 NM.007241 AF156102 BE38B339 BE614027 AW935016 AA315261 AW37641B AW304951 AI859B20 AA367436 AW516957 BE614519 
A1453152 At453149 AI453139 AW166378 A1139491 A1538368 AW468227 Ai6BQ027 AW090513 AA662830 F30395 AI3519B5 A1424349 AW009599 
C02215AI6525 
AA331906AA332484 

AW963941 AWSS3944 AA373825 AA379564 

AA421765 AA456078 A1290275 AA455579 

AI791314 AI791 434 AA516511 

AW815333 AW815409 AA632563 

R61 083 R13743 AA679174 AA679193 Z42903 

AKD01059AA633055 

L0907B LO3145L09094L0909B L03165 L09102 
AA932872 W280S8 W2B643 T96110 
AW8Q6B59 AW806B52 AF049582 
AI253414A1366014R34822 
T80061 AI382804 
AM20990 Al 399725 Al 401757 

AW898595 AW898588 AW898590 AW898563 AW898592 AI525093 

BE25B051 R45758 AA004732 BE255126 

R5542B AIS20704 AI732283 R54983 

AA044967 H&6327 AA013079 AA058776 BE242713 AA019987 

AA025647 R45716 AW753786 

AJC0329O AJ0032aB AW276947 

AL040177 

AL120BQOBE37B580 
AL138129 AL13B1 79 BEQ64231 
AW749432 AW749434 R47332 

AW8O6910 AW866461 AW856396AW86B373 AW86B611 AW866616 
AWS52749 AW852755 AW852620 
AW866599AW866294AW866468AW866467 

BE065051 BE155165 BE064764 BE155231 BE054648 BED64671 BE0B4636 
BE07224S BE072229 BE072225 BE072210 BE072221 BE072256 BE&72211 BE072242 
BEO90689 BE0906B5 BE090697 BE090680 BE090G91 BE09G696 BE090696 BE09D6B6 
BE154173 BE15409B BE154096 
N73442 R98100BE41O38O 

AA225522 AA225465 A1B2Q979 AW9739B5 AI791S35 AA558735 
W28148W28187 



TABLE 63C 



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of

human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Pkey


Ref


Strand


Nt_position


400461


9929654 


Plus


32727-32846.3292^33051 


400499 


9796071 


Minus 


143495-148803 


400749 


7331445 


Minus 


9162-9293 


400831 


8576271 


Minus


56502-57407 


401278 


9799936 


Plus 


98428-96573 


401411 


7799787 


Minus 


144144-144329 


401656 


9100664 


Minus 


1-382 


401716 


6715703 


Plus 


174722-174911 


401884 


8140731 


Minus 


8916*90053 


401896 


8569194 


Plus 


115123-115294 


401917 


9502466 


Plus 


25054-25229 


402422 


9796344 


Minus 


32643-33008 


402460 


9796884 


Minus 


10B901-1Q9254,11024&110581 I 113E13-113960 


402430 


9797648 


Plus 


149982-150929 


402706 


8B94426 


Minus 


148640-148805 


402749 


9212740 


Minus 


68787-68882,76602-76768 


402797 


3421043 


Minus 


15758-15930 


402817 


6822166 


Minus 


48611^9012 


402844 


9369286 


Plus 


54958-55313 


403451 


9836240 


Plus 


773B2-783Q0 


403612 


8469060 


Minus 


94723-94859 


403630 


8569999 


Minus 


13909-14466,15251-15760,16898-17431,41742-42440 


403649 


8705159 


Minus


27141-27247 


404260 


9366879 


Plus 


51396*51513 


404267 


9581792 


Minus 


12209-12313,18241-18397 


404606 


9212936 


Minus 


22310-23269 


404660 


97970BB 


Plus 


168215-168916 


40467B 


9797204 


Plus 


115196-115448 


404979 


4160139 


Minus


87762-88217 


404994 


G9398B2 


Plus 


67221-87505 


405001 


6015406 


Minus 


104646-104819 


405150 


99297SB 


Plus


126475-126773 
405152


9965561 


Minus 


137662-137969 


405258 


7329310 


Plus 


129930-130076 


405422 


7243869 


Minus 


101938-102079,102261-102443,102896-103202 


405564 


2114222 


Minus 


16766-17344 


406678 


4079670 


Plus


151821-152027 


405735 


9931 101 


Minus 


29854-29976 


405770 


2735037 


Plus 


61057-62075 


406003 


6247800 


Plus 


42079-42516 


406035 


9123888 


Plus 


18665-18843 


406177 


7279760 


Minus 


1B930-19148 


406337 


9213455 


Plus


90117-90337 


406399 


9256275 


Minus 


118691-118959 


406600 


8248616 


Minus 


36296^6610 



Table 64A lists about 703 genes up-regulated in BPH compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip array
such that the ratio of "average" BPH to "average" normal adult tissues was greater than or equal to 3.0. The "average" BPH level was set to the 75th percentile amongst BPH tissues.
The "average" normal adult tissue level was set to the 85th percentile amongst non-malignant tissues. In order to remove gene-specific background levels of nonspecific hybridization,
the 10th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
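
As an illustration only, the ratio described above can be sketched in Python. The function names, the linear-interpolation percentile, and the treatment of a non-positive denominator are assumptions made for this sketch; the text does not specify how percentiles were interpolated or how such cases were handled.

```python
def percentile(values, q):
    """Percentile (q in [0, 100]) using linear interpolation between ranks.

    The interpolation rule is an assumption; the source only names the
    percentiles, not how they were computed.
    """
    data = sorted(values)
    if not data:
        raise ValueError("percentile of empty data")
    pos = (len(data) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(data) - 1)
    return data[lo] + (data[hi] - data[lo]) * (pos - lo)


def bph_ratio(bph_levels, normal_levels):
    """Background-subtracted ratio of "average" BPH to "average" normal level.

    numerator   = 75th percentile of BPH tissues      - background
    denominator = 85th percentile of normal tissues   - background
    background  = 10th percentile of normal (non-malignant) tissues
    Returns None when the denominator is not positive (hypothetical choice).
    """
    background = percentile(normal_levels, 10)
    numerator = percentile(bph_levels, 75) - background
    denominator = percentile(normal_levels, 85) - background
    if denominator <= 0:
        return None
    return numerator / denominator


def is_upregulated(bph_levels, normal_levels, cutoff=3.0):
    """True when the ratio meets the cutoff of 3.0 stated in the text."""
    ratio = bph_ratio(bph_levels, normal_levels)
    return ratio is not None and ratio >= cutoff
```

Under these assumptions, a probeset would be listed in Table 64A when `is_upregulated` returns true for its expression levels across the BPH and non-malignant tissue samples.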

TABLE 64A: ABOUT 703 GENES UP-REGULATED IN BENIGN PROSTATIC HYPERPLASIA COMPARED TO NORMAL ADULT TISSUES



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of BPH tissue to normal adult body tissue




Pkey 


ExAccn 


Unigene ID


Unigene Title


R1 


420154 


AIU93155 


Hs.95420 


JM27 protein 


49.0


42B747 


AA535210 


Hs.171995 


kallikrein 3, (prostate specific antigen


47.2 


419526 


AIB21B95 


Hs.1 93481 


ESTs 


43.6 


432441 


AW292426 


Hs.163484 


ESTs


42.7


431616 


AA508552 


Hs.195B39 


ESTs, Weakly similar to I38022 hypotheti


30.2 


407202 


K58172 


Hs.1 09370 


ESTs 


26.1 


432101 


AJ91895D 


Hs.123642 


EphA3 


25.6 


400259 


X07730 


Hs,171995 


kallikrein 3, (prostate specific antigen


25.5 


425075 


AA506324 


Hs.1B52 


acid phosphatase, prostate


24.6 


414569 


AF109298 


Hs.1 18258 


prostate cancer associated protein 1 


24.4 


410929 


H47233 


Hb.30643 


ESTs 


21.1


400287 


S39329 


Hs.1fi135D 


kallikrein 2, prostatic


20.3


445057 


AM2Q227 


Hs.149358 


ESTs, Weakly similar to A46010 X-linked


19.8 


423073 


BE252922 


Hs.123119 


MAD (mothers against decapentaplegic, Dr


18.6 


415989 


A1267700 




ESTs 


17.8 


428336 


A. ACM44C 

AA503115 


Hs.183752 


microseminoprotein, beta-


Ay ft 


450093 


AW450461 


Hs.203965 


ESTs


16.7 


400298 


AA032279 


Hs.61635 


six transmembrane epithelial antigen of 


16.7


4Q716B 


R45175 


Ua.117183 


ESTs 


15.5 


406369 


R38438 


Hs.182575 


solute carrier family 15 (H??? transport 


15.5 


454119 


BE54g773 


Hs.40510 


uncoupling protein 4


14.5 


428B19 


AL135623 


Hs,193914 


KIAA0575 gene product


14.5


400302 


N480SS 


Hs.1915 


folate hydrolase (prostate-specific memb


14.4


400297 


A11 27076 


Hs^06201 


hypothetical protein DKFZp5640127B 


14.3 


416432 


M14156 


HS.B5112 


insulin-like growth factor 1 (somatomedi


14.2 


432674 


AA641092 


Hs.257339 


ESTs, Weakly similar to I38022 hypotheti


14.2 


433444 


AW975324 


Hs.129616 


ESTs 


13.8 


432240 


A1694767 


H 3.1 25179 


Homo sapiens cDNA FLJ13581 fis, clone PL


13.5 


428342 


AI73916B 




Homo sapiens cDNA FLJ13458 fis, clone PL


12.9 


401424 






NM_001172:Homo sapiens arginase, type II


12.7 


432435 


BE218B85 


H3.28207D 


ESTs 


12.5 


448999 


AF179274 


Hs.22791 


transmembrane protein with EGF-like and


12.3 


446100 


AW967109 


Hs.13804 


Hypothetical protein d J462023.2 


12.0


4251S4 


NWL0O1851 


Hs.1 54850 


collagen, type IX, alpha 1 


11.9 


407275 


AI3S4186 




gtxqw34h07ji1 NCLCGAPJM Homo sapiens 


11.8


452340 


NMJ02202 


Hb.505 


ISL1 transcription factor, LIM/homeodoma


11.7 


432473 


AI202703 


Hs.152414 


ESTs 


11.3 


410330 


AWQ23630 


Hs.159425 


ESTs 


11.2 


431579 


AW971062 


Hs.22288G 


ESTs. Weakly simitar to TRHY_HUMAM TRtCH 


11.2 


452792 


ABD377B5 


Ks.30652 


K1AA1344 protein 


115 


41BB4B 


AJ820961 


Hs.193465 


ESTs 


10.9 


400292 


AA250737 


HSJ2472 


BMP-RIB 


105 


433647 


AA603357 


Hs.222294 


ESTs 


10.8 


453160 


A1263307 


Hs.239864 


H2B histone family, member L


10.8 


409282 


AKOQ0631 


Hs.52256 


hypothetical protein FU20624 


10.6 


431474 


AL133990 


Hs.190642 


CEGP1 protein 


10.3 


429220 


AW207206 




ESTs 


10.3 


428134 


AA421773 


Hs.1 61008 


ESTs 


10.2 


408622 


AA05S060 


Hs.202577 


Homo sapiens cONA FU12166 fis, clone MA 


10.1 


456497 


AW96795S 


Hs.123648 


ESTs. Weakly similar to AF10B460 1 ubfnu 


9.8 


434792 


AA64B253 


Ks.13245B 


ESTs 


9.7 


433466 


AAS08353 


Hs.105314 


relaxin 1 (H1) 


as 


439176 


AI446444 


Hs.190394 


ESTs, WeaWy drnllarto B2B09Q ltae-1 pr 


9.5 


428398 


AI249368 


HS.9855B 


ESTs 


9.4 
429290


AF203032 


Hs.19B760 


neurofilament, heavy polypeptide (200kD)


9.3 


407709 


AA456135 


Hs.23023 


ESTs 


9.3 


415293 


R49462 


Hs. 105541 


ESTs 


9.1


429918 


AW873986 


Hs. 119383 


ESTs 


9.1 


440260 


AI972667 


Hs.7130 


copine IV 


9.1 


453095 


AW294531 


Hs. 11325 


ESTs 


9.1 


410467 


AF 102546 


Hs.63931 


dachshund (Drosophila) homolog


8.9 


456616 


BE172704 


Ha .222746 


KIAA1610 protein 


8.8 


430722 


AW968543 


Hs^03270 


ESTs, Weakly simlar to AUJ1J4UMAN ALU S 


8.8 


458072 


AIB90347 


Hs^71923 


Homo sapiens cDNA: FU22785 clone K 


8.8 


453006 


A1352575 


Hb,303171 


ESTs 


8.7


418339 


AA639902 


Hs.104215 


ESTs, Moderately simitar to SPCrlHUM AN S 


8.7 


420424 


AB033036 


Hs.97594 


K1AA1210 protein 


8.7 


450842 


R30773 


Hs.7130 


copine IV


8.7


434036 


AJG59131 


Hs.1 97733 


hypothetical protein MGC2849 


8.7 


428927 


AA441837 


Hs.90250 


ESTs


8.6 


410B29 


AA51K31 


Hs.55999 


MK hcmeobox (Droaophila), family 3, A 


8.5 


420345 


AW295230 


Hb.25231 


ESTs 


8.5 


446336 


AW815036 


Hs.151251 


ESTs 


8.4 


419743 


AW408762 


Hs.5957 


Homo sapiens clone 24416 mRNA sequence 


8.4 


42B728 


NnU>16625 


Hs.191361 . 


hypothetical protein 


8.3


417169 


R13550 


Hb-246773 


ESTs 


8.2 


453387 


A1990741 


Hs.252809 


ESTs 


&2 


434666 


AF151103 


Hs.112259 


T cell receptor gamma locus


8J 


41795B 


AA7673B2 


Hs. 193417 


ESTs 


ao 


433433 


A1692623 


Hs.121513 


Homo sapiens dons 7*3-1 placenta expres 


7.9 


444609 


AW571559 


Hs.278081 


ESTs 


7.8


424099 


AF071202 


Hs.139336 


ATP-binding cassette, sub-family C (CFTR


7.8 


433923 


A1B23453 


Hs.146625 


ESTs 


7.7 


4375B7 


A1591222 


HsJ2325 


Human DNA sequence from clone RP1-187J1 1 


7.7 


441390 


R81733 


Hs.33106 


ESTs 


7.6 


453370 


AI470523 


Hs.139336 


ATP-binding cassette, sub-family C (CFTR


7.6 


441247 


AW11B681 


Rs.126051 


Homo sapiens thymic stromal lymphopoieti 


7.5 


439444 


AI277652 


Hs.54578 


ESTs. WasM/ simitar to I3SQ22 hypotheti 


7.5 


431446 


AL137517 


HS.3062M 


hypothetical protein DKFZp56401278 


7.4 


4135Q7 


AW302B85 


HB.1 17183 


ESTs 


1A 


404592 






HM_02273raomo sapiens £3 ublquitm lig 


7.3 


409557 


BE1B2B9B 


Hs-211193 


ESTs 


7.3 


420352 


BE25S835 




gbSOl117374F1 MIH_MGC_16 Homo sapiens c 


7.3 


438231 


AW594539 


H&.155689 


ESTs 


7.3 


440529 . 


AW207640 


Hs.16478 


Homo sapiens cDNA: FU21718 fts> clone C 


7.1 


449300 


AI656959 


Rs.346514 


ESTs 


7.1 


417332 


AW972717 


HS.28B462 


hypothetical protein FU21511 


7.1 


453930 


AA419466 


Hs.36727 


hypothetical prolan PL) 10903 


7.1 


449249 


T52285 


Hs.193115 


Homo sapiens mRNA for KIAA1 764 protein, 


7.0 


430187 


AJ79S909 


Hs.158959 


ESTs 


7.0 


434217 


AW014795 


Hs.23349 


ESTs 


7.0 


431217 


MM.013427 


Hs.250830 


Rho GTPase activating protein B 


6.9


4301 BB 


AL049242 


Hs.234794 


Homo sapiens mRNA; cOMA DKFZp564B0B3 (fr 


6.9 


415785 


AW419196 


Ks.257924 


hypothe&cal protein FU13782 


6.9 


421059 


AtG54l33 


Hs.30212 


thyroid receptor interacting protein 15


6.8 


426384 


AM7207B 


Hs.303662 


ESTs 


6.8 


45B509 


AA654650 


Hs^82906 


ESTs 


6.8


449821 


A1671141 


H^11122 


ESTs 


&a 


407110 


M156790 


Hs^&2036 


ESTs. Weakly similar to Z223JHUMAN ZING 
ESTs 


6.8 


443250 


A1041530 


Hs.132107 


6.7 


426330 


122524 


Hs.2256 


matrix metalloproteinase 7 (matrilysin,


6.7 


406197 


AA262262 


Hs.107410 


ESTs, Weakly slmferto A46010 linked 


6.6 


454457 


AW753456 




gb:QV2-CT0261-26l 099-01 1-d1 1 CT0261 Homo 


6.6 


421154 


AA284333 


Hs.237631 


Homo salens cDNA FU14269 Its, clone PL 


&5 


429433 


AA452899 


Hs.213586 


ESTs, WeaWy sirnikr to K1AA1353 protein 


&5 


431467 


N71831 


H^256398 


Homo sapiens mRNA; cDNA DKFZp434E052B [f 


as 


424433 


H04607 


Hb.9218 


ESTs 


6.5 


450164 


AJ239923 


Hs.63931 


ESTs 


6.5 


423566 


AW976434 


Hs.3623 


hypothetical proton FU11220 


6.5 


450497 


H64139 


Hs.15328 


ESTs 


6.5 


425312 


AA354940 


Hs.145958 


ESTs 


6.4 


426140 


AF13579B 


Ha.343768 


Homo sapiens clone 251 19 mRNA sequence 


6.4 


440911 


AA909536 


Hs.143562 


ESTs 


04 


412350 


A165S306 


Hs.73826 


protein tyrosine phosphatase, ncn-recept 


6.3 


432600 


A1621095 




gb3is95a12.y5 NCLCGAPJPrt Homo sapiens 


6.3 


435375 


AJ733610 


Hs.1 67832 


ESTs 


6.3 


416370 


N90470 


Hs.203697 


ESTs, Weakly similar to 138022 hypothetf 


6.3 


431359 


AW993522 


Ks.292934 


ESTs 


6.2 


424736 


AF230677 


Hs.152701 




6.1 


403657 






Target Exon 


6.1 


424846 


AU077324 


Hs.1832 


neuropeptide Y 


&1 


439569 


AW602166 


Hs^22399 


CEGP1 protein 


6.1 


436396 


A16S3487 


Hs.152213 


wingless-type MMTV integration site fami


6.1 


448004 


AW451477 


Hs.257456 


ESTs 


6.1


431770 


BE221B8Q 


Hs.268555 


exoribonuclease 2


ao 


453B61 


AI026838 


Hs.30120 


ESTs. Weakly annv to NUCUWMAN MUCLE 


6.0 


415621 


AI64B602 


Hs.55468 


ESTs 


6.0 
436873
442481 
432527 
447156 
404003 
424853 
427726 
433043 
43B13B 
440129 
434973 
406627 
459241 
4344BS 
429716 
443960 
453200 
428002 
432966 
445143 
420948 
425810 
414312 
404571 
424850 



427078 

422605 

417511 

449625 

433927 

441676 

421513 

432682 

435 021 

454171 

42G5B1 

439079 

443635 

400080 

410196 

436578 

424051 

450325 

421663 

432161 

435072 

428647 

441111 

433087 

450244 

400294 

415539 

437267 

441916 

452784 

448715 

452531 

426503 

423101 

445704 

450613 

409079 

415890 



AI093577 

AA033B32 

AA41S703 

AA650114 

NMJ016578 

AB016B93 

AI923627 

M155694 

AA151057 
AA830149 
AI676062 



441054 



437752 
446072 
419536 
N23874 Hs.50477 RAB27A, member RAS oncogene family

N9982B gb:za32c04rl Soares fetal liver spleen

AW975028 Hs.102754 ESTs 

AW274731 Hs.157920 ESTs 

Target Exon

BE54973? Hs.132967 Human EST ctonB 1 22BB7 mgrtner Iran&poso 

A)359144 Rs.143688 Homo sapiens cDWA: FU23Q31 tts, done L 
AA865B1 B Hs.174936 ESTs, Weakly similar to S71 BBS Ste2CWik

AW449ZB5 Hs^13636 EST 
AA032276 Hs.99010 ESTs, Moderately similar to T14342 NSD1
gb:yv04a07^1 Soares fetal liver spleen

C4001100*:gil5852342|gb|AAD54O15.1l (AFO 

ESTs, Weakly simflarto DYH9_HUMAN QUA 

gb:nh05d12s1 NCI_CGAP_Thy1 Homo sapiens 

hypothetical protein HTC23 

heparan sulfate (glucosamine) 3-O-suifot 
3.0


TQfi.HA 




EfiTe 


3.0 


71107 




rjoxtaunur Jti ooares_>eiai_r)ean jVDnniaw 


3.0 


AA547959 


He 1 i^n^R 

XtS.I [3ojO 


COT,, 

CO IS 


3.0 


LJlZ 1 f OOtKJ 


He TfnRlA 
rl5.£f QOOH 


ESTs 


3^0 




He i ) J ltn j \ 


nypouieDCai protein UKrZp/ d AJt>7 B 


3.0 












Lin 11A74*1 


nypotnebcaJ protein FU 14744 


3.0 




Ue 0*51031 


tCT r VftbsnLtu mmlnrln lr>4AQT DMA t, -T— 

tb I sv WBBWy simaar to JUIOor RNA naltc 


3,0 


A AVIATED 


ns.1 A^you 


ESTs 


3.0 


nnflUUDW 




Unmnsanlonp rtlVklA CI MH fXA fin iJnna UC 

nomo sapiens cuwAhui 1004115, none rtb 


3.0 


A ATA*0jlft 


We- <H mo 
ns.y iso£ 


ESTs 


3.0 


Al£R4£££ 
MloO'tOOO 




tblS 


3.0 


Mil fWTTEQ 


H8.2743o2 


proteb kinase, Interferon-Inducible dou 


3.0 


JAffxAQfrr 




gDjsvoWiiUf IjLO Homo sap tens cUNA clone 


3.0 


tJhol4U/U 


LI* Kiel 46 

HS.3ZB41E 


Homo sapiens mRNA* cDNA rjKrZp5b4H1916 (f 


3.0 


AUfCflVIQft 

Avvoudoro 


rlSJflubtid 


bo is, Moderately simiar to loaDZZ nypn 


3.0 




He OQTOC 


nypotneiKai protein njioi»7 


3.0 




ns.i43Uiu 


lipapolysaccaride-specffic response 5-11 


3.0 


A AQMQQO 


Rs.245022 


ESTs 


3.0 


H92962 


nS.l/W 1 o 


nypouieticai proietn Mbui4oi7 


3.0 




Hs. 3330 16 


ESTs 


3.0 


AWUW70# 




j]b:CMO-CTOl03-12DB99-037-{07 CT0103 Homo 


3,0 


AA 7744 AC 




1-2 pdn receptor 


3.0 


T97307 




gb:ye53n05.si Soaras fatal nver spleen 


3.0 


N40087 




ESTs 


3 0 


AJ001443 


Hs.195614 


spOc^g (actor 3b. subunft 3, 13DKD 


3X> 


AW014345 


Hs.161690 


ESTs 


3.0 


AKO0Q039 


Hs.132826 


Homo sapiens cONA FU14913 fs, cbne PL 


3.0 


AAJ504142 


Hs.293726 


ESTs, WeaWy stm?ar to TSGA RAT TESTIS 


3.0 


AW794931 


Hs,100B61 


hypothetical protein FU 14600 


3.0 


AA228776 


Hs.191721 


ESTs 


3.0 


AA811845 


Hs.106290 


Kefch motif contalnhg protein 


3.0 


AA521136 


Hs.t30176 


ESTs 


3,0 


AF1 50262 


Hs.182898 


ESTs 


3.0 


AW205507 


Hs.32360 


ESTs, Highly similar to 138587 reiiovi'rii 


10 


AA017410 


Hs.40568 


ESTs 


3.0 


AI933416 


Hs.189674 


ESTs 


3.0 



TABLE 64B



Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number

Accession: Genbank accession numbers



Pkey CAT Number Accession 

410790 1221131.1 AW803357 AWA03423 AW812233 R06S14 

410869 1 2251 23_1 AW808361 AW80B4O4 AWS0B385 AWB0S594 AW808554 AW80B81 3 AW806551 AW80B676 AW808350 AW8084O5 AW8C8694 AW80B934 
AW008B29 AW80B3B5 AW80B422 AW806401 AW808409 AW808760 AW8DB853 AWB0B521 AWBQ8539 AW808609 AW808472 AW80B739 
AW608704 AWB0B558 AW60871 4 AW60B420 AW8 

411436 1245660J AWB46433AW846159 AWB46377 AW&4B528 






WO 03/042661 



PCT/US02/36810 



411479 1247077J 



4129B8 
413051 
413525 
415989 
416009 
416288 
416BB2 

418259 
418856 
418948 
419536 
420111 
420352 
423412 
424200 
426413 
426503 
426991 
428002 
428342 

429163 
429220 
430535 
432600 
432755 
433523 
434763 
436295 



439092 
439306 
440840 

440947 
442481 
445432 
448044 
449343 



449570 



450317 
450580 



452462 
453682 
454037 
454096 
454171 
454457 
454860 
45496B 
455276 
455646 
455710 
457374 
45B912 



1342150.1 

1348563.1 

1374635J 

156454 1 

1566379.1 

1585983.1 

162718J 

17338BJ 

179786J 

180808 1 

185688.1 

190755J 

192979.1 

228001.1 

23659SJ 

256650.1 

268283J 

27415J 

285602J 

29CQ35_2 

300543 1 

301334_1 

319643.1 

350959.1 

353907J 

368873.1 

392847.1 

41733.1 

41795J2 



437866 44433^2 



468554J 
47GB8J 
50357 J. 

505904.1 
543588.1 
63943 1 
747196.1 
B0517J2 



81018J 
8113.1 

831956J 

SLf 



83933.1 

84327J 

918560.1 

977454_1 

996287.1 

1007449^1 

1049240 1 

1207274 1 

1237732J 

1247029J 

1272541J 

1348557.1 

135236BJ 

328758J 

823104.1 



AVV848047 AW84B202AWB48&31 AW848142AW84B702AWB48121 AW848632 AW848140 AW846571 AW84B009AW848067AW84B069 
AWB4B905AWB4B214 
BE04668O BE04673S9E044958 
BE064415 BE064430 BE06444B 

BE145B99 BE14584BBE145B49 BE145B53 BE145927 BE145925 

AJ267700 AJ720344 AA191424 A1023543 AI469633 AA172056 AW95B465 AA172236 AW953397 AA355086 
Z43062R13213H14422 

H51299 H44619 H46391 R86024 H51892 T72744 

A1633044 AW016212 AW241 143 AA759058 R43272 AW06895B AA210918 AA293774 AI748815 AJ763294 A1333114 AJ2773B4 AI08B297 AJ46B477 
AW24624 AW189606 A1631751 Z40749 AI984673 A1B71316 M189024 AW235412 BE178426 R24677 R40635 H05100 R40597 
AA215404 AI990909 BE464132 AW271459 N74332A1262061 
T65754 AA229857 AA22966B 

AJ217097 AW886090 W38035 W38792 AA232835 AW935043 
AA603305 AA244095 AA244183 
AA255652 AA2B091 1 AW967920 AA262684 

BE258835 AW96B316 AA25B918 AW843305 R14744 AI5803B8 BE071923 R36280 
AF1 09300 AI299378 AI202654 
AA337221 AA336756AW9561G6 
AA377823 AW954494 A10226B8 
AA380153 AA380233 AW963529 

AK001536 AA191092 AW510354 AJ554256 A13539SB AA134266 
AA41S703 AA418711 BE071915 BE071920 BE071912 

AI73916B AA42B249 A1199636 AW505198 AW977291 AA6245B3 AA8B3419 AA724079 AI01 5524 A077728 AW293B62 AI928140 AA731438 

At092404 A1085630 AA731340 

AA884766 AW974271 AA592975 AA447312 

AW207206 AW341473 AA448195 AI951341 

AW95B4BS AW968670 AA480922 BE350425 

AJB21GB5 AW973464 AA554802 AIB21831 AA557438 M640756 AA650339 
AJ003429 AJ003367 AA564825 

H298B2 AW665533 AW1 49901 AI572917 AA59B5D0 A168646S A1336390 AW864390 AW864320 

AA648616 AW974389 H51771 

N73895AJ0Q1872 

BE085236 BE085317 X04236 AA577934 AA578392 AA502836 AA595852 AA576258 AW270791 AA5071 51 AA559152 T57040 BE5032&1 
AW593405 AI825755 AI350499 A1655710 AI972281 AJ654949 BE073961 BE073962 8E041399 AW750214 AA228488 BE07401 6 AI9087C6 
AW270601AW873282 

AA156781 AW293839 U52054 AA024963 AA77B446 BE073977 AW444M4 AW602574 BE164040 BE1 64012 BE163972 BE163974 BE163992 
AAB37481 AW46B444 BE1 85091 AW4B3002 AA&87333 AA81 1830 AA5B180G AWG668S A1572124 AA043777 AAW0926 D20160 A1536733 
AAB12489 AWB74142 AI4718B 
AA830149 AW978407 M85983 AW503637 
BE220199 WD181 3 AF0B611B N7076Q BE221405 

AVU629GG6 AW959831 AW205739 BE620243 AA412367 AW300025 AW051920 AI288591 AW238114 A1302852A1038546 AA534496 AJ797207 
AA921877 

AA910403 AI615593 W66361 AW162520 A1B16550 
N9982B BE079873 A11 10738 AF074645 
AVB53771 B EOS 9370 

A1458682 H24240 R14537 R18426 AW867082 

AI151418W60401 AW631238 AI346936 AA862855W60310N7250i K9D0GO BE1S0445 AW3B0821 AI540906 C048B1 WD3542 AA641764 H97053 
AWB89353 AA52130B AA0012D3 W9282B AJ207798 AA746656 R78710 W24617 AA024605 C01747 AW173095 W61229 W926B5 AA742487 
HQ0789R76925AW1826 
AAQ01793AA0Q1871 

NMJH4253 AF100772 BE06B769 AU02271B BE161779 AW863569 BEI61640 ALO3906O BE168542 AW296554 AA323193 AA235370 AW779760 
K48574 A1375997 R45432D59344AI203107 F07491 R35360 K25094A191 3631 AJ49B402T61 382 A1016320 N45526 T61 415 AA331 486 
AI692689R14223R18395 

N400B7 H1 2925 AA460779 AA096372 BE160B47 AW8168S0 AWB1 6893 AWB16577 AWB16S91 AW816BB2 AW816B68 AWB16BS9 AW816940 
AW816B92 AW816941 AW816578 AA0291 83 T83320 H86850 AA010295 BE1 60823 Z25353 AA1506B3 AWB87764 AWO23806 AW0220Q5 
AA16451 8 AA730973 W00417 W65303 

A039732 AA01D300 AW515D41 M768334 N29BBD AA425B74 AA425118 AA865B29 AW936678 
AM95BO0 AA495737 AA010736 AA654716 AA640726 
9E173515 BE173560 A19Q2860 
T79703 796307 AL07S725 
AW998716 AW022148 N68Q20 

AW062757 AW176B90 AW062758 AW1 76895 AW176B69 AW176900 AW176897 AW17692B AW17686B AW176892 
AWB54832 AW354798 AW854857 AW854&T6 AW854834 AW854817 
AW753456 AW753036 AW854868 AW854862 
AW835767 AW335537 BE160187 

AW849046 AW647956 AW849039 AW847957 AWB4B279 AWB4B69B AWB49034 AWB49033 

BE176479 BE176678 BE176357 BE176550 AW866079 BE176676 BE176615 BE176555 BE17G4S9 BE176610 BE176362 
BE064420 BE064435 BE064429 BED 644 14 BE064400 BE06451 7 

BE072049 BE069471 BE0694B9 BE069476 BE069479 BE069492 BE0694B5 BE072122 BE072124 BE0B9491 BE069486 BEQ694&4 

M4936B2 AWB97398 BE154B14 

A191 1068 A1933734 A16B08B8 AJ003599 



TABLE 64C 



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position



737 



WO 03/042661 



PCT/US02/36810 



400533 




Minus 


9771*30 lT7fiOC 


400746 




Minus 


14/ tVJ-\A(QV0 


401132 




Minus 


JjrflTQ CC7DC 


401403 


7710966 


Pine 


I^D tOV- iHOZiW 






Minus 


i^14bD-lilo2b 


401424 




Plus 




403242 


7637017 




1 1 1 


403461 9965004 Plus 93496-93633
403510 7652047 Plus 61856-62027
403667 6850483 Minus 1344-1442,1545-1697
403696 3135242 Minus 143467-143634
404003 8656946 Plus 198349-199096
404561 9795980 Minus 69039-70100
404571 7249169 Minus 112450-112648
404592 9943965 Minus 39067-39225
404848 8240647 Minus 23955-24034,25143-25264
405254 7329374 Plus 28556-28684
405346 2914717 Minus 43310-43462
405510 7630909 Minus 101028-101174
405546 1532158 Plus 11552-11686



Table 65A lists about 347 genes up-regulated in BPH compared to prostate cancer tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip array
such that the ratio of "average" BPH to "average" prostate cancer tissues was greater than or equal to 3.0. The "average" BPH level was set to the 75th percentile amongst BPH
tissues. The "average" prostate cancer tissue level was set to the 85th percentile amongst malignant prostate tissues. In order to remove gene-specific background levels of
non-specific hybridization, the 10th percentile value amongst non-malignant body tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
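
The ratio described in this paragraph can be illustrated with a minimal Python sketch. It assumes the expression levels for one probeset are supplied as plain lists of numbers per tissue group. The function names (percentile, bph_ratio), the linear-interpolation percentile, and the choice to return None for a non-positive denominator are illustrative assumptions and are not taken from the original analysis.

```python
def percentile(values, pct):
    """Percentile (pct in 0..100) by linear interpolation between ranks.

    The interpolation rule is an assumption; the source does not state one.
    """
    ordered = sorted(values)
    if len(ordered) == 1:
        return float(ordered[0])
    rank = (pct / 100) * (len(ordered) - 1)
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def bph_ratio(bph, cancer, non_malignant,
              bph_pct=75, cancer_pct=85, background_pct=10):
    """Background-corrected ratio of "average" BPH to "average" cancer level.

    The background (10th percentile of non-malignant body tissues) is
    subtracted from both numerator and denominator before dividing.
    Returns None when the corrected denominator is not positive
    (a hypothetical convention, not specified in the source).
    """
    background = percentile(non_malignant, background_pct)
    numerator = percentile(bph, bph_pct) - background
    denominator = percentile(cancer, cancer_pct) - background
    if denominator <= 0:
        return None
    return numerator / denominator


# Made-up expression levels for a single probeset (illustration only).
bph_levels = [100, 200, 300, 400, 500]
cancer_levels = [50, 60, 70, 80, 90]
body_levels = [10, 20, 30, 40, 50]

ratio = bph_ratio(bph_levels, cancer_levels, body_levels)
selected = ratio is not None and ratio >= 3.0  # cutoff stated in the text
print(round(ratio, 2), selected)
```

Under these assumptions a probeset would be listed when the returned ratio is at least 3.0; the example values are invented and do not correspond to any row of the table.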

TABLE 65A: ABOUT 347 GENES UP-REGULATED IN BENIGN PROSTATIC HYPERPLASIA COMPARED TO PROSTATE CANCER TISSUES 

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of BPH tissue to prostate tumor tissue




Pkey ExAccn UnigeneID Unigene Title R1

428134 AA421773 Hs.161008 ESTs 9.4
446336 AW815036 Hs.151251 ESTs 9.3
456373 BE247706 Hs.89751 membrane-spanning 4-domains, subfamily A 8.9
458072 AI890347 Hs.271923 Homo sapiens cDNA: FLJ22765 fis, clone K
400533 ENSP00000209376*:PRED65 protein (Fragmen 8.7
418310 AA314100 Hs.86693 ESTs 8.7
404592 NM_022739*:Homo sapiens E3 ubiquitin lig 8.1
454457 AW753456 gb:QV2-CT0261-261099-011-d11 CT0261 Homo 7.7
400080 Eos Control 7.4
459646 AW883968 Hs.321190 gb:QV3-OT0063-290300-l3&<:04 OT0063 Homo 7.4
420352 BE258835 gb:601117374F1 NIH_MGC_18 Homo sapiens c 7.3
438231 AW594539 Hs.155689 ESTs 7.3
418387 R18085 gb:yg16b12.r1 Soares infant brain 1NIB H 7.2
442481 N99828 gb:za32c04.r1 Soares fetal liver spleen 7.1
449249 T52285 Hs.193115 Homo sapiens mRNA for KIAA1764 protein, 7.0
457653 AI820719 Hs.154652 DnaJ (Hsp40) homolog, subfamily A, membe 6.9
404967 Target Exon 6.9
430535 AW968485 gb:EST380561 MAGE resequences, MAGJ Homo 6.9
454860 AW835767 gb:QV4-LTC016^402O0-1lO4>0fl LTOOtt Homo 6.9
423789 AK002084 Hs.132551 hypothetical protein FLJ11222 6.6
412988 BE046680 9b^n42b03jc1 NCLCGAP_RDF2 Homo sapiens 6.6
400440 X83957 Hs.83870 nebulin 6.6
454171 AW854832 gb:QV2-CT0261-201099-011-f05 CT0261 Homo 6.6
400086 Eos Control 6.4
440911 AA909535 Hs.143562 ESTs 6.4
425312 AA354940 Hs.145958 ESTs 6.4
428140 AF131798 Hs.343768 Homo sapiens clone 25119 mRNA sequence 6.4
419015 T79262 Hs.14463 ESTs 6.3
453789 AA628517 Hs.118502 ESTs 6.2
424940 AA985309 Hs^83902 ESTs 6.1
403667 Target Exon 6.1
4290U AI800518 H8.11&15B ESTs 6.0
417758 U27699 Hs.82535 solute carrier family 6 (neurotransmitte 6.0
419099 AI760942 Hs.191754 ESTs 6.0
405348 C7O01664^l]12G9805HdbjIBAB21B49.1) (AB 6.0
404003 Target Exon 5.9
453200 AA033832 Hs.212433 ESTs 5.7
428002 AA418703 gb:zv98c03.s1 Soares_NhHMPu_S1 Homo sapi 5.7
432319 AW510770 Hs.126386 ESTs 5.7
445444 AA350576 HsJZ70 pleckstrin homology, Sec7 and coiled/coi 5.6
443361 AI792628 HsJ33273 ESTs 5.6
411518 AW850246 gb:IL3-CT0219-291099-021-E07 CT0219 Homo 5.6
439079 AF085937 Hs.38348 ESTs 5^
422031 AW136820 Hs.196011 ESTs 5.5
408137 AA282262 Hs.107410 ESTs, Weakly similar to A46010 X-linked 5.5
423529 T07316 Hs.120411 ESTs 5.5






■sOOOl o 


AlfKll Alt. 


HS»1J4BW 




A hflU CDC 


rj_ ■* UMil 


435072 


A\AWQH7fl 
AWOSil fd 




**i44U3 


AUWEOCKI 


U_. 7C/V5A 


AAXiAA 


Aluo4o^o 


T l_ 074CAD 




AlrZUODt> 


L>_ ■iCQC'in 
nS.l aZoZu 


441916 






414B1 8 


DEO 1 * It!/ 




40<W 1 






4095*00 


AVto4JAMo 








H8.jiOO/lZ 




A1*UDDD04 


L). ilCM 


HOf (Xi£. 


AA7R7S7C 


n 5.291631 


44b4£l0 




HS.IbtMW 




AI733610 ( 


Hs.1 67832 


4<D/4B 


ALU404UU 


Hs.97177 


44^Vsa4 


A u toe Ac 4c 




405321 






416706 


AAJ140/O 


Hs>2oa94o 


4s5C£UO 


AA/oUjod 


HS.187BB5 




A A DQKV7-1 

AA883971 


Hs.1 67506 


ADO/ 4/ 


bbU/4yiU 






Alannc-t i\ 
AIBQ951Q 


He. 11 B971 


4QfO/4 


AA43O0O2 




AAltHKA 


a a cam dc 


Ms. 125466 












u. o<2 1 Jcvi 

nS^o14v3 


4Q34B1 






4Jo1 


AAH07U7b 


Hs.1 22060 


403333 






450317 






433597 


AA7D8205 


1 > JAM J 

Hs. 100343 


430172 


AA468591 


Hs. 161669 


452643 


AI796769 


Hd, 20 13320 




AAr4l0l4 


Hs.865 


4273D2 


AA40054Q 


H8.1352B2 






fiS.159961 


43B727 


AW97o756 


Hs.205679 


420991 


AF044197 


Hs. 100431 


442160 


AIJ37127 


Hs.156325 




AIOZ724B 


hi5 ,22439a 


4345B3 


AF 14736 3 




404995 






4^-cyuo 


UoU//J 


Hs, 121500 


405549 








A A AajfonT 

AA4o4207 


lis, 21 1867 


443998 


AI62Q661 


Hs.296276 


455 B0 1 


BE1 40643 




444BUU 


AMI44rVYTH 

AW119071 


Hs.153287 


403371 






43B431 


AYV2076c0 


Hs.293l16 


421926 


AA300591 




439752 


T7B96B 


Hs.14411 


414441 


AA234759 


Hs.1 32950 




AlAffH'MCC 

awdi^ddo 




4WD0O 






452976 


a Anmnnj 
AAQ29994 


Hs.61523 


436042 


Jk 1 AMC1 Cn~7-f 

AW296971 


Hs.1 80610 


416059 


AA211586 




401 HW 


NM_Ul4o03 


rts.26441 


433072 


AiyztjUd/ 


Hs. 158832 


443318 


AKJ51cD3 


Hs. 1331 41 


A^ACnn 
4Z4W9 


AA340004 








Hs.16247 


4104 Br 


KbUoSi 


HS.2bUZ74 


Arson ak 
40BU4D 


No91Qi 


u« jnnn 
HS .40730 


415417 


F1ZD3B 


Hs. 140970 


409111 


AL0 43362 


Hs,7964 


441520 


R59595 


Hs.26676 


427906 


AA417272 


HS24122 








446106 


AW300Q2! 


Hs.170685 


436345 


AA&730C6 


Hs.121572 


409500 


UOS098 


Hs.54576 


413525 


BE145BS9 




403305 


NWL008825 




436338 


W92147 


HS.11B394 


427521 


AW973352 


Ha^90585 


430124 


AW204994 


Hs.253450 


410790 


AW803357 




434360 


AW015415 


Hs,127780 


416385 


H54253 


Ks^05241 


442786 


H50733 


H8J56261 



ESTs 
ESTs 
ESTs 

ribosamal protein 127a 

ESTs, Weakly slmffar to I7B865 serineflh 



ESTs 
ESTs 

ESTs, WeskJ/ similar to 136022 hy pothed 
gb:tL3-CT0214-1503004J85-H06 CT0214 Homo 
ESTs 

unc5 (C.elegans homolog) c 

ESTs, ModeratBly similar to 565657 alpha 

ESTs 

ESTs 

ESTs. Weekly similar to AUJ INHUMAN ALU S 
gb;EST3 66595 MAGE resequences, MAGC Homo 
Target Exon 

hypothetical protein FU13448 

ESTs 

ESTs 

9b:RCSST0580-l7030a021-Fl2 BT0580Homo 
ESTs 

gbnihD5dl2^1 NCC_CGAP_Thyt Homo sapiens 

ESTs 

ESTs 

ESTs 

Target Exon 
ESTs 

NM_002518*:Homo sapiens neuronal PAS dom 

gbwd66g05j{1 MQ_CGAP^bj24 Homo sapiens 

ESTs 

ESTs 

ESTs 

RAP1A, member of RAS oncogene famU/ 
Homo sapiens cDNA FU1 1 554 Rs, done HE 
EST 
ESTs 

small Inducible cytokine B subfamily (Cy 
ESTs 

Homo sapiens cDNA FLJ1 1469 fe, clone HE 

gb:Homo sapiens full length Insert cDNA 

ENSP000QQ25169F:Monocytlc leukemia zinc 

Human EST dons 42944 mariner transposon 

C7roi976rg1H7587l2|ie1lNP^004659Llial 

ESTs 

ESTs 

Bb:RCO-HTODl55105S&4Jl6 HTD015 Homo sap 
ESTs 

Target Exon 
ESTs 

gb:EST1 3437 Testis tumor Homo sapiens cD 

ESTs 

ESTs 

Ob:RC3-ST0186-30010W)l7-b03 ST01 B6 Homo 
gb:CW3-BTD379-3101004>71-g06 BT0379 Homo 
ESTs 
ESTs 

gb:znB6d05j3l Stratagsne muscle 937209 H 

K1AA0319 gene product 

ESTs 

ESTs 

gb:EST5l529 Gall bladder ll Homo sapiens 
ESTs, Weakly similar to 2004399A chromos 
ESTs 
ESTs 

ESTs, Vfeakly similar to ALU6_HU MAN ALU S 
plecksvin homotogy f Sec7 and colledfooi 
ESTs 
ESTs 

gb AKW3429 Selected chromosome 21 cDN A 

ESTs 

ESTs 

sulfbtransferaaa, Bstrog en- preferring 

gb:MRO-HT020&-221299-204-blO HT0208 Homo 

transmembrane protein (63kDJ» endoptasmi 

ESTs 

ESTs 

ESTs 

gb^L2-UMQD79-09O300-O5CWV08 UM0O79 Homo 
ESTs 

ESTs. Weakly simitar to S65857 alpha-IC- 
ESTs. Moderately similar to ALUft.HU MAN A 
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5.5 

5.5 

5.4 

5.4 

5.4 

5.3 

5-3 

SJ3 

5.3 

5.2 

5.2 

5.2 

5.2 

5.2 

5.2 

5.2 

5\2 

5,1 

5.1 

5,1 

5.1 

5.1 

5.1 

5.0 

5.0 

4.9 

4.9 

4.B 

4.8 

4.8 

4.8 

4.8 

4.8 

4,7 

4.7 

4.7 

4J 

4.7 

4.6 

4.6 

4.6 

46 

4.6 

4^ 

4.5 

4.5 

4.5 

4.5 

45 

4.5 

4.5 

4.5 

4v5 

4.5 

4.5 

4.4 

4.4 

4.4 

4.4 

4.4 

4.4 

4.4 

4.3 

4.3 

4.3 

4.3 

4.3 

4^ 

4.2 

4.2 

4,2 

4.2 

4.2 

A2 

4.2 

4.2 

A2 

i2 

4.2 

4,1 

4,1 

41 
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445206 ABS01&9 Hs.269990 ESTs 4.1 

424200 AA337221 gb:EST41944 Endometrial tumor Homo sapie 4.1
448152 AI741053 Hs.170770 ESTs 4.1
4537*3 R20640 Hs.79133 cadherin 8, type 2 4.1
400746 Target Exon 4.1
413081 BE064415 gb:RC4-BT0311-241199-012-b03 BT0311 Homo 4.1
442338 AI761976 Hs.156C80 ESTs 4.0
455388 AW936234 gb:QV0-DT0020-090200-106-g05 DT0020 Homo 4.0
455276 BE176479 gb:RC3-HT0585-160300-022-b09 HT0585 Homo 4.0
421312 AA824627 Hs.291670 ESTs 4.0
431421 AW969118 Hs.108144 ESTs, Weakly similar to unnamed protein 4.0
416882 AJ001531 Hs.22404 protease, serine, 12 (neurotrypsin, moto 4.0
441568 AJ733322 Hs.127176 ESTs 4.0
455646 BE064420 gb:RC4-BT0311-241199-012«OB BT0311 Homo 3.9
424765 AA428211 Hs.2d4256 hypothetical protein FLJ14033 similar to 3.9
454024 AA993527 Hs.293907 hypothetical protein FLJ23403 3.9
453367 AI990741 Hs.252809 ESTs 3.9
434222 AF119888 Hs.283941 Homo sapiens PRO2591 mRNA, complete cds 3.9
454806 AW872430 HU73743 ESTs 3.9
429066 AA866555 Hs.178222 ESTs 3.9
435878 RG833D Hs.20152 ESTs 3.9
463065 AW291436 Hs.31917 Homo sapiens, clone MGC:9558, mRNA, comp 3.9
447530 AW192063 Hs.248865 ESTs, Moderately similar to JC5238 galac 3.9
407834 AW084991 Hs.26100 ESTs 3.9
400398 AM 37395 Hs.2B3fl79 ubiquilin 3 3.9
421353 AW292857 Hs.255130 ESTs 3.9
401459 C14000462*:gi|9790241|ref|NP_062628.1| S 3.9
403433 NM_001622:Homo sapiens alpha-2-HS-glycop 3.9
444911 13061 17 Hs.250 xanthine dehydrogenase 3.9
436350 AA713661 Hs.121091 ESTs 3.9
447930 R44574 Hs.107510 ESTs 3.9
410559 AW754192 gb:RC2-CT0321-131299-012-a04 CT0321 Homo 3.8
452320 AA042873 Hs.160412 ESTs 3.8
402145 Target Exon 3.8
458438 AM41520 Hs.151464 ESTs, Weakly similar to ALUC_HUMAN !!!! 3.8
440450 AI333129 Hs.156147 ESTs 3.8
437667 AI591222 Hs.72325 Human DNA sequence from clone RP1-187J11 3.8
416009 Z43062 gb:HSC12E041 normalized infant brain cDN 3.8
434381 AA631834 gb:np77h05.s1 NCI_CGAP_Pr2 Homo sapiens 3.8
416534 K69043 Hs.224961 Homo sapiens cDNA FLJ14366 fis, clone HE 3.8
433523 H29B82 ESTs 3.8
411436 AW846433 gb:QV0-CT0179-070300-143*02 CT0179 Homo 3.8
424630 AW270580 Hs.185311 ESTs, Weakly similar to putative p150 [H 3.8
431447 AA505138 Hs.291341 ESTs 3.8
435932 W03928 Hs.114524 ESTs 3.8
442447 AA598723 Hs.129607 ESTs 3.8
419831 AW448930 Hs£41S ESTs 3.8
403242 Target Exon 3.8
455490 AW953477 gb:EST365547 MAGE resequences, MAGB Homo 3.8
449264 AB37649 Hs.198105 ESTs 3.8
443635 AI080230 Hs.134214 ESTs 3.7
428200 AI039624 Hs.98368 ESTs 3.7
452462 BE173515 gb:RC2-HT0560-210200-012-103 HT0560 Homo 3.7
418759 AA227879 Hs.187621 ESTs 3.7
450497 H64159 Hs.15328 ESTs 3.7
447785 AU041765 Hs.340375 ESTs 3.7
451746 M86176 Hs.311255 ESTs 3.7
416321 H94331 Hs.34024 ESTs 3.7
448135 AJ470874 Hs.343799 ESTs 3.7
405510 ENSP00000233779*:Hypothetical 6&0kt>a p 3.7
407437 AF220264 gb:Homo sapiens MDST-1 mRNA, complete cd 3.7
451859 H44491 Hs.25293& ESTs, Weakly similar to ALU1_HUMAN ALU S 3.7
455135 AW857989 gb:PM2-CT0328-261299-003-e04 CT0328 Homo 3.7
409189 AA125984 gb:zn27h06.r1 Stratagene neuroepithelium 3.7
420300 AA258245 Hs.127573 Homo sapiens FKSG41 (FKSG41) mRNA, compl 3.7
427726 AI359144 HsU43688 Homo sapiens cDNA: FLJ23031 fis, clone L 3.7
434306 AW081757 Hs44241 Homo sapiens cDNA: FLJ21447 fis, clone C 3.7
456354 X56411 Hs.1219 alcohol dehydrogenase 4 (class II), pi p 3.7
419261 X07876 Hs.89791 wingless-type MMTV integration site fami 3.7
422899 D16471 Hs.121571 Human mRNA, Xq terminal portion 3.7
447458 AI741052 Hs.158961 ESTs 3.7
411421 BE272110 Hs.21177 ESTs 3.7
439075 AF085933 Hs£92620 ESTs 3.7
440947 AA910403 ESTs 3.7
417565 AQ03405 Hs-4783t ESTs 3.6
430096 U91935 Hs.233321 retina-derived POU-domain factor-1 3.6
448658 H71739 Hs.200227 ESTs, Moderately similar to A53959 throm 3.6
404561 trichorhinophalangeal syndrome I gene (T 3.6
429073 AA446167 Hs.47385 ESTs 3.6
419002 T76626 Hs36fl594 ESTs 3.6
450630 AA010429 Hs.191939 ESTs 3.6
455067 AW854536 gb:RC3-CT0255-20010<M3244j02 CT0255 Homo 3.6
412766 AW996044 Hs.26239 Human DNA sequence from clone RP11-438B2 3.6
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439481 AF0B6294 Hs.125B44 ESTs 3.6 

419622 AA452054 Hs.119336 ESTs 3.6
449821 AI671141 Hs.211122 ESTs 3.6
451193 N23850 Hs.44098 ESTs 3.6
412701 AW984757 gb:RC1-HN0015-040400-011-g10 HN0015 Homo 3.6
419611 AB031479 Hs.91600 SEEK1 protein 3.6
433563 AI732637 Hs.277901 ESTs 3.6
427235 AI126288 Hs.192232 ESTs 3.6
449045 BE072483 Hs.278337 Homo sapiens cDNARJU537fe. done HE 3.5
401132 C12000517*:gi|4758712|ref|NP_004659.1| a 3.5
421105 AA766501 Hs.125113 ESTs 3.5
405264 NM_030613*:Homo sapiens suppressor of po 3.5
417675 AI808607 Hs.3781 similar to murine leucine-rich repeat pr 3.5
449986 AW372068 Hs.201420 ESTs, Moderately similar to ALU7_HUMAN A 3.5
434763 AA648618 gb:as07a11.r1 NCI_CGAP_Ew1 Homo sapiens 3.5
416379 N38857 Hs.203933 ESTs 3.5
407557 Z63803 gb:H.sapiens mRNA for axonemal dynein he 3.5
437956 AA773283 Hs.203559 hypothetical protein FU1270J 3.5
403510 Target Exon 3.5
451722 H86374 Hs.40861 ESTs 3.5
435681 AA694192 Hs.148979 ESTs 3.5
427125 AA683362 Hs.97612 ESTs 3.5
424287 AU33105 Hs.144633 hypothetical protein DKFZp434F2322 3.4
434497 AI821803 Hs.136580 ESTs 3.4
453682 T79703 gb:yd7leC6.r1 Soares fetal liver spleen 3.4
404848 ENSP00000240769*:BG15303.1 (similar to C 3.4
417435 NM_005181 Hs.82129 carbonic anhydrase III, muscle specific 3.4
442231 W02434 Hs.222413 ESTs 3.4
450297 AW901347 Hs.36592 hypothetical protein FLJ23342 3.4
436340 R42246 Hs.21606 ESTs 3.4
441596 AA939300 Hs.206768 ESTs 3.4
456995 T89H32 Hs/I70278 ESTs 3.4
456304 AI820973 gb:nc21c02.y5 NCI_CGAP_Pr1 Homo sapiens 3.4
427033 AI457449 Hs.192317 ESTs 3.4
400379 MAJrtB432 Homo sapiens ovarian cancer related prot 3.4
410551 R36730 Hs.21816 ESTs 3.4
423357 AI285124 Hs.157505 ESTs 3.4
450582 AI339732 G-rich RNA sequence binding factor 1 3.4
437662 AA765387 Hs.145095 ESTs 3.4
442388 AW663442 Hs.129485 ESTs 3.4
445004 AJ204616 Hs.1467OT ESTs 3.4
450597 AI701635 Hs.207077 ESTs 3.4
438801 AA825971 HsJ24284 ESTs 3.4
445432 AV653771 gb:AV653771 GLC Homo sapiens cDNA clone 3.4
450003 AA777809 Hs.191995 ESTs 3.4
431408 AA504757 Hs.105738 ESTs 3.4
428923 BE047698 Hs.188785 ESTs 3.4
407344 AI038025 Hs.271418 gb:ax29TO7jct Soares_total_fetus_Nb2HF8_ 3.3
425523 AB007948 Hs.158244 KIAA0479 protein 3.3
427473 AW274439 Hs^52709 ESTs 3.3
437115 AA744703 Hs.129030 ESTs 3.3
434520 AA205Z73 Hs.177011 hypothetical protein 3.3
447252 AI989963 Hs.197505 ESTs 3.3
453328 AW292635 Hs.346145 ESTs 3.3
416965 AI042330 Hs.67128 hypothetical protein FLJ23309 3.3
449256 AA059050 Hs.59847 ESTs 3.3
432550 AW297206 Hs.164018 ESTs 3.3
423101 M 53941 Hs.123642 EphA3 3.3
424653 BE549737 Hs.132967 Human EST clone 122887 mariner transpose 3.3
410700 AA352335 Hs.65641 hypothetical protein FLJ20073 3.3
418719 AW975590 Hs.161707 ESTs 3.3
402703 Target Exon 3.3
455710 BE072049 gb:PM4-BT0532-170100-004-f06 BT0532 Homo 3.3
452011 AW628911 Hs.211429 ESTs 3.3
453973 AI291695 Hs.61993 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.3
420355 AW968263 Hs.123126 ESTs 3.2
418898 T66W7 Hs.194040 ESTs, Weakly similar to I38022 hypotheti 3.2
418986 AI123555 Hs.81796 ESTs 3.2
449570 AA001793 ntah86c06.r1 Soares_fetal_liver_spleen_ 3.2
447516 W05356 Hs.102971 hypothetical protein FLJ14751 3.2
408304 AW810279 gb:MR4-ST0125-151299-029-a09 ST0125 Homo 3.2
402454 C10Q25Or:gi|129092|splP23270|OlJF7JWTO 3.2
411552 AW851255 gb:IL3-CT0220-1 6 020043 66-H02 CT0220 Homo 3.2
407896 D76435 Hs.41164 Zic family member 1 (odd-paired Drosophi 3.2
435786 H09175 Hs.25065 ESTs 3.2
447597 AI886036 Hs.213675 ESTs 3.2
432625 AI243596 Hs£4830 ESTs, Moderately similar to T03094 A-fclh 3.2
459535 AV654907 gb:AV654907 GLC Homo sapiens cDNA clone 3.2
447183 AI554733 Hs.173182 ESTs 3.2
426629 AI203933 Hs.97142 ESTs 3.2
447892 AI435848 Hs.172978 ESTs 3.2
457136 AA426240 Hs.126083 ESTs 3.2
443665 AW466983 Hs.283949 enamelin 3.1
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451399 
450687 
416B12 
452102 
434512 
458251 
439950 
42S569 
403B05 
422666 
408799 
423350 
430454 
441817 
427773 
415901 
446758 
403291 
436235 
455772 
418037 
438142 
416423 
448766 
457021 
420901 
420721 
436013 
441492 
410869 
413386 
435628 
444326 
413774 
434352 
434269 
416422 
41G642 
439182 
418948 
433796 
427813 
405733 
426743 
449913 
454096 
440129 
410929 
458187 
429430 
451391 
414951 



AU04211G 

AA495800 

H91010 

UQ4343 

AW139932 

AL04O927 

AW937417 

AA454993 

AA677981 

AA059412 

AI754634 

AW46S011 

AW969706 

AA412290 

RQ8396 

R60715 

W73B95 

W2B799 

AI990212 

T90309 

H54375 

R15337 

AW968934 

AA767526 

AA927802 

M703419 

AI14999S 

AW808361 

AA236857 

W8B732 

AI939357 

AA131782 

AF129505 

AK001991 

H60457 

T96118 

AFOB6D30 

AI217097 

AA810B67 

D29833 

AA383833 

AA004696 

AVKB2757 

AA865818 

H47233 

DS6919 

A13S1837 

AA017410 

AW794931 



Hs.326728 ESTs 



Hs,44940 

Hs.27954 

Hs/1 88941 

Hs310422 

H&JZ93561 

Hs.138343 

Hs.119023 

H&.47986 

Hs.131987 

H5.105635 

Hs_293332 

Hs.98124 

Hs.76118 

Hs.25804 



Hs.86447 

Hs.269651 

H8.268921 

Hs.21958 

Hs.173106 

Hs.22030 

Hs.159471 

Hs387749 

Hs.146346 



Hs.35107 

Hs.270710 

HS.1B2314 

Hs.86492 

Hs,3781 

Hs.226313 
Hs.21621 

Hs.186997 
Hs^207 

Hs.245022 
Hs.333016 

Ha.174936 

Hs.30643 

HS.265B46 

Hs.155335 

Hs.405f» 

Hs.100861 



gb3w05b07.s1 Soares_NhHMPu,_S1 Homosapi 
ESTs 

CD86 antigen {C028 antigen Rgand 2. B7- 
ESTs 

ESTs * 
ESTs 

ESTs, Weakly similar to 178885 serine/m 
Targel Exon 

SMC2 (struciuraf maintenance of chromoso 

hypothetical protein MGC10940 

ESTs 

ESTs 

ESTs 

ESTs 

ubiquitin car boxyl -terminal esterase L1 
ESTs 

Target Exon 

gb:zaS2d0S,s1 Scares fetal liver spleen 

gb:52g11 Human reflna cDNA randomly prim 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp547O036 {fr 
Homo sapiens cDNA: FU21897 fis, ctone H 
paired box gena 5 (B-caR Fneage specif 
ZAPS protein 

Homo sapiens cONA: FLJ23593 tis, ctone L 
ESTs 

flb:MR1-ST0t11-t1l09M03-f04ST0H1 Homo 

ESTs, WeaWy similar to 138022 hypoihefi 

ESTs 

ESTs 

ESTs 

small muscle protein, X-i Inked 
similar to murine fa urine-rich repeat pr 
ESTs, Moderately simitar to 2W91JHUMAN Z 
ESTs 

hypotheficrf protein DKFZp762O076 
Qb:qd43h07.x1 ScarasJetaUiearUYbHH19W 
ESTs 

salvay profine-rich protein 

NM_021 140-:Homo sapiens ubiquitously tra 

ESTs 

ESTs 

db:CMfJ-CT0103-12Q899-037-o,07 CT0103 r 
ESTs, Weakly simlar to S71886 Sle2Wlfe 
ESTs 



ESTs 
ESTs 

hypothetical protein FL314G0O 



3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1
3.1
3.1
3.1
3.1
3.1
3.1
3.1
3.1
3.1
3.1
3.1
3.1
3.1
3.1
3.1
3.1
3.1
3.1
3.0
3.0
3.0
3.0
3.0
3.0

3.0

3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0

3.0

3.0
3.0

3.0
3.0
3.0



TABLE 65B



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey 
408304 



410559 
410790 
410869 



411436 
411518 
411552 
412701 
412988 
413081 
413525 
416009 
416422 
418059 
418387 
418946 



CAT Number 
1050845.1 



409189 110687J 



1206283J 
1221 131J 
122S123J 



1245660_1 

1248692.1 

1249255.1 

1322288J 

1342150 1 

1348563.1 

1374635J 

156S379.1 

1593811J 

171B79_1 

174731.1 

180608.1 



Accession 

AWB10279 8E146684 BE146693 BE146694 BE14B679 AW810472 AW3102Q8 AW810356 AW8t D193 AW178838 AW176837 AW178857 AW81Q515 
AWB10330 AW8105U AWB10441 AW810358 AW17B852 AW810359 AW810322 AWB10327 AWB1021 1 AW1 76835 AW61 0635 AW810288 
AW8102S3 AWB10325 AWB10443 AW8 

AA125984AA127189 AA0^075AA070377M10(»17AAU79B91 AA1 1 3255 AA0751 68 AA082764 AA083380 N84829AA084752AA076512 
AA0851 19 AA085206 AADB5045 

AW754192 W0 05 54 AW857797 AW754203 AW754197 AW754193 
AW803357 AW803423 AW812233 H06814 

AW808361 AW808404 AW808386 AW808594 AW80a654 AW608813 AW808551 AW808676 AW808350 AW808406 AW80B694 AW808934 

AW808829 AW80B385 AW808422 AW808401 AW308409 AW808760 AW80B863 AW808521 AW808539 AW808609 AW808472 AW80B739 

AWa087O4AW808558AW808714AWfi08420AW8 

AW346433 AW846159 AW646377 AW646528 

AW950246AW8S0251 AW85D302 

AWB51255 AW851432 AW850955 

AW9847S7 AW984797 AW984734 AW984745 

BE046680 BE04673B BED44958 

BE064415 BE064430 BE064448 

BE145899 BE145848 BE145B49 BE145B53 BE146927BE1 46925 

Z43062R13213H14422 

HB0457 H6B709 H73528 H54335 R87154 

AA21 1585 F35799 AA211641 R29720 AW9373B7 AW937408 

R18085 AA21902B R17712 244345 

A1217097 AW8B609D W3B035 W38792 AA232835 AW936043 
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419386 


184356 1 


AA236B67 AA237066 AA354236 AW957759 H0BS61 


420352 192979_1 BE258835 AW968316 AA258918 AW843305 R14744 AI580388 BE071923 R36280
421926 209246_1 AA300591 AW963893 AA300493
424200 236595_1 AA337221 AA336756 AW966196
424686 242466_1 AA345504 AA345251 AW963243
424994 245786_1 AW954625 AJ372685 AA349501 AI372687 H10564
428002 285602_1 AA418703 AA418711 BE071915 BE071920 BE071912
430535 319643_1 AW968485 AW968670 AA480922 BE350425
432766 353907_1 AJ003429 AJ003367 AA564825
433523 368873_1 H29882 AW665533 AW149901 AI572917 AA598500 AI686466 AI336390 AW864390 AW864320
434381 385155_1 AA631834 AA633425 AA632455 AI792312 AI792311
434589 38929_1 AF147363 T47219 T47218
434763 392847_1 AA648618 AW974389 H51771
436295 41733_1 M73895 AJ001872
440947 505904_1 AA910403 AI815593 W58361 AW162520 AI816550
442481 543588_1 N99828 BE079673 AI110738 AF074645
445432 63943_1 AV653771 BE069370
449570 81018_1 AA001793 AA001871
450317 831956_1 AJ692689 R14223 R18395
450582 83933_1 AI339732 AA010300 AW515041 AA768334 N29660 AA425874 AA425118 AA865829 AW936878
450587 84327_1 AA495800 AA495737 AA010736 AA654716 AA640726
452462 918560_1 BE173515 BE173560 AI902860
453682 977454_1 T79703 T96307 AL079726
454096 1007449_1 AW062757 AW176890 AW062758 AW176895 AW176869 AW176900 AW176897 AW176928 AW176868 AW176892
454171 1049240_1 AW854832 AW854798 AW854857 AW854816 AW854834 AW854817
454457 1207274_1 AW753456 AW753036 AW854868 AW854862
454555 1225852_1 BE069128 BE069023 AW809375
454665 1228599_1 AW812866 AW812746 AW812747 AW812884 AW812763 AW812722
454860 1237732_1 AW835767 AW835537 BE160187
454968 1247029_1 AW849046 AW847956 AW849039 AW847957 AW848279 AW848698 AW849034 AW849033
455067 1252050_1 AW854538 AW854418 AW854412
455135 1254729_1 AW857989 AW858016 AW961677 AW861689 AW861691 AW858056
455276 1272541_1 BE176479 BE176678 BE176357 BE176550 AW896079 BE176676 BE176615 BE176555 BE176489 BE176610 BE176362
455388 1287904_1 AW935234 AW936074 AW936181 AW936179 AW936217 AW936077 AW936227 AW936191
455490 1297826_1 AW953477 Z41970 F12435 T73989 T09387
455646 1348557_1 BE064420 BE064435 BE064429 BE064414 BE064400 BE064517
455710 1352368_1 BE072049 BE069471 BE069489 BE069476 BE069479 BE069492 BE069485 BE072122 BE072124 BE069491 BE069486 BE069464
455747 1355877_1 BE074910 BE074913 BE074911 BE074903 BE074892 BE074935
455772 1363114_1 W28799 BE086076
455801 1370506_1 BE140643 BE140645 BE140644 BE140657 BE140560 BE140659 BE140661
456304 17682£U AI820973 AI734077 AI820984 AA225796 AA225060 AA225101
457374 328758_1 AA493662 AW897396 BE154814



TABLE 65C 



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position

400533 6981826 Minus 277132-277596
400746 7329328 Minus H770M47896
401132 8705350 Minus 65679-85795
401459 9212270 Minus 182001-183323
402145 8018280 Plus 113086-114800
402454 7534025 Minus 14826-15803
402703 8705069 Minus 15335-15500
403242 7637817 Minus 11297-12511
403291 7230870 Plus 96177-95435
403305 8099945 Plus 114632-114805
403333 8568833 Minus 124794-124941
403371 9087278 Plus 105655-106050
403433 9719611 Minus 72225-72437
403481 9965004 Plus 93496-93633
403510 7652047 Plus 61868-62027
403667 6850483 Minus 1344-1442,1545-1697
403805 8140491 Minus 51483-51742,53429-53511
404003 8655948 Plus 198349-199096
404561 9795980 Minus 69039-70100
404592 9943965 Minus 38067-39225
404848 8248647 Minus 23955-24034,25143-25264
404987 7523744 Minus 89944-90729
404995 6006247 Minus t64015-154123
405264 7329374 Plus 28556-28684
405321 3419846 Minus 4465+4S210
405348 2914717 Minus 43310-43462
405510 7630909 Minus 101028-101174
405549 1552494 Plus 10878-11048
405733 9884689 Plus 124832-125051
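
As an illustration of how a row of this table may be read, the following Python sketch splits a row into its four fields and parses the Nt_position entry into exon start and end pairs. The function names are hypothetical, and the sketch assumes that each row sits on a single line, that Ref is a single token (a GI number), and that exon coordinates are inclusive; none of these conventions is stated in the table itself.

```python
def parse_exon_positions(field):
    """Turn '1344-1442,1545-1697' into [(1344, 1442), (1545, 1697)]."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        exons.append((int(start_text), int(end_text)))
    return exons


def parse_row(line):
    """Split a 'Pkey Ref Strand Nt_position' row into a dictionary.

    Rows whose Ref is the Dunham citation contain spaces and would need
    separate handling; that case is not covered by this sketch.
    """
    pkey, ref, strand, positions = line.split()
    return {
        "pkey": int(pkey),
        "ref": ref,
        "strand": strand,
        "exons": parse_exon_positions(positions),
    }


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive start and end coordinates."""
    return sum(end - start + 1 for start, end in exons)


# Row for probeset 403667 as it appears in the table.
row = parse_row("403667 6850483 Minus 1344-1442,1545-1697")
print(row["strand"], row["exons"], total_exon_length(row["exons"]))
```

For probeset 403667 the two predicted exons span 99 and 153 nucleotides under the inclusive-coordinate assumption, 252 in total.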
Table 66A lists about 370 genes up-regulated in BPH compared to prostate cancer and normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03
GeneChip array such that the ratio of "average" BPH to "average" prostate cancer and normal adult tissues was greater than or equal to 3.0. The "average" BPH level was set to the
75th percentile amongst BPH tissues. The "average" prostate cancer and normal adult tissue level was set to the 85th percentile amongst malignant prostate tissues and normal adult
tissues. In order to remove gene-specific background levels of non-specific hybridization, the 10th percentile value amongst non-malignant body tissues was subtracted from both the
numerator and the denominator before the ratio was evaluated.

TABLE 66A: ABOUT 370 GENES UP-REGULATED IN BENIGN PROSTATIC HYPERPLASIA COMPARED TO PROSTATE CANCER AND NORMAL ADULT TISSUES 

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of BPH tissue to prostate tumor and normal body tissue



Pkey 


ExAccn 


UnigeneID


Unigene Title


R1 


410329 


H47233 


Hs.30643


ESTs 


21.1 


450693 


AW450461 


Hs.203955 


ESTs 


16.7 


418432 


M14156 


Hs.85112 


insulin-like growth factor 1 (somatomedi


14.2 


432473 


AJ2027D3 


Hs.152414 


ESTs 


11.3 


446336 


AW815C3S 


Hs.151251


ESTs 


10.9 


407275 


AI364186 




gfcqw34h07jc1 NCI_CGAPJJt4 Homo sapiens 


10.7 


428134 


AA421773 


Hs.161008 


ESTs 


10.2


400297 


A1127076 


Hs.306201 


hypothetical protein DKFZp56401278


9.8 


433466 


AA508353 


Hs.105314 


relaxin 1 (H1)


9.5 


415293 


R494B2 


Hs.106541 


ESTs 


9.1 


458072 


AJ89Q347 


HsJ7l923 


Homo sapiens cDNA: FLJ22785 fis, clone K


8.8 


428927 


AA441837 


Hs.90250 


ESTs 


8.6 


420345 


AW295230 


Hs.25231 


ESTs 


8.5 


453367 


AI990741 


Hs.252809 


ESTs 


8.2 


454457 


AW753456 




gb:QW2-CT0261-261099-011-d11 CT0261 Homo 


7.7 


441247 


AW118681 


Hs.12a051 


Homo sapiens thymic stromal lymphopoieti


7.5 


431576 


M76665 


Hs.275215


hydroxysteroid (11-beta) dehydrogenase 1


7.4 


400QB0 






Eos Control 


7.4 


431446 


AL137517 


Hs.303201 


hypothetical protein DKFZp56401276


7.4 


404592 






NM_022739*:Homo sapiens E3 ubiquitin lig


7.3 


420352 


BE25883S 




gb:601117374F1 NIH_MGC_16 Homo sapiens c


7.3 


436231 


AW594539 


Hs.155689


ESTs 


7.3 


410330 


AW023630 


Hs.159425 


ESTs 


7.2 


449300 


AI658S59 


Hs.346514 


ESTs 


7.1 


449249 


T522B5 


Hs.193115 


Homo sapiens mRNA for KIAA1764 protein, 


7.0 


426384 


AI472078 


Hs.303662 


hypothettcal protein FU13189 {FU13169) 


6.8


454171 


AW854332 




gb:QV2-CT0261-201099-011>f05CT0261 Homo 


6.6 


40B197 


AA282262 


Hs.107410


ESTs, Weakly similar to A46010 X-linked


6.6 


429732 


U20158 


Hs.2488 


lymphocyte cytosolic protein 2 (SH2 doma


6.6


421154 


AA2B4333 


Hs.287631 


Homo sapiens cDNA FLJ14269 fis, clone PL


6.5 


431467 


N71831 


Hs.256398 


Homo sapiens mRNA; cDNA DKFZp434ED528 (f 


6.5 


424433 


H04607 


Hs.9218 


ESTs 


6.5 


442481 


N99B2B 




gb^o32c04.rt Scares fetal liver spleen i 


6.4 


425312 


AA354940 


Hs.145958 


ESTs 


6.4 


426140 


AF13179B 


Hs.343768


Homo sapiens clone 25119 mRNA Sequence 


6.4


440911 


AA909538 


Hs.143562 


ESTs 


6.4 


400533 






ENSPC0000209376*iPRED65 protein (Frag men 


6.2 


419310 


AA814100 


Hs.86693 


ESTs 


6.2 


403667 






Target Exon 


6.1 


436395 


A16B3487 


Hs.152213 


wingless-type MMTV integration site fami


6.1 


404003 






Target Exon 


5.9 


424853 


BE549737 


Hs.132967 


Human EST clone 122687 mariner transposo


5.9 


436138 


R9B299 


Hs.177502 


ESTs 


5.9 


424940 


AA98530& 


Hs.283902 


ESTs 


5.8 


434465 


AJ623511 


Hs.118567


ESTs 


5.8 


453200 


AAD33832 


Hs.212433 


ESTs 


5.7


426002 


AA418703 




gbrzv9Bc03t£l Soares_NhHMPu_S1 Kcmo sapi 


5.7 


424650 


AA151057 


Hs.153498


chromosome 18 open reading frame 1 


5.6 


445444 


AA360B76 


Hs.270 


pleckstrin homology, Sec7 and coiled/coi


5.6 


443361 


A1792626 


Hs.133273 


ESTs 


5.6 


421513 


X00949 


Hs.105314


relaxin 1 (H1)


5.5


439079 


AFC65937 


Hs.38348 


ESTs


5.5 


430535 


AW9684B5 




gb:EST3B0551 MAGE resequences, MAGJ Homo 


5.5


43657B 


A1091435 


Hs.134859 


ESTs 


5.5 


424051 


AL110203 


Hs.138411 


Homo sapiens mRNA; cDNA DKFZp586J1922 (f


5.4 


421863 


A1952677 


Hs.108972 


Homo sapiens mRNA; cDNA DKFZp434P226 (fr


5.4 


435072 


AW592176 


Hs.116932 


ESTs 


5.4 


435375 


AI733610 


Hs.187632


ESTs 


5.4 


444609 


AW571659 


Hs.278061


ESTs


5.4 


416602 


NM_005159


Hs.79389 


ne){chlcken)-nke2 


5.4 


433087 


A172Q686 


Hs.152520 


ESTs 


5.3 


439092 


AA630149 




otoc44f08.s1 NCt_CGAP_GCB1 Homo sapiens 


5.3 


437267 


AW511443 


Hs.258110 


ESTs 


5.3 


441916 


AA993571 


Hs.129075 


ESTs 


5.3 


452784 


BE463857 


Hs.151258 


hypothetical protein FLJ21062


5.3


452531 


AA429462 


Hs.293946 


ESTs, Weakly similar to I38022 hypotheti


5.3 
423101 M83941 Hs.123542 EphA3 5.3

437567 AI591222 Hs.72325 Human DNA sequence from clone RP1-187J11 5.2

415890 H0B225 Hs.268712 ESTs 5.2 

_ 45496B AWB49046 gklL3^0214-150300-0a5-H06a0214Romo 5.2 

408385 AF055634 Hs.44553 unc5 (C. elegans homolog) c 5.2

437752 M767376 Hs.291631 ESTs, ModGralolysimlar to S656S7 alpha 5.2 

446495 DGQ923 Hs.1 53460 ESTs 5.2 

422906 U80773 Hs.121580 Human EST clone 42944 mariner transposon 5.2

427725 AI359144 Hs.1436fla Homo sapiens cDNA: FU23031 fis, done L 5.2 

10 42674B AL045409 Hs.97177 ESTs, Weakly simitar to ALU INHUMAN ALU S 5.2 

43633B W92147 Hs.1 18394 ESTs 5.2 

4167CG AA314676 Hs.266945 hypothetical protein FU13448 5.1 

440129 AA85581B Hs_17493B ESTs, Weakly similar to 37 1BS6 Sfe20-!lk 5,1 

44523a AA8B3971 H&.1B7506 ESTs 5,1 

450582 AI339732 G-rich RNA sequence binding factor 1 5.1

420533 AIB09510 Hs.118971 ESTs 5.1 

438447 A10828B3 Hs.3fl732 hypothetical protein FU13409; KIAA1711 5,1 

440354 AAS89386 HS.12546B ESTs 5.0 

452891 N755B2 Hs.212875 ESTs, WeaWy similar to DYH9_HUMAN CILIA 5.0 

20 457374 AA493662 gb:nh05d12.a1 NCLCGAP_Tt»y1 Homo sapiens 5.0 

40B829 NMJH6042 Hs.483B4 hep^an suJfate (glucosamtae) 3-O-suJfbt 5.0 

448041 AW292769 Hs.206228 ESTs 5.0 

457653 AJB20719 Hs.154662 DnaJ (Hsp40) homolog, subfamily A, membe 4.9 

„ 450497 H64159 Hs.1 5328 ESTs 4.8 

25 43B132 AA907076 Hs.122060 ESTs 4,8 

414818 BE541217 Hs£3606 ESTs 4,8 

429433 AA452899 Hs.2135B6 ESTs, Weakly similar to KIAA1 353 protein 4.8 

433523 H298S2 ESTs 4.8 

_ _ 450317 A1592689 gb^B5g05jt1 Ma_CGAP_Lu24 Homo sapiens 4.8 

30 443635 AI0BO23O Hs.134214 ESTs 4.8 

430172 AA468591 Hs.161889 ESTs 4.B 

452843 AI796769 Hs.208320 ESTs 4.7 

427302 AA4QQ540 Hs.135282 Homo sapiens cDNAFU 11 554fc, done HE 4.7 

452031 AA741314 Hs.865 RAP1A, member of RAS oncogene family 4.7

423789 AK002084 Hs.132851 hypothetical protein FLJ11222 4.7

417173 U61397 He.81424 ublqinGn-lSoa 1 (senfrin) 4.5 

420931 AF044197 Hs.100431 small inducible cytokine B subfamily (Cy 4.6

454024 AA993527 Hs. 2939 07 hypothetical protein RJ23403 4.6 

405346 C70016B4^ll1269B06l|dbjlBAB21849.1l(AB 4.6 

40 419511 AA429750 Hs.75113 general transcription factor IIIA 4.6 

447058 AJ939456 Hs.160870 ESTs 4.6

428218 AA424266 Hs.123542 EphA3 4.6

430597 AA484207 Hs.211B67 ESTs 4.5 

Ac . 433280 AA581404 Hs.2B9037 Homo sapiens cDMA FU14136 %, clone MA 4.5 

45 449821 AIB71141 Hs.211122 ESTs 4.5 

420905 AA521307 Hs,186651 ESTs 4.5 

439752 T7B96B Hs.14411 ESTs 4.5 

452055 AI377431 Hs. 141 693 hypothetical protein MGC10858 4.5 

43018B AL049242 Hs.234794 Homo saplans mRNA; cDNA DKFZp554B033 (fr 4.4 

50 423352 AA324808 H3.193576 ESTs 4.4 

438042 AW296971 Hs.lBQ6lO ESTs 4,4 

452976 AA029994 Hs.61523 ESTs 4.4 

420154 AI093155 Hs.95420 JMZ7 protein 4.4 

428249 AA130914 Hs.183291 zinc finger protein 266 4.4

55 442242 AV647908 Hs.90424 Homo sapiens cDNA: FU232B5 fe, done H 4.4 

452495 AKQQ0101 HsJ97Q0 hypothetical proiefo RJ20094 4,4 

459527 AW977556 Hs.291735 ESTs, Weakly slmflar to 178885 serine/lh 4.3 

433597 AA708205 Hs.100343 ESTs 4.3 

416312 W02640 Hs.16247 ESTs, WeaWy similar to 200439 9A chromes 4.3 

60 420111 AA255652 gb2s21Mt.r1 NCJ^CGAP_GCB1 Homo saplans 4.3 

459045 N69101 Hs40730 ESTs 4,3 

409705 M37762 Hs.56023 brain-derived neurotrophic factor 4.3

433433 AI692623 Hs.1 21513 Homo sapiens done Z^1 placenta oxpras 4.2 

427908 AA417272 Hs.24122 ESTs 4.2 

65 432755 AJOQ3429 gbcAJCQ3429 Selected chromosome 21 cDNA 4.2 

413525 BE145899 gb: MRO-HT020 8-221 299-204-61 0 HT02O8 Homo 4.2 
420355 AW968263 Hs.123126 ESTs 4.2 
427521 AW973352 Hs.290585 ESTs 4.2 
„ 45B912 AI911066 ESTs 4.2 

70 410790 AW803357 gb:lL^^0T79^9O30C-05O^08 UW0Q79 Homo 4.1 

415245 N59650 Hs.27252 ESTs 4.1 
434360 AW01541S Hs.127760 ESTs 4.1 
442786 H50733 Hs.2562fi1 ESTs, Moderately sirilar to ALUB_HUMAN A 4.1 
„ 450597 AJ701635 Hs.207077 ESTs 4.1 
75 424200 AA337221 gtr.EST41944 Endometrial lumor Homo sapie 4.1 

445206 A1350199 HsJ269990 ESTs 4.1 
438431 AW20786O Hs.293116 ESTs 4.1 
448152 AJ741053 Hs.170770 ESTs 4.1 
or . 438875 AA827640 Hs.1 89059 ESTa 4.1 
iSU 400746 Target Exon 4.1 

4130B1 BE064415 gb;RC4-B7031 1 -2411 99-01 2-D03BT0311 Homo 4.1 

404967 Target Exon 4.0

427203 AW829517 Hs.244855 ESTs 4.0 
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442338 AI761976 Hs.1560a0 ESTs 4.0 

455278 BE1T6479 S b:RC3-HT0585-16030tM)22^09 HT0585 Homo 4,0 

452903 AJ953425 Hs.345291 ESTs. WeakJy similar to I38022 hypolheti 4.0 

435136 R27299 Hs.10172 ESTs 4.0 

5 416760 HB51B2 Hs.191327 ESTs. Highly similar to KIAA11 02 pratein 4.0 

421312 AA824627 Hs.291670 ESTs 4.0 

441568 A1733322 Hb.127176 ESTs 4.0 

441736 AW292779 Hs4182 ESTs 4.0 

448982 AJ001531 Hs .22404 protease. sarinB, 12 (neurotrypsin, moto 4.0 

10 434222 AF1 19886 Hs.283941 Homo sapiens PR02591 mRN A, complete cds 3.9 

42D301 AA767526 Hs.22030 paired box gene 5 (B-celNlneage specif 39 

429066 AA86B555 Hs.176222 ESTs 3.9 

43587B R0B330 Hs.20152 ESTs 3.9 

447530 AW192063 Hs.248865 ESTs, Moderately similar to JCS238 galac 39 

15 412988 BE046S80 gb:hn42h03jc1 NCt_CGAPJ*DF2 Homo sapiens 39 

410196 AJ936442 Hjl59B38 hypofoaticri protein FU10B0B 3.9 

411084 T189S7 Ks. 125472 ESTs. Moderaldy similar to KIAA0877 pro 39 

419629 AB02O695 Hte.91852 KIAA088B protein 39 

455646 BE064420 gfcRC4-HT031 1-241 1 99-01 2-cOBBT03t1 Homo 3.9 

20 414441 AA234759 Hs.t32950 ESTs 3.9 

425810 A1923627 Hs.31903 ESTs 39 

40911 1 . ALQ43362 HsJ984 plectatrln homolofly, Sec7 and coiled/col 39 

411479 AW84B047 gb:IL3^214-29129M>52-At2CT0214Homo 3.9 

__ 440450 A1333129 Hs.156147 ESTs 3.8 

25 426342 A1739168 Homo sapiens cDMA FU13458 fc. dons PL 3.8 

431421 AW9B9118 Hs.t08l44 ESTs, Weakly sirrilar to unnamed prolan 38 

418009 Z43062 gb:HSC12BM1 normalized Infant brain cDN 38 

416534 H69043 Hs>224961 Homo sapiens cDMA FU1 4366 fis, done HE 3.B 

„ 407138 H91679 gb:yv04aD7.s1 Soaresfetat fiver spleen 38 

30 419831 AW448930 Hs.5415 ESTs 3.8 

424630 AW27058Q Hs.189311 ESTs, Weakly similar to putative p150 [H 3.8 

431447 AA50513B Ha.291341 ESTs 3.8 

435932 W03928 Hs.114524 ESTs 38 

__ 442447 AA999723 Hs.129607 ESTs 38 

403242 Target Exon 3.8

4339QB AW29B141 Hs.157975 ESTs 3.8 

454037 AW99B716 gb:PM4-BNOO&7-2503aJ-002-f11 BN0067 Homo 3.7 

432101 AI918950 Hs.123642 EpnA3 3.7 

Arx 41B759 AA227679 Hs.1S7621 ESTs 3.7 

40 452462 BE173515 gb:RC2-HT0560-210200-012ffi3 HT0560 Homo 37 

448568 AA140121 Hs.71947 ESTs 3.7 

447785 AL041765 Hs340375 ESTs 3.7 

451746 M86178 Hs.311258 ESTs 3.7 

436345 AAQ73008 Hs.121572 ESTs 3.7 

45 446862 AV660697 Hs.282700 ESTs 37 

419875 AA853410 Hs.93557 proenkephalin 3.7

421040 AA715026 Rs.1 35280 ESTs 37 

454860 AW835767 o^W4^T0016^240200-1104>06LT0016Horno 3.7 

450687 AA49S80Q gteo»05b07.st Soares_pfiHMPu_S1 Homo sap) 3.7 

50 407437 AF220264 gfcHomo sapiens MOST-1 mRNA, complete cd 3.7 

431474 AL133990 Hs.190642 CEGP1 protein 37 

448004 AW451477 Hs.257465 ESTs 37 

419261 X07876 Hs.89791 wingless-type MMTV integration site fami 3.7

_._ 422899 016471 Hs.12.1571 Human mRNA, Xq terminal portion 3.7 

55 439075 AF085333 Hs.2&2520 ESTs 3.7 

440947 AA910403 ESTs 3.7 

447458 AI7410B2 Hfi.158961 ESTs 37 

403481 Target Exon 36 

404561 trichorhinophalangeal syndrome I gene (T 3.6

60 417565 AJ203405 Hs.47831 ESTs 36 

430096 U91935 Hs.233321 retina-derived POU-domain factor-1 3.6

446658 H71739 Hs.200227 ESTs, Moderately similar to A53959 Qirom 36 

449655 AID219&7 Hs.59970 ESTs 36 

450630 AA010429 Hs.191939 ESTs 36 

65 455067 AW&5453B gb:RC3-CT0255-20010<W)24-brj2CT0255 Homo 36 

423566 AW976434 Hs.3623 hypothetical protein FU1 1220 3.6 

461193 N29850 Ha.44096 ESTs 3.6 

420026 A1B31190 Hs,l66676 ESTs 3.6 

_ 426917 AAS13B14 Hs.172854 DKFZP56680923 profain 36 

70 423430 AJ381837 Hs.155335 ESTs 36 

433563 A1732637 Hs.277901 ESTs 3.6 

443744 A1084326 Hs.271548 ESTs, Weakly similar to 178685 serine/Ui 35 

456373 BE247703 Hs.89751 membrane-spanning 4-domains, subfamily A 3.5

„ 419569 AW973708 Hs.201925 Homo sapiens cDNA FU13446 fis, cbna PL 35 
75 401132 C12000517*:gi|4758712Ire!lKP_004653.1|a 35 

421105 AA7G6601 Hs.125113 ESTs 3.5 

435177 AI016174 Hs.42936 ESTs 35 

434763 AA646818 gfcnsOJalljl NCl_CGAP_Ew1 Homo sapiens 3.5 

_ rt 449668 AW372068 Hb.2Q1420 ESTs, Moderately simitar to ALU7J4UMAN A 3.5 

SO 450216 AAB73345 Hs.60226 Homo sapiens, clone 1MAGE3621638, mRNA, 3.5 
403510 Target Exon 35 

435681 AA694192 Hs.14B979 ESTs 35 

451722 R86374 Hs.40861 ESTs 3.5 
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Hs.5232 


A A AQ ft ft 


AW1 19071 


Hs. 153287 


A*iPCA t 




Hs. 124344 




AA7Ua046 


Rs^27552 


A1QZA1 


AA4558B9 


HSu16727S 


4 I »difcl4 


R44013 


Hs.1 64225 




AVViAJ4yy4 


HS,25d45D 






HS.29J971 




AA728964 


HS.2S3339 


404848 






WWDU 


AI350337 


11,. icitrcu 


412484 


AL13390Q 


Hs.792 


417435 


NMJJ05181 


Hs.82129 


430409 


AF039390 


Ha.241382 


436340 


rt Ami r> 

R42245 


F&21606 


441390 






442231 


W0Z434 


H&.22Z413 


447124 


AW976438 


Hs.17426 


Ac KVbrbT 

450297 


AYV901347 


Ms. 38592 


453692 


T79703 




456995 


T89632 


Ha. 170278 


405510 






400379 


Nf«U)18432 




419964 


AAB11657 


Rs.220913 


4Z7033 


AJ457449 


Hs. 192817 


422321 


AA906427 


Hs. 181035 


43DS29 


Alm4r4 nnn 

AW451999 


*u_ 4 n. Ann j 

Hs. 194024 


420721 


AA927802 


Hs. 159471 


433628 


A1821784 


Hs,1 66578 


436703 


AW8BQ&14 


Ra.146381 


429716 


R25585 


Hs.211933 


427235 


An 26288 


Hs.192232 


459646 


AW8B396B 


Hs.321190 


428923 


BE047698 


Hs. 188785 


431406 


AA504757 


Hs.105738 


438801 


AA825971 


Ks.1 24284 


445432 


AV653771 




450003 


AA777809 


Hs.191995 


446108 


AW300021 


Hs.170685 


425523 


ABO 07 948 


Hs. 158244 


427473 


AW274439 


H3.2527D9 


434520 


AA205273 


Hs.177011 


447282 


A1989963 


Ks. 197505 


4471 B2 


BE24186B 


Hs.175B5 


432229 


AW290976 


Hs. 143567 


418985 


Al 04 2330 


Hs.67128 


41B719 


AW975590 


Hs. 161 707 


455710 


SE072049 




420300 


AA256245 


Hs.l 27573 


416662 


T25853 


Hs.7538 


411435 


AW846433 




420851 


AA281062 


Hs.29493 


428715 


AW293716 


Hs.53126 


429318 


AW561930 


Hs.1O250O 


444346 


m not OH 


Hs.10710 


453973 


A1291695 


Ks.61993 


434497 


AI821B03 


Ks.1365B0 


439306 


BE2201S9 




420606 


BE548277 


Hs.103104 


4)8966 


All 23555 


H5.81796 


438118 


AW7 53311 


Hs.346690 


447516 


W05355 


Hs,t02971 


449570 


AA001793 




424994 


AW954525 




417676 


AJ9U8607 


Hs.3781 


402145 






448131 


AI675054 


Hs.200481 


422352 


AA766298 


Hs.99200 


432625 


A1243595 


Hs.94830 


447163 


AI554733 


Hs.173182 


44fOW/ 


AI8 86036 


Hs.213675 


459535 


AV654907 




415467 


RS0891 


H5J260274 


434408 


At031771 


Hs.132586 


456354 


X58411 


Hs.1219 


453769 


AA628517 


HS.11B502 


412666 


AU3B0116 


Ks.74420 


418319 


AW611703 


Hs.l90m 


419088 


AI538323 


Hs.52620 


420397 


NMJKJ7018 


Hs.97437 


422165 


AL041199 


Ha.1481 


448044 


AW5S682 




454665 


AWB12866 





HSPC1 25 protein 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp586N2424 (f 

FYVE-finger-containing Rab5 effector pro

ESTs 

ESTs 

ESTs 

ESTs 

ENSP0D00024Q769';BG153O3.1 (sirrila- toC 
fibroblast growth factor 7 (keratinocyte
ADP-ribosylation factor domain protein 1
carbonic anhydrase III, muscle specific
tumor necrosis factor (ligand) superfami
ESTs 
ESTs 
ESTs 

RBPMta protein 
hypothetical protein HJ23342 
gbiyd7le08.r1 Soares fetal liver spleen 
ESTs 

ENSPTX0D0233779 # :Hypo&ietical 68,0 kDa p 
Homo sapiens ovarian cancer related prat 
ESTs 
ESTs 

hypothetical protein MGC11 296 

ESTs y 

ZAP3 protein

ESTs 

RNA binding motif protein, X chromosome

collagen, type XIII, alpha 1

ESTs 

gb:QV3-OT0063-290300-1 35^04 OT0063 Homo 

ESTs 

ESTs 

ESTs 

gtrAVB 53771 GLC Homo sapiens cDNA done 

ESTs 

ESTs 

KIAA0479 protein
ESTs 

hypothetical protein 
ESTs 

KIAA0801 gene product
ESTs 

hypothetical protein FLJ23309
ESTs 

gbcPM4^T0532-170t(XMl04-fl]6 BT0532 Homo 
Homo sapiens FKSG41 (FKSG41) mRMA, com pi 
ESTs 

gb:avTJ-CTQ179-070300.143-tj02 CT0179 Homo 

hypothetical protein FLJ20142

ESTs 

hypothetical protein dJ511E16.2

hypothetical protein FU2D417 

ESTs. Weakly similar to ALU 1 JIUMAN ALU S 

ESTs 

WD40 protean Qao1 

ESTs 

ESTs 

ESTs 

hypothetical protein FU14751 

gbtzh86cOBji SoaresJe^aL 1 hwr_sp tee 

gb:EST366595 MAGE resequences, MAGCHomo 

similar to murine feuclne-rich repeat pr 

TargBtBcon 

ESTs 

ESTs 

ESTs, Moderately similar to TO3O04 A-Wn 

ESTs 

ESTs 

gbAV654907GLC Homo sapiens cDNA clone 

ESTs 

ESTs 

alcohol dehydrogenase 4 (class II), pi p
ESTs 

origin recognition complex, subunit 3 (y
ESTs, Weakly similar to A46010 X-linked
integrin, beta 8
centrosomal protein 1
histidine decarboxylase

gb:tk13801,x1 Na_CGAP_U24 Homo saplere 
gt>:RC3-ST0186-30010fW>174)03 ST0186 Homo 
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405321 






Target Exon 


3.1 


429569 


M454993 


Hs.138343 


ESTs, Weakly similar to 178685 serine/th


3.1 


434512 


AW139932 


Hs.188941 


ESTs 


3.1 


4521 02 


UQ4343 


Hs.27954 


CD86 antigen (CD28 antigen ligand 2, B7-


3.1 


433444 
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Hs.129815 


ESTs 


3.1


410B21 
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ESTs, Weatty similar to T00365 hypothecs 
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ESTs 
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gb:Hwno sap'cens F-box protein FW3b (FBL 


3.1 


416423 


H54375 
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ESTs 
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416037 
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Hs.86447 


ESTs 
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N4B921 


Hs,27441 


KIAA1615 protein 


3.1 


420179 
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ESTS 
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436295 


N73895 




gb:za62d06.s1 Soares fetal liver spleen 


3.1 


405548 
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423595 
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ESTs 
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ESTs 
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Homo sapiens PRQ1 082 mRNA, complete cds 


3.1 
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NM_030B1 3*:Homo sapiens suppressor of po 
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410869 
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425354 


U62027 
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441492 
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ESTs 
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ESTs 


3.1 


435021 


AA922192 


Hs.54709 


ESTs 


3.0


408832 


AW0B569Q 


Hs.6342B 


ESTs, Weaty similar to Z195J4UMAN ZINC 
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450580 


N400B7 




ESTs 


3.0 


432319 


AW510770 


HS.12B386 


ESTs 


3.0 


453713 


R20640 


Ks.79133 


cadherin 8, type 2


3.0


445784 


AI253155 


Hs.146065 


ESTs 


3.0 


416642 


T9B118 


Hs.226313 


ESTs 


3.0


41B94S 


AE57097 




gb:qd43h07ji1 SoaresJelaLheartJlbHH19W 


3.0


433796 


AA810867 


Hs.1B5997 


ESTs 


3.0 


439182 


AFO8603O 


Hs.21621 


hypothetical protein DKFZp7620576 


3.0 


404995 






ENSP00000251890*: Monocytic leukemia zinc


3.0


444794 


A1419991 


Hs.145225 


ESTs 


3.0 


443634 


H73972 


Hs.134460 


ESTs 


3.0


42D133 
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Hs.155543 


ESTs 


3.0 


407629 


AA045084 


Hs.29725 


hypothetical protein FLJ13197


3.0 


426743 


AA383B33 


Hs.245022 


ESTs 


3.0


442326 


H92962 


Hs.124813 


hypothetical protein MGC14817 


3.0


449913 


AA004896 


Hs.333016 


ESTs 


3.0


454095 


AWD62757 




gb£fuUKJIt)t03-120899-rj37-g07 CT0103 Homo 


3.0


419622 


AA452054 


Hs.119338 


ESTs 


3.0 


449745 


AI688593 




gb^l38a05j(5 Soares breast 3NbHBst Homo 


3.0


42W12 


AA428240 


Hs.126083 


ESTs 


3.0


42B200 


A1039624 


Hs.98388 


ESTs 


3.0


414951 


AW794931 


Hs.100861 


hypothetical protein FLJ14600


3.0 


431669 


AA521136 


Hs.190176 


ESTs 


3.0
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ESTs 


3.0
TABLE 66B



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey

410790 
410869 



4H43S 
411479 

412988 
413081 
413525 
416009 
41B94B 
420111 
420352 
424200 
424994 
426002 



CAT Number Accession 



1225131J 
122M23J 



1245660J 
1247077J 

13421KL1 
134B5B3J 
1374635J 
1566379J 
18080BJI 
190755_1 
192979J 
236S95JI 
245786J 
285602J 



AW803357 AW803423 AW812233 RD6814 

AW808361 AWB08404 AW8083B6 AWS0B594 AW808B54 AW80B813 AW80B551 AW80B676 AWB0B350 AW80B406 AW808694 AWBC8934 
AW808829 AW808385 AW606422 AWBQB401 AWB08409 AW808760 AW808883 AWS08521 AW808539 AW606609 AW80S472 AW808739 
AW80B704 AW80B558 AWB08714 AW803420 AWB 
AW846433 AW846159 AW84G377 AWS46528 

AW848047 AW648202 AW948631 AWB4BW2 AW848702 AW84S1 21 AW848B32 AW840140 AW84B571 AW848009 AW848067 AW848Q69 
AWB48906AWB48214 
BE04668D B EC 45758 BE044958 
BE064415 BE064430 BE064446 

BE1458B9 BE145B4S BE145849 BE145B53 BE145927 BE145925 
243062 R1 321 3 H14422 

AI217097 AWB8609D W3B035 W3B792 AA232835 AW936043 
AA2S5652 AA2B091 1 AW967920 AA262684 

BE258B35 AW968316 AA2S8918 AW843305 R14744 A1580388 BE071 923 R3S2B0 
AA337221 AA3367S6AW966196 
AW954525AI372685 AA349501 AI3726B7 H10564 
AA418703 AA418711 BE071915BE071920 BE071912 
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426342 

430536 
432765 
433523 
434763 
436295 
439092 
439306 
440947 
442461 
445432 
448044 
449570 
449745 
450317 
4505 BO 



450582 
4506B7 
452462 
453682 
454037 
454096 
454171 
454457 
454665 
454660 
454968 
455057 
455276 
45S646 
455710 
457374 
455912 



290035^2 

319643_1 

353907J 

36B873.J 

392847_1 

41733J 

468554J 

4706B_1 

5059Q4_1 

54358B.1 

63943J 

7471 96_1 

81018J 

614534 1 

831956J 

83929.1 



B3933J 

84327 1 

918580 1 

977454.1 

99G237J 

1007449.1 

1049240_1 

1207274.1 

122859U 

1237732.1 

1247029J 

1252050J 

1272541.1 

1348557.1 

1352368 1 

92B758L1 

823104.1 



AI7391 68 AA426249 AH99636 AW50519B AW977291 AA824583 AAB83419 M724079 A101 5524 AI377728 AW233682 AI928140 AA73 1 43B 

A10924C4 AI085630M731340 

AW968435 AW968670 AA46092Z BE350425 

AJ003429 AJ003367 AA5B4825 

H29B82 AW6B5533 AW149901 AI572917 AA5985D0 AI6B6466 AI33G390 AW864390 AWB64320 

AA648618 AW974389 H51771 

N73395AJ001872 

AA830149 AW978407 M85983 AW503637 
BE220199 W01B13 AF0B6113 N70760 BE221405 
AA910403 AI815593 W58361 AW16252OAIB16550 
N99828 BE079873 All 10738 AF074645 
AV653771 8E089370 

A145B682 H24240 R14537 R18426 AW667082 
AA001793 AA001871 

A1668593 AI820774 R86205 H39971 H22177 H26241 
AI6926B9 R14223R16395 

M40087 H12925 AA460779 AA096372 BE160847 AW816B90 AW816B93 AW816577 AW516891 AW81 6882 AW816868AW81 6889 AW81 8940 
AW816892 AW816941 AWB16576 AA0291B3 TB3320 HB5850 AA010295BE1 60823 Z25353M150BB3 AW8B7764 AW023806 AW022095 
AA164518 AA730973 W00417 WB5303 

A1339732 AA 01 0300 AW5l5Mt AA768334 N29860 AA425B74 AA4251 16 AAB65829 AW936B7B 

AA49SB00 AA495737 AA010736 AA654716 AA640726 

BE173515 BE173560 A1902B60 

T79703T96307AL079725 

AW99B716 AW022148 N68O20 

AW062757 AW176890 AW0B2756 AW176B95 AW176B69AW17690D AW176697 AW176928 AW17686B AW176892 

AW854832 AWS54798 AW854857 AW85481 6 AW854B34 AW854817 

AW753456 AW753036 AW854868 AW8548B2 

AW812866 AWB12746 AWB12747 AW812884 AW812763 AW812722 

AWB35767 AW835537 BE16D187 

AW649046 AW847956 AW849039 AWB47957 AWB48279 AW848698 AW849034 AW849033 
AW854538 AW854418 AW854412 

BE176479 BE17667B BE17S357 BE176650 AWQ86079 BE176676 BE176615 BE1765S5 BE1764B9 BE176610 BH176362 
BE0B442O BE064435 BE064429 BE064414BE064400 BE064517 

BE072049 BE069471 BE069489 BE0B9478 BE069479 BE069492 BE069485 BE072122 BE072124 BE069491 BE0S9486 BE069464 

AA493662 AWB9739B BE154S14 

AI91 1066 A1933734 W680888 AJ003599 



TABLE 66C 



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22," Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.
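
For readers who want to work with the position column programmatically, the entries can be read as comma-separated exon ranges, each written as two coordinates joined by a hyphen (for example `1344-1442,1545-1697`). The sketch below is a hypothetical helper, not part of the original disclosure; it assumes the coordinates are inclusive and tolerates ranges written in descending order.

```python
def parse_nt_position(text):
    """Split a position entry such as '1344-1442,1545-1697' into (start, end) pairs."""
    exons = []
    for part in text.split(","):
        part = part.strip()
        if not part:
            continue
        start_text, end_text = part.split("-")
        exons.append((int(start_text), int(end_text)))
    return exons


def exon_lengths(exons):
    """Length of each exon, assuming inclusive coordinates (an assumption)."""
    return [abs(end - start) + 1 for start, end in exons]


exons = parse_nt_position("1344-1442,1545-1697")
print(exons)
print(exon_lengths(exons))
```

Entries damaged in extraction (a missing hyphen, or letters in place of digits) will raise a `ValueError` here, which is a convenient way to flag rows that need to be checked against the printed table.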



Pkey


Ref


Strand


Nt_position


400533 


6981826 


Minus


277132-277595 


400746 


7329326 


Minus 


147703-147896 


401132 


8705350 


Minus


85679-85795 


402145 


6016280 


Plus 


113086-114600


403242 


7637817 


Minus 


11297-12511 


403481 


9965004 


Plus 


93496-93633 


403510 


7652047 


Plus


61856-62027 


403657 


6650463 


Minus 


1344-1442,1545-1697 


404003 


8655948 


Plus 


198349-199096 


404561 


9795960 


Minus 


69039-70100 


404592 


9943965 


Minus


39067-39225 


404848


8248647 


Minus 


23955-24034,25143-25264


404967 


7523744 


Minus 


89944-90729


404995 


6006247 


Minus 


154015-154123 


405254 


7329374 


Plus 


26556-28684 


405321


3419846 


Minus 


44654-45210 


405348 


2914717 


Minus 


43310-43452 


405510 


7630909 


Minus


101028-101174 


405548 


1532158 


Plus 


11552-11686 



TABLE 67A: 689 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES

Table 67A lists about 689 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 9009 probesets on the Affymetrix/Eos Hu03 GeneChip
array that showed some expression in human xenograft tumors and cell lines. The 689 genes were selected such that the ratio of "average" prostate cancer to "average" normal adult
tissues was greater than or equal to 2.5. The "average" prostate cancer level was set to the 75th percentile amongst prostate cancers. The "average" normal adult tissue level was set
to the 85th percentile amongst non-malignant tissues. In order to remove gene-specific background levels of non-specific hybridization, the 10th percentile value amongst
non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
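
Written as a formula, the criterion in the paragraph above reads as follows. The notation is introduced here for illustration and is not from the original: $P_k(\cdot)$ denotes the $k$-th percentile of expression for one probeset, $T$ the prostate cancer samples, and $N$ the non-malignant tissue samples.

```latex
\[
R_1 \;=\; \frac{P_{75}(T) \;-\; P_{10}(N)}{P_{85}(N) \;-\; P_{10}(N)},
\qquad \text{probeset listed if } R_1 \ge 2.5 .
\]
```

The same background term $P_{10}(N)$ appears in numerator and denominator, so the ratio compares signal above the gene-specific background rather than raw intensities.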

Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of tumor to normal body tissue
Pkey ExAccn UnigeneID Unigene Title R1

443999 AF179274 Hs.22791 transmembrane protein with EGF-like and 73.2

426747 AA535210 Hs.171995 kallikrein 3, (prostate specific antigen 65.0

432441 AW292425 Hs.163464 ESTs 56.0

446057 AI420227 Hs.149358 Trp-p8 transient receptor potential cati 55.0

400302 N48056 Hs.283946 folate hydrolase (prostate-specific memb 48.1 

414569 AF109298 Hs.118253 prostate cancer associated protein 1 41.4 

432240 AI694767 Hs.129179 Homo sapiens cDNA FLJ13581 fis, clone PL 40.3

419526 AI821895 Hs.193481 ESTs 35.9

400298 AA032279 Hs.61635 six transmembrane epithelial antigen of 35.7

400299 X07730 Hs.171995 kallikrein 3, (prostate specific antigen 34.9
453370 AI470523 Hs.139335 ATP-binding cassette, sub-family C (CFTR 33.9
409731 AA125985 Hs.56145 thymosin, beta, identified in neuroblast 32.8

424099 AF071202 Hs.139336 ATP-binding cassette, sub-family C (CFTR 32.6

400292 AA250737 Hs.72472 BMP-R1B 28.9

400287 S39329 Hs.161350 kalBkreln 2. proslafc 2B.a 

425075 AA506324 Hs.1652 acid phosphatase, prostate 28.6

409361 NM_005982 Hs.54416 sine oculis homeobox (Drosophila) homolo 27.8

415539 AI733881 Hs.72472 BMP-R1B 27.0

42B819 AL13S623 Hs.193914 KIAA0575 gene product 25.7 

409262 AKD00S31 Hs.52256 hypothetical protein FU20624 25.5 

432374 W68815 Hs.301885 Homo sapJens cDNA FU1 1346 fls, clone PL 24.B 

407168 R45175 Hs.117183 ESTs 24.5 

25 400296 AA305627 Hs.139338 ATP-Wndlng cassette, sub-family C (CFTR 23.6 

428336 AA5Q3115 Hs. 183752 mlOTsemlncipiotel r>. beta- 23.5 

433043 W57554 Hs.125019 lymphoid nuclear protein (LAF4) mRNA 23,4 

403047 NM_0O5656*:Horno sapiens transmembrane pr 23.4 

444484 AK002126 Hs.11260 hypothetical protein RJ11264 22.0 

423073 BE252922 Hs.123119 MAD (mothers against decapentaplegic, Dr 21.6

407709 AA45S135 Hs.23023 ESTs 21.5 

401424 NMJXm72:Homo sapiens arginase. type It 20.6 

448290 AK002107 Hs.20343 Homo sapiens cDNA FU11245 fis, done PL 19.7 

434666 AF151103 Hs.112259 T cell receptor gamma locus 18.5

35 415989 At2fi7700 ESTs 17.9 

437062 AA851697 Hs.120591 ESTs 17.a 

451939 U80456 Hs.27311 single-minded (Drosophila) homolog 2 16.8

43154B AJB34273 Hs.9711 novel protein 16.6 

425B2B NM.004476 Hs.283946 folate hydrolase (prostate-specific memb 16.5 

428852 NM_000346 Hs.2316 SRY (sex determining region Y)-box 9 (ca 16.5

450096 AffiB2088 Hs.79375 l«locarboxylase synthetase tbiolto-Iprop 16.5 

413597 AW302885 Hs.117183 ESTs 16.1 

411095 U80034 Hs.68583 mitochondrial intermediate peptidase 15.7

435677 AASB4142 Hs.293726 ESTs, Weakly sunllar to TSGA RAT TESTIS 153 

45 419743 AW40B762 Hs.5957 Homo sapiens clone 24416 mRNA sequence 15.3 

407122 H20276 Hs.31742 ESTs 15.1 

4Z79S8 AA41B000 Hs.98280 potassium Intermedla^smailconduchrJx^ 14.1 

418829 AA516531 Hs.55999 NK homeobox (Drosophila), family 3, A 13.9

426501 AWD43782 Hs.293616 ESTs 119 

50 428598 AB033Q7Q Hs.194408 KIAA1 244 protein 13.7 

418961 AW967646 Hs.23023 ESTs 13.3 

448519 AW 1 75665 Hs^78695 Homo sapiens prosleCn mRN A, complete ctis \Z2 

418848 A1820S61 Hs.193465 ESTs 13.1 

42B39B A124936B Hs.98558 ESTs 13.0 

55 429220 AW207208 E8Ts 12.7 

401451 NM_C04496*:Homo sapiens hepatccyte node 12.4 

424012 AW36B377 Hs.1 37559 tumor protein 63 kDa vrith strong tiomotog 12.4 

419078 M93119 Hs.69584 insulinoma-associated 1 12.3

450382 AA397658 Hs.60257 Homo sapiens cDNAFU1359B fa. clone PL 11.9 

60 430376 AW292053 Hs.1 2532 ctiromosorrie 1 open reading frame 21 11.8 

408826 AF216077 Hs.48378 Homo sapiens clone HB-2 mRNA sequence 11.4 

429918 AW873986 Hs.1 19383 ESTs 113 

427212 AW293849 Hs.5B279 ESTs, Weakly simitar to ALU7_HUMAN ALU S 11.1 

412446 AR88015 ESTs 11.1 

65 433404 T329B2 ESTs 10.9 

420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 10.9

427674 IWL003528 Hs^17B H2B hfetone famly, member Q 10.6 

444151 AW972917 Hs.128749 alpha-methylacyl-CoA racemase 10.4

„ 447342 AI19926B Hs.19322 Homo sapiens, Similar to RIKENcDNA 2010 103 

70 418278 AK)fiB489 Hs. 83937 hypothetical protein 10.3 

430228 BE245562 Hs.2551 adrenergic, beta-2-, receptor, surface 10.2

433927 AI557019 Hs.116467 small nuclear protein PRAC 10.2

408000 L11690 Hs.198689 bullous pemphigoid antigen 1 (230/240kD) 10.1

449825 NM.014253 odz(odd OJtown, Dogopliila) homolog 1 10.1 

75 431117 AHJ03522 Ha.25G500 delta (Drosophlta)-lfce 1 10.0 

439444 A1277652 Hs.54578 ESTs. Weakly simitar to I38022 hypothec 9.8 

437716 A19272B8 Hs.196779 ESTs 9.8 

440529 AW207640 Hs.1G478 Homo saptens cDNA; FU21718 fis, clone C 9.7 

453160 AI263307 H2B histone family, member L 9.7

432715 AA247152 Hs.200483 ESTs, Weakly similar to KIAA1074 protein 9.7

450325 AI935962 Hs.91973 ESTs 9.6 

434170 AAB2650S Hs.122329 ESTs 9.6

428600 AW883261 Hs.136860 hypothetical protein DKFZp434K1421 9.6 
447033 AI357412 Hs.157501 ESTs 9.5

401747 Homo sapiens keratin 17 (KRT17) 9.3

434423 NM_006769 Hs.3B44 LIM domain only 4 9.3

411887 AW1B2924 Hs.128790 ESTs 9.2

457211 AW972565 Hs.32399 ESTs, Weakly similar to S51797 vasodilat 9.2

4152S3 AA948033 Hs.130853 ESTs 9.2 

432729 AK000292 Hs.130732 hypothetical protein FLJ20285 9.2

421895 N62293 Hs.45107 ESTs 9.1 

432101 AI918950 Hs.123642 EphA3 9.1

451640 AA195601 Hs.26771 Human DNA sequence from clone 747H23 on 9.0

428046 AW812795 Hs.337534 ESTs, Moderately similar to I38022 hypot 8.9

420218 AW958037 ribosomal protein L4 8.8

434926 BE54326S Hs.50252 mitochondrial ribosomal protein L32 8.6 

424692 AA429834 Hs.151791 KIAA0092 gene product 8.6

417333 AL157545 Hs.173179 bromodomain and PHD finger containing, 3 8.6

452594 AU076405 Hs.29981 solute carrier family 26 (sulfate transp 8.5

446100 AWSB710S Ks.13B&4 hypothetical protein dJ462023-2 8.5 

400294 N95796 Hs.278695 Homo sapiens prostein mRNA, complete cds 8.5

416182 NM_004354 Hs.79069 cyclin G2 8.5

431542 H63010 Hs.5740 ESTs 8.4

447397 BE247676 Hs.18442 E-1 enzyme 8.3

432674 AA641092 Hs.257339 ESTs, Weakly similar to I38022 hypothet 8.1

456660 AA9Q9249 Rs.1122B2 solute carrier family 30 (zinc transport 8.1

443822 AI067412 Hs.143611 ESTs, Weakly similar to 2004399A chromos 8.1

451684 AF2167S1 Hs.26813 CDA14 8.1

437124 AA554458 KIAA0666 protein 8.0

416239 AL038450 Hs.48948 ESTs 8.0

432415 T16971 Hs.269014 ESTs, Weakly similar to A43932 mucin 2 p 7.9

424800 AL03558B Hs.153203 MyoD family inhibitor 7.9

428728 NM_016825 Hs.191381 hypothetical protein 7.9

418564 AA631143 Hs.278695 Homo sapiens prostein mRNA, complete cds 7.8

442049 AA310393 Hs.190044 ESTs 7.8 

425597 NM_003816 Hs.2442 a disintegrin and metalloproteinase doma 7.6

4108B9 X91662 Hs.66744 twist (Drosophila) homolog (acrocephalos 7.7

443180 R15875 Hs.258576 claudin 12 7.5

433285 AW975944 Hs_23739G ESTs 7.5 

410B70 U81599 Hs.66731 homeobox B13 7.4

418B36 AI655499 Hs.161712 ESTs 7.4 

444922 AI921750 Hs.144871 Homo sapiens cDNA FLJ13752 fis, clone PL 7.3

431992 NM_002742 Hs.2891 protein kinase C, mu 7.3

42S905 AB032B59 Hs.318584 novel C3HC4 type Zinc finger (ring finge 7.3

432586 AA568548 ESTs 7.3 

431325 AW025751 Hs.5794 ESTs, Weakly similar to 2109260A B cell 7.3

435047 AA454985 Hs.54973 cadherin-like protein VR20 7.3

447476 BE293466 Hs.20880 ESTs, Weakly similar to I38022 hypotheti 7.3

422728 AW937826 Hs.103262 ESTs, Weakly similar to ZN91_HUMAN ZINC 7.2

42610B AA622D37 H&.16646B programmed cell death 5 7.1 

433323 AA805132 Hs.159142 ESTs 7.1 

423349 AF010258 Hs.127428 homeobox A9 7.0

416815 U41514 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:polyp 6.9

425154 NM_001851 Hs.154850 collagen, type IX, alpha 1 6.9

438869 AF075009 gb:Homo sapiens full length insert cDNA 6.9

418329 AW247430 Hs.84152 cystathionine-beta-synthase 6.8

452367 U71207 Hs.29279 eyes absent (Drosophila) homolog 2 6.8

418601 AA279450 Hs.8636B calmegin 6.6

423769 NM_004917 Hs.218366 kallikrein 4 (prostase, enamel matrix, p 6.8

436962 AW377314 Hs5364 DKFZP5641052 protein 8.3 

425465 L1B9B4 Hs.1904 protein kinase C, iota 6.6

450377 AB033091 KIAA1265 protein 6.6

451418 BE397790 Hs.26389 hypothetical protein FLJ20287 6.6

418250 U29926 Hs.8391B adenosine monophosphate deaminase (Isofo 6.6 

416111 AA033813 Hs.79018 chromatin assembly factor 1, subunit A ( 6.5
426350 NM_003245 Hs^022 transglutaminase 3 (E polypeptide, prote 6.5

440774 AI420511 Hs.163934 ESTs 6.5
413992 W26276 Hs.136075 RNA, U2 small nuclear 6.4

421928 AF013758 Hs.109643 polyadenylate binding protein-interacti 6.4
433293 AF007835 Hs.32417 hypothetical protein MGC4309 6.4
452340 NM_002202 Hs.505 ISL1 transcription factor, LIM/homeodoma 6.4
451027 AW519204 Hs.40808 ESTs 6.4
436063 AK000028 ribosomal protein S24 6.3
440749 W22335 Hs.7392 hypothetical protein MGC3199 6.3
432103 T15803 Hs.272458 protein phosphatase 3 (formerly 2B), cat 6.3
431958 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta 6.2
411019 AW993097 Hs.48617 Homo sapiens cDNA FLJ12540 fis, clone NT 6.2
421568 NM_000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 6.2
452744 A12B7652 Hs.246107 Homo sapiens mRNA; cDNA DKFZp434E082 (fr 6.1
425259 AU049280 Hs.155397 Homo sapiens mRNA; cDNA DKFZp564K143 (fr 6.1
442772 AW5O3660 Hs.5957 Homo sapiens clone 24416 mRNA sequence 6.1
448045 AJ297436 Hs.20166 prostate stem cell antigen 6.0
428342 AJ739168 Homo sapiens cDNA FLJ13458 fis, clone PL 6.0
453439 AJ572438 Hs.32976 guanine nucleotide binding protein 4 6.0
433517 AW022133 Hs.189638 ESTs 6.0
424036 AA7706B8 H2A histone family, member L 5.9
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AI523B9B Hs.17617 

AUM2613 Hs.262476 

AA24275B Hs.79136 

H18636 Ks.31608 

AB020725 Hs.58009 

AL1S7424 Hs.612B9 



420092 AA814043 Hs.88045 ESTs 

410258 AA316181 Hs.61635 six transmembrane epithelial antigen of 

425071 NM_013989 Hs.154424 deiodinase, iodothyronine, type II

441866 BE464341 Hs.21201 nectin 3; DKFZP566B0M6 protein 

450244 AAO07534 Hs.125062 ESTs 

429165 AWQ0S686 Hs.118258 prostate cancer associated protein 1

451952 AL120173 Hs.3016S3 ESTs 

429054 AJ001443 Hs.195614 splicing factor 3b, subunit 3, 130kD

445B73 AA250970 Hs.251946 poly(A)-binding protein, cytoplasmic 1-1

412935 BE267045 Hs.75064 tubulin-specific chaperone c

415068 Z1944B Hs.131887 ESTs, Weakly similar to T2439G hypotheti

434826 AF155661 Hs.22265 pyruvate dehydrogenase phosphatase 

415621 AI646602 ESTs 

415653 AA76B553 Hs.193145 metallothionein 1E (functional)

433332 A1367347 HS.44B98 Homo sapiens clone TCCCTAQ0151 mRNA sequ 

465497 AA112573 Hs.278695 Homo sapiens prostein mRNA, complete cds

418245 AA088767 Hs.83883 transmembrane, prostate androgen induced

AW207440 Hs.185973 degenerative spermatocyte (homolog Droso

M18667 Hs.1867 progastricsin (pepsinogen C)

AA502764 Hs.123469 ESTs, Weakly similar to AF208855 1 BM-01

AW972512 Hs.20985 sin3-associated polypeptide, 30kD

AF226053 Hs.66170 HSKM-B protein 

AW445167 Hs.12B036 ESTs 

Y15723 Hs.75295 guanylate cyclase 1, soluble, alpha 3

AW627932 Hs.302421 Qemh* 

AF15510B HsJS6150 Homo sapiens, Similar to RIKEN cDNA 2810

AW959731 Hs.323099 ESTs 

R27496 Hs.1378 annexin A3

AI417215 Hs.87159 hypothetical protein FLJ12577

NM_022490:Homo sapiens hypothetical prot
ESTs 

S-adenosylmethionine decarboxylase 1
LIV-1 protein, estrogen regulated
hypothetical protein FLJ20041
KIAA0918 protein
synaptojanin 2

NM_005656*:Homo sapiens transmembrane pr
AA348947 Hs.91816 hypothetical protein
AF097994 L-kynurenine/alpha-aminoadipate aminotra

AI239923 Hs.63931 ESTs 
AW953937 H&240B45 ESTs 

X57010 Hs.81343 collagen, type II, alpha 1 (primary oste
AI659306 Hs.73B26 protein tyrosine phosphatase, non-recept
BE177320 Hs.15614B hypothetical protein FLJ13231
A1376329 Hs.126629 ESTs 

Z42387 Hs.83883 transmembrane, prostate androgen induced
AW3728B4 Hs.240770 nuclear cap binding protein subunit 2, 2
U41060 Hs.79136 LIV-1 protein, estrogen regulated
AW578716 Hs.7644 H1 histone family, member 2
R76472 Hs.65646 ESTs
AI497778 Hs.20509 HBV pX associated protein-8
BE410107 Hs.179817 CGI-82 protein; PSDR1
AB020689 Hs.90419 KIAA0882 protein
NM_012445 Hs.28B126 spondin 2, extracellular matrix protein
H18&35 Hs.31608 hypothetical protein FLJ20041
AB016B9B Hs.100469 myeloid/lymphoid or mixed-lineage leukem
BE566823 Hs^9899 ESTs, Weakly similar to G02075 transcrip
AF189723 Hs.105778 ATPase, Ca transporting, type 2C, member
AA514535 rb.2B3704 ESTs
AL039402 Hs.125783 DEME-6 protein
A1623511 Hs,1ia567 ESTs
AA42488t Hs.256301 hypothetical protein MGC13170
AA114016 Hs.75746 aldehyde dehydrogenase 1 family, member
BE514492 Hs.117487 gene near HD on 4p16.3 with homology to
NM_001141 Hs.111256 arachidonate 15-lipoxygenase, second typ
AI6353B6 Hs.142846 hypothetical protein
NM_004805 Hs.94561 sulfotransferase family, cytosolic, 2B,
AW971163 Hs.6019 DnaJ (Hsp40) homolog, subfamily C, membe
NM_014324 Hs.128749 alpha-methylacyl-CoA racemase
AK002135 Hs.3542 hypothetical protein FLJ11273
FW9031 H&22627 ESTs 

F1303B Hs.27373 Homo sapiens mRNA; cDNA DKFZp5640U63 (f
AA1600QO Hs.137396 ESTs, Weakly similar to JC5238 c
D1192B Hs.76845 phosphoserine phosphatase-like
AL122055 Hs.129836 KIAA1028 protein
452017 AF109302 HsJ27495 prostate cancer associated protein 7
437162 AWQQ55Q5 Hs.5464 thyroid hormone receptor coactivating pr
451468 AWS03398 Hs.293863 ESTs, Moderately similar to I38022 hypot
429467 NM_004477 Hs.203772 FSHD region gene 1
451752 AB032997 KIAA1171 protein

430294 AI538226 Hs.32976 guanine nucleotide binding protein 4
452259 AA317439 Hs.28707 signal sequence receptor, gamma (translo
456362 AW973003 Hs.179909 hypothetical protein FLJ22995

752 



425211 

456508 

458809 

410762 

440594 

413328 

447805 

43480B 

45330B 

421470 

418555 

404632 

450B61 

431583 

400303 

4O029O 

410037 

410240 

403046 

419G47 

450203 

450164 

417318 

417153 

412350 

455088 

433B52 

452879 

4303B7 

416276 

441021 

453171 

416795 

427615 

419440 

422048 

453082 

420948 

442592 

421685 

431724 

423242 

434485 

407103 

414085 

422486 

422083 

439735 

420039 

449845 

423445 

433730 

426006 

451982 

4150B2 
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449048 
440300 
427308 
435706 
416B54 
451406 
410075 
410275 
419239 
442501 
436761 



436556 
458440 
446416 
417873 
430598 
421305 
429299 
407894 
414664 
408419 
430945 
432363 
447574 
447595 
453857 
444108 
422390 
417379 
419619 
410102 
418821 



433233 
415276 
440146 
407819 
450402 
432527 
419733 
422134 
432435 
450680 
436420 
430122 
411031 
429259 
407813 
417067 
410227 
417958 
427176 
446825 
447366 
432375 
419713 
427479 
443162 
413950 



419083 
448148 
419465 
432274 
447620 
430589 
418004 
416655 
459284 
421829 
437252 
437957 
417061 
423337 
418B38 
453469 
408063 
422072 
420297 
420522 
408B33 
426647 



Z45D51 

N3976Q 

026067 

W31254 

M01S4 

A1694320 

705387 

U85658 



Hs22920 



Hs.174905 

Hs.7045 

Hs.80296 

Hs.6295 

H8.7991 

Hs.61796 



AA468183 Hs.184598 

AA315267 Hs.23126 

A1817776 Hs.236557 

AA640891 Hs.1 02406 



AI3G4997 
AJ095468 



Hs.7572 
Hs.135254 



AV658299 Hs.1 63959 

BE266&59 Hs.293659 

AK0D1764 HS247112 

BE397354 Hr324830 



A1620463 
AJ278313 



Hs.34740B 
Hs.41143 



AA587775 Hs.66295 

AW9B3697 Hs.44743 

UB06G9 Hs.55999 
AA534489 

AF162SS6 Hs.1B695 

AW379130 Hs.18953 

AL080235 Hs.35861 

R557B4 Hs.140942 
243784 
AA196390 

AA22B899 Hs.101307 

AW248508 Hs.279727 

AA436002 Hs.183161 

BE327427 Hs.79953 

AB040927 Hs.301804 

U88686 Hs.78353 

AW0U231 Hb.90790 
R42I85 

BE218027 Hs.89969 

AW975028 Ks.102754 
AW362955 

AW17S019 Hs.112110 

BH218886 Hs^82070 

AF131784 Hs.25318 

AA443966 Rs.31595 

NM.013342 Hs.233765 

W37943 Hs.34892 

AA420450 H 5^92911 

AL120247 Hs.40109 

AJ001417 Hs.81086 

AB009284 Hs.61152 

AA767382 Hs.1 93417 

AW381569 H&.40334 

A1580252 Hs.293245 

AA340805 Hs.105887 
A1791B55 

AW968050 Hs£238t 

BE410032 Hs.178471 

T49951 Hs.9029 

AA249096 Bs.32793 

AW247529 Hs.6793 

AI479560 H5.S8613 
MM_016578 HsJ2O509 

AW500239 Hs.21187 

AK000382 Hs.274251 
AW290951 

AJ002744 H5J246315 

U37519 H 3.87539 

AW9G8613 Hs.79428 

AF155660 Hs.300496 

AB01B33O Hs.10B7C8 

AI433033 H&.164159 

BE277414 Hs.3947 
A1575944 Hs.166691 
NMJJQ46S5 Ra.127337 
AW385224 Hs.35198 
AB014533 H&33010 
BE0B654S Ha42346 
AB018255 HS.11113B 
A1B28272 Hs.68323 
AW957137 Hs.98541 
AW512232 Hs^54B35 
AA243464 Hs>294101 



similar to S68401 (cattle) glucose induc

Homo sapiens, Similar to RIKEN cDNA 5830

KIAA0033 protein 

GL004 protein 

Purkinje cell protein 4

ESTs, Weakly similar to T17248 hypothet

ESTs 

transcription factor AP-2 gamma (activat

Homo sapiens cDNA: FLJ23241 fis, clone C

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens clone 1 thrombospondin mRNA
ESTs 

Homo sapiens, Similar to RIKEN cDNA A430
hypothetical protein FLJ10902
diphtheria toxin resistance protein requi
hypothetical protein MGC13102
phosphoinositide-specific phospholipase
multi-PDZ-domain-containing protein
KIAA1435 protein 

NK homeobox (Drosophila), family 3, A

gb:nf76g11.s1 NCI_CGAP_Co3 Homo sapiens

tousled-like kinase 1

phosphodiesterase 9A 

DKFZP586E1621 protein

ESTs 

ankyrin 3, node of Ranvier (ankyrin G)
gb:zp99b10.s1 Stratagene muscle 937209 H
Homo sapiens HUT11 protein mRNA, partial
ESTs; homologue of PEM-3 [Ciona savignyi
ESTs 
ESTs 

KIAA1494 protein

SFRS protein kinase 2

Homo sapiens cDNA: FLJ22930 fis, clone K

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FLJ14415 fis, clone HE
mitochondrial ribosomal protein L42 
ESTs 

Homo sapiens clone 25194 mRNA sequence
ESTs 

TCF3 (E2A) fusion partner (in childhood
KIAA1323 protein 
Plakophilin 
K1AA0JJ72 protein 

solute carrier family 22 (extraneuronal

exostoses (multiple)-like 2

ESTs 

ESTs 

ESTs, Weakly similar to putative p150 [H
ESTs, Weakly similar to Homolog of rat Z
ESTs 

nudix (nucleoside diphosphate linked moi
KIAA0798 gene product 
DKFZP434G032 protein 
ESTs 

platelet-activating factor acetylhydrola

Homo sapiens cDNA FLJ12292 fis, clone MA

HBV pX associated protein-8

Homo sapiens cDNA: FLJ23068 fis, clone L

hypothetical protein FLJ20375; KIAA1797

ESTs 

UDP-N-acetyl-alpha-D-galactosamine:polyp
aldehyde dehydrogenase 3 family, member
BCL2/adenovirus E1B 19kD-interacting pro
mitochondrial solute carrier
calcium/calmodulin-dependent protein kin
ESTs, Weakly similar to ALU1_HUMAN ALU S
mel transforming oncogene (derived from
Homo sapiens cDNA FLJ12033 fis, clone HE
axin 2 (conductin, axil)
ectonucleotide pyrophosphatase/phosphodi
ESTs
neural proliferation, differentiation an
ESTs
Hs.63386


ESTs 
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424255 


BE207166 
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AE922189 
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ESTs 
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HS.B16B7 


non-metastatic cells 3, protein express
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BE396290 


Hs.6097 


synaptogyrin 2
ESTs
Homo sapiens cDNA FLJ12187 fis, clone MA
3.1




NH.001898 Hs.123114 
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3.1 
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ABC41036 


Hs.57771 


KHiiKrein n 


3.1 


434970 


AW272262 


rfe2257b7 


cois 


3.1 




BE391Q46 


HsZrB962 


A Ilk J 1 itfAtoln 

ajm-i protein 


3.1 




AA195037 


Hs.1 69341 
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Hs.125291 


ESTs 
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3.0 
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ESTs 
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ESTs 
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ESTs 
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ESTs 
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ESTs 
42081B AW969635 Hs^3032 ESTs, Weakly similar to 1207269A reverse 2.7
438510 AL080220 Hs.6285 DKFZP586P0123 protein 2.7
406S27 T64904 Hs.163780 ESTs 2.7
442315 AA173992 Hs.7956 ESTs, Moderately similar to ZN91_HUMAN Z 2.7
421717 AF230924 Hs.107187 divalent cation tolerant protein CUTA 2.6
425910 AA83Q797 Hs.184760 CCAAT-box-binding transcription factor 2.6
440695


AW088353 


Hs.246240


ESTs 


2.6


416941 




Hs.48778 


til ban protean 


2.6 


422717 


AI557523


Hs.119475


cold inducible RNA-binding protein


2.6


426110 


inj (WVJ40 


HS.1 56 too 


replication factor C (activator 1) 1 (14


2.6 








fatty acid amide hydrolase


2.6 


401197 








2.6




A ACSY7C7C 

AA5U757D 


riS.2oDJDi 


Homo sapiens cDNA: rLJ22o9o Its, cfone n 


2.6 


410297 


AA 140/ 1U 




lumican


2.6 






ris. lDSraUU 


ESTs, Weakly similar to ALU1_HUMAN ALU S




4oJlin 


AlAftT7r>5 IV 

AWo/zdl f 


nS.lZOBZ 


Hcmo haptens mKfvA, cDNA UKr^poDDLa] J [Tt 


2.6 


43/040 


AWU/WJb 


nS.l rdittM 


l*i>0>C 1 


2.6




AW44C5XJU 


Dr7A4 

HSr9/01 


growth arrest and DNA-damage-inducible,


2.6


4.50Z1J 




Hs.71472 


hypothetical protein FLJ10774; KIAA170S


2.6 


408157 


AA047685 


Hs.62946


ESTs 


2.6 


420805 


L10333 


Hs.99947


reticulon 1


2.6 


434144 


Db2tfU4ro 




hypothetical protein MGC3243


2.6 


4^6761 


AIO15709 


Hs.172089 


Homo sapiens mRNA; cDNA DKFZp586l2D22 (f


2.6 


428695 


AT355&47 


Hs.189999 


purinergic receptor (family A group 5)


2.6 


431725 


X65724 


Hs.2839 


Norrie disease (pseudoglioma)


2.6


425174 


087450 


Hs. 154978 


KIAA0261 protein


2.6


41v4o7 


A CH 4ft*} EMC 

Ar1u25Ao 


Hs.63931


dachshund (Drosophila) homolog


2.6 


413W!> 


aoi4U5> 


Hs-75368 


carboxypeptidase E


2.6 


426293 


BE250944


Hs.183556


solute carrier family 1 (neutral amino a


2.6 


428160 


All 29767 


Hs.182874


guanine nucleotide binding protein (G pr


2.6


439680 


AW/4vf 41 


H&5B461 


ESTs, Weakly similar to A35659 krueppel-


2.6


424385 


AA1 55082 


Hs.1463B8 


microtubule-associated protein 7


2.6


4UDO/U 


W/!ft3o2 


nS.ZrooUl 


hypothetical protein MGC13170


2.B 


413762 


AW411479 


Hs.848


FK506-binding protein 4 (59kD)


2.6


429322 


297630 


Hs.226117


H1 histone family, member 0


2.6 


449518 


BE395253 


HsJ0861 


ESTs 


2.6


445919 


T53519 


Hs.334692 


hypothetical protein MGC14141


2.6 


429343 


AKOC07B5 


Hs.1994B0 


Homo sapiens, Similar to epsin 3, clone


2.6


4oB55Z 




Ks.6314 


type 1 transmembrane receptor (seizure-r


2.6


431472 


AK001D23 


Hs.256549 


nucleotide binding protein 2 (E.coli Min


2.6 


409299 


AA045650 


Hs.53125 


small nuclear ribonucleoprotein D2 polyp


2.6 


434672 


AW29402U 


Hs.117721


ESTs 


2.6


443015 


R33261 


Hs.6614 


ESTs, Weakly similar to A43932 mucin 2 p


2.6


452576 


AB023177 


Hs.29900 


KIAA0960 protein


2.6


412B43 


AF007555 


Hs.74624


protein tyrosine phosphatase, receptor t


2.6 


430375 


AW371D4S 


1 | AAm ft 

Hs.93758 


H4 histone family, member H


2.6 


408196 


AL034548 


Hs.43627 


SRY (sex determining region Y)-box 22


2.6 


424339 


BE257148 




endoglycan


2.6 


416292 


AA1 79233 


Hs.42390 


nasopharyngeal carcinoma susceptibility


2.6 


451040 


AA324743 


Hs.40806


ESTs 


2.6


445636 


AW1 05401 




ribosomal protein L23


2.6


A 1 £H"7K 


AWz7QUo7 




KIAA0779 protein


2.6 


42200 D 


M30599 


Hs.110637


homeobox A10


2.6 


441126 


AA570256 




ESTs, Weakly similar to T23273 hypotheti


2.6


446237 


AW27Q516 


Hs.149596 


Homo sapiens, Similar to RIKEN cDNA 2310


2.6


431797 


BE169641 


Hs.270134 


hypothetical protein FLJ20280


2.5


405789 


AI0414O3 




ribosomal protein L29


2.5


423217 


NM_000094


Hs.1640 


collagen, type VII, alpha 1 (epidermolys


2.5 


423119 


AA421973 


Hs.169119


ESTs, Weakly similar to T25731 hypotheti


2.5 


420450 


AA262331 


Hs>f8376 


Homo sapiens clone HB-2 mRNA sequence


2.5


4oglD0 


AI373544 


Hs.331328 


intermediate filament protein syncoilin


2.5


43U942 


AISS7486 


Hs.119122 


ribosomal protein L13a


2.5


452B27 


AI571835 


Hs.55468 


ESTs 


2.5


431609 


AW792792 


Hs^64330 


N-acylsphingosine amidohydrolase (acid c


2.5


431106 


AA991508 


Hs.105317 


ESTs 


2.5


410310 


JQ2931 


Hs.62192 


coagulation factor III (thromboplastin,


2.5 


4oU3d3 


870114 


KS.2394B9 


TIA1 cytotoxic granule-associated RNA-bi


2.5


439864 


AI720078 


Hs.291997


ESTs, Weakly similar to A47582 B-cell gr


2.5


4O0EJZ1 


AA326425 


Hs.192375


ESTs 


2.5


421 UV1 


W22B21 




ribosomal protein L26


2.5


439414 


NM_001183 Hs.6551


ATPase, H+ transporting, lysosomal (vacuo


2.5 


400263 






Eos Control 


2.5 


45U28 


AW0&3384 


Hs.11067 


ESTs, Highly similar to T463S5 hypotheti


2.5


449051 


AW9B14C0 


Hs.333526 


HER2 receptor tyrosine kinase (c-erb-b2,


2.5


431615 


AW295B69 


Hs.235860 


ESTs 


2.5


433037 


NM_01416C 


I Hs^79938 


HSPC067 protein 


2.5


409504 


AA304961 


Hs.699 


peptidylprolyl isomerase B (cyclophilin


2.5


409330 


AK001231 


Hs.53940


hypothetical protein FLJ10369


2.5


436299 


AK000767 


Hs.5111 


hypothetical protein FLJ20729


2.5


450828 


AW3B2884 


Hs.204715 


ESTs 


2.5


426793 


X89887 


Hs.172350 


HIR (histone cell cycle regulation defec


2.5


431122 


AI267593 


Hs^50535 


Homo sapiens mRNA; cDNA DKFZp434H2412 (f


2.5 


407236 


W79485 


HpJ 73980 


nuclear matrix protein NMP200 related to


2.5


421109 


L32832 


Hs.101B42 


AT-binding transcription factor 1


2.5


408770 


AWZ7O608 


Hs.170195 


bone morphogenetic protein 7 (osteogenic


2.5


416737 


AF154335 


Hs.79691 


LIM domain protein


2.5


414869 


AA157291 


Hs.21479 


ubinuclein 1


2.5


431222 


X56777 


Hs.273790 


zona pellucida glycoprotein 3A (sperm re


2.5


403532 






NM_024639:Homo sapiens hypothetical prot


2.5
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1 1 gt>43 


AI096485 


Hs.169341 


fcbis, Mooeraiei/ similar to obabi}/ aipna 


C.0 




AL135352 


Hs.255883 


ESTs, Weakly similar to I38022 hypotheti


2.5 


407867 


AA579668 


Hs.41072 


serine (or cysteine) proteinase inhibito




414555 


AW975C63 


H8.343443 




2.5


446911 


N27605 


Hs.16492 


iJ^rz.roWi>cU.c£ pfOtefri 




4351 26 


A1393666 


Ha.42315 


pIQ-blnding protein 


o c 

£0 




M24470 


Hs.1435 


guanosine monophosphate reductase


Z.3 


426788 


U66615 


Hs.172280 


OWIrarlr rcJatEu > ITIduK 355003(60, 3C0 


£.3 


4.4705Q 


MM.0163U Hs.17200 


ii j ATyMjKe protein containing and and 1 1 a 


o a 
cA 


408461 


AB037756 


Hs.45207 


hrurnntfiahVol rwn)an l/l A A 4 
IiypulllaDCB pfOlBin IMnAlaOD 






R456B9 


Hs.21889 


Homo sapiens cDNA FLJ12978 fis, clone NT


2.5 


443837 


AI984625 


Hs.9664 


spindle pole body protein


2.5 




AA4B1078 


Vte.109045 


nypotneijcai protein rU1049o 


2.5


40DV£ 1 


AA922192 


HsJ3962 


ESTs 


2.5 


435750 


AB029012


Hs.4990 


KIAA1089 protein


2.5 


435025 


TO6990 


Hs4742 


anchor attachment protein 1 (Gaa1p, yeas


2.5 


407255 


AAD12992 


Hs.256301 


hypothetical protein MGC13170


£0 


425876 


C75094 


Rs.334514 


NG22 protein 


2.5 


449458 


AI80507B 


Hs.208261 


ESTs 


2.5 


428013 


AF151020


Hs,1B1444 


hypothetical protein 


2.5


424369 


RB7622 


Hs.26714 


KIAA1831 protein 


2.5


431604 


AFJ75265 


Hs.264190 


vacuolar protein sorting 35 (yeast homol


2.5


452779 


AA4ia?75 


HM7234 


ESTs 


2.5 


4335B5 


T853D1 


Hs. 194397 


gb:yd78d06.s1 Soares fetal liver spleen


2.5 


436682 


AA354489 


Hs.222103 


EBP50-PDZ interactor of 64 kD


2.5 


42751S — 


T79526 


Hs.179516 


integral type 1 protein


2.5 


418700 


AI963808 


Hs.86970 


ESTs, Moderately similar to ALU5_HUMAN A


2.5


TABLE 67B 











Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
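
The three columns described in this legend can be read into a simple record. The following is only a minimal sketch, assuming each row is whitespace-delimited with the Pkey first, the CAT number second, and one or more Genbank accessions after that. The names `ClusterRow` and `parse_row` are hypothetical, and the sample row uses placeholder values rather than data from this table.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ClusterRow:
    """One Table 67B-style row (field names are illustrative assumptions)."""
    pkey: str              # unique Eos probeset identifier number
    cat_number: str        # gene cluster number
    accessions: List[str]  # Genbank accession numbers in the cluster


def parse_row(line: str) -> ClusterRow:
    """Split a whitespace-delimited row into Pkey, CAT number, accessions."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError("expected Pkey, CAT number and at least one accession")
    return ClusterRow(pkey=fields[0], cat_number=fields[1], accessions=fields[2:])


# Placeholder row, not taken from the table.
example = parse_row("000001 12345_1 AB000001 AB000002")
print(example.pkey, example.cat_number, len(example.accessions))
```

Rows whose accession list wraps onto continuation lines would need to be joined before parsing; that step is left out of this sketch.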

Pkey CAT Number Accession



415989 10194J BC013389 BC017398 AT023543 AA191424 AI2677O0 A1469633 AW95B46S AW953397 AA172056 BE940298 BF9D9208 BF909980 BF095153 

BG2B5B37 A1720344 BF541715 AA3550B6 AA172236 
429220 15103^7 AW341473 AA448195 AW207206 AI951341 AA969259 

412446 t 63467J BC021735 AI669212 AL12D184 A1769949 BE701002 BE1843A3 BE819G31 8G702238BF090fJ49BF9633188F9Bt912BF943013AA934514 

AA151245 BF9BD859 AA987907 Z41449 BF908059 BF90B053 BF908049 BE699424 BF908D6Q BF9B2832 BF952O20 BF963134 BIQ3553B 
BF9Q8052 BF908057 BF090026 BF943158 AI632924 BF512340 BF952Q21 BF960776 BF943437 BF942847 AI76B015 PQ9776 F04816 F02721 
AA1 02645 A1633838 AA61 7929 BFS47C1Q1 BI035448 BH935 876 AW890S37 AW69 8604 BF957405 BF963433 BG704815 

433404 7392_1 BC0229B0BF2240B1 BG149908 AWS72842 BE6706B7A1702161 AW341632 BE222503 N71836 AI026061 AW953116AW083132W979261 

AV725377 A142329B A1S40707 AW675518 A103261 1 AI818044 AI29950B AI911386 AI270418 BE219267 BM141826 AA826491 Z25159 AA557421 
N59447 Z39436T32982R54110 BF1157B3 FU9044 BFB0B433 

449625 24S224J B1918163AW779760 N4S574A1375997AA235OTBQ6»1«A!9136^1 ^ 

BG966917T613B2 R49391 R45432 AI203107 R35004 F07491 R25094 R35350 

453160 G02EL5 BC009612 NM^C03526 BI597616 AV761592 AV760377 AL601008 B1604131 BE64591B BG1877K) BG1B1S26 BG210634 BG192999 AJ263307 

AA3441 66 AW9529S6 AA033609 AA037562 AA7221 83 R79452 H70775 BF67499t BE769437 BG007B56 AAD374S3 AW572535 Ah 43991 
AAQB4581 AA033G10 AV742510 AV73578B R0B336 

420218 191547J AW958037 R42557 A1337047 AA948350 AI838005 AA459950 AI624915 AI638047 A1467856 AI521826 AAB60305 AI932315 AW003O92 AW271756 
AW7793B0 AA609879 AI634791 AW93770 AI5S521 1 Z41145 A1627952 AA303734 BE349457 AW196765 AA256527 BE089727 

437124 594D8J AUJ50013 BG939500 AW269191 AA769925 A1377973 A1625545 AAB11365 AA521 114 N24705 A1379579AA424899 AtB84671 AA829715 AI463010 

N35401 AA677452 AA504340 AI209149 AA863574 Al 379062 AIQ84455 A12B0147 AA644327 BF432508 N27873 M47364 M348B0 Al 1 47024 TB6B60 
A1219716 AA960926 H25544 BIB571 23 AW96D4B9 AA 599099 AA765246 AA328537 BG434703 AA890373 AA424765 AJ292318 AA8298B6 N95742 
A1218758 H255S8 N3S282 AA024987 N36687 BI919187H49471 AA8B9970 AW166152 AA466546 A1252S04 A14527B2 AA554458 AAS070BO 
AA724542 AW102730 AA909978 AW1 18134 AVU827241 W55431 8M1273B1 BF43S987 AB)16509 AW6B3972 BM1276B6 C15552 N63435 N51744 
T9880D M56980 BG10B636 N49381 R49B86 D51278 BI756612 AA508234 R49BB5 8F850422 

432586 8533J BC022881 AU150944BG75t»783AW754175 AVV867737AI911659A!06fl 

438869 52134_t AFD75009 R63109 R6306B 

450377 12109.1 AB033091 AL520743 BE811813 N53332 N99716 A1561910 AA280655 BE710392 AV705100 AW293978 AW444558 AA281459 AI679751 A1873695 

SG700891 B15S3517 R8051B BG779771 BG534451 AA479402 AW9615B0 BF061430 AJ957643 AI768496 AW51211 8 AA479302 AW770384 
AW07247O A1041696 AJ 049 699 AW592865 AA976261 BE879747 BG114119 AW1B3S11 AAS09938 BF571621 BF350794BF381375 BE9256S9 
AL050294 BC010371 BF982270AL042656 BF095732 AW81261B BF095731 BG212397 BF678765 B 1038602 BG388664 AW675337 BG289398 
BE939598 AW805570 AA527097 AW150S40 A1693720 AA743354 AI915793 N4B18S AI573107 AA043474 A1351615A19G9490 A1910763 R50866 
A1699181N73808H08164 

436063 5463_1 AK000028 AA4544B3 A1298674 AA720773 AW61 629 A1B84670 A1936202 AW294235 D61652 BF881 184 AV71 1384 N27154 A1B26970 AV734970 

N40094 N28596 AA8B4747 AA51 2890 BG436 593 
428342 6712_1 AK056315 AI015524 AA724079 BI713619 A1377728 AW293682 AI928140 AJ092404 AI06S630 AA731 340 BM469B29 AW96B804 AA425658 

AA769094 BF446026 AW11871 9 A1332765 AW500883 AW576555 AI859571 AW49S6B4 AW614573 AW629495 AW505314 W74704 AI356361 

A1923640 AWO7D509 A1521500 AL042Q95 AA609309 AA761319 AI381489 R45700 AA761333 AW265424 AA909524 AA635311 AA649040 

A1392620 Z4O70B AJ985564 AW263513 AA913892 AI693486 AW263502 AI806164 AW291 1 37 BI061872 BI059498 AA134476 AW0B48B8 AA036967 

AW370823 T55263 BI0O2756 AA489664 BF827261 W74741 BF963166 
424036 6226_1 NNLD33445 BC001193Aia8576l BF794032AA476620 AAB10906 AAB10905 AI291244 A18B50S7 AJ359708A1335629 H97395 AI344589 AA300377 

AA457566 AW771833 BE465621 AI364068 A1364452 AE48505 A1918342 A1928670 AAB865B0 AL531029 AA8B6344 At186419 8G329096 

BM045465 AL531028 BG437151 BE86B021 AA179427 
450203 19009.1 AK055952 BG182168 BG220105 BG 19^569 BG1B8964 BG18738B 8G 220104 BG1B3714 BE64S99B AI819354 AW974068 A 1 333 635 A1580846 

A1024796 AWQ2Q098 BI491 127 A1393644 N74993 AW472959 BM47B854 B1597437 H121 65 BI458612 BE543192 
451752 10408.5 AB032997 Al 141878 AW978722 6E4671 19 AI761 408 BF7273B5 AW23703S AI934521 BF436248 AI479688 Z40632 AA8320B1 AW295901 

BF057835 BE466977 A1621269 BE465983 BF756369 N74056 AIB17896 AA716567 AA934774 H62600 H09497 BF943762 BE3S5335 B£8B3333 
432363 1234917J AW970240 AA534489 AW970323 
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422890 61426J 



417379 1610005 J 
407619 7392.2 

419733 761^.3 



432B75 1237917J 
. _ 447620 687223.1 
15 409151 -4123J 



20 409960 39576J 



25 433891 647290.1 
414922 1563 J? 



30 414222 18695J 
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40 
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426991 29771.1 



434194 626BOJ 
432908 452541 1 
412652 18658L2 



437179 12239.1 



41816G 18856.1 



43141B 12D918J 
4478B1 44623.1 



407192 220Q202J 
434747 117643.1 
410297 299QJ 



424339 50599.1 



AK057B05 AW1 62343 AM 90479 AI093318 BE04B820 AI1 98397 AA654667 BE219303 Z393S1 F02655 Z28734T16575 F10145Z45266AW572911 
AW964436 AWOO4O30 A1632565 BE502530 BI792363 BF05692B AA449241 A1651 825 AAB05324 AI264B63 AW196918 M943267 A1953735 
AI263703 BF056387 AW594171 A1867447 AA319159 AI903440 AW9561 10 AI366013 AI867923 BG91 1906 D81 H2C15616 AL53B697 Z25032 
Z43784R13382AV746924 AA449369AA318ei5BF364265H17038H10064RM161 T87230 R40698 AW204071 BIB19428AA683393AA683376 
243192T74078T05103 F12527T77951 Ht0118M7037 BF855407 RL9603 
M196390 AA507837M196468 

AK056626 AtB0G896BF939O22BE644718 A1954754 BE218177 BE348567 A1962406AW293122 A1968798 AI457321 BE327226 BG913531 
AW939055 T3Q280 R54166 243366 BGB19153 BF003119 BE646274 BF940B81 R18246 R42185 

AK027321 WG3676 W63789 BE046412 BF114614 BE646183 BM1 26230 AW044233 A1951970 AW66354B AJ139947 AA514302 AAS46232 
BM126251 AA789002AA581966 AA809643 AW188B70BE706664 BE70S539 BE153177 BF084925 AL133779 AW961788 AA659693 AA347970 
AA295134 AA526037 AA4492B2 BG190454 H61476 T91396 N2001B T90114 H75644 BE710736 BF687723 H385B1 AA249370 BF72669B 
BE&41554 BKM5099TB4625 AW129678 BG770826 
AW973834Al791932AJ791855Al732640AA558833AA559897Al82t610 
A1973051 AI4Q0921 A1796154 AW241317 AW2S0951 

NMJ)04892 AF047442 BE27533B BF724863 BI917206 BE276993 AL6O2308 AA3O6105 BM1525G5 8C001 364 AW993471 AW993481 AW993283 
AW99291 9 AW992921 AW9&29S0 AW992361 AW993220 BG573124 AA456385 AA234796 AI902726 AA354813 BI092644 BG77B400 BI260001 
BG007325 A1267455 AA426574 AJ 160782 A14721B6 AA2555D0 AA434006 BG435520 A135611 1 H00S25 AV749060 BG944497 BG292031 
AA902153 C04925 AA90216O AA383100 AW073533 AA256706 AA150809 R65766 AW95B448 BE090972 BFS93195 AV738979 R65855 R80136 
AA484677 

BE644758 AI082238 BF940027 A1201079 AI436035 AW275966 AJ085394 A1291655 AW070441 AI474134 AI26B97B AT768279 AI567682 AA693941 
BF477668 AW664149 AA2837B2 BFE0953B AW296BB8 Al 268977 ATI 68133 BM352065 AI262769 BF941976 AI056920 AA4B1861 BF763697 
AL5658BB BM352383 AA427768 AA3B5343 AI185986 AA931831 AA134972 BF217480 BF111012 AA908246 AA319949 AA318136 AL514271 
BF364291 AL515057AV702312 AA377395AL544217A1341000 AW193583 A1350789AA888338 BF9453BOAW879092 M1 30839 T9t 066 N92326 
AKXW389 AA078832 AL572370 W04622 BE314003 AW9 60808 BM360872 AA31 91 60 AA 130778 AL514257 
AW1B2329 AA613792 T053Q4 AW85B385 

BG107484 AA632009 AI432670 AI656660 W650B84 AI521919 A! 264653 AW150793 AW611894 AI91 7098 BI091245 AI6S1454 BF434B89 A1580286 
AI880735 BE301995 AI392959 AW61 3965 BM023628 AW515374 AI4601 02 BMQ23318 BE328188 AI952820 AI581 363 AA557165 AI695677 
AL562079 AI70G926 AI470561 BF063O56 AW1963B7 AU1329B4 BI06404S A1970157 R02122 H55924 AI521721 AA808206 AA726223 AT766003 
AW339821 AA605951 A1287969 AW664B27 

BC021085 A1527872 A152S296 AL557DB7 B1255090 AU143499 AL56035S BGB23170 BE7369B8 AU1413B8 AL5B0262 BI764173 AU120299 
AU141755 AU141988 AU1 42941 AU141330 AU141784 BI770B85 BIS11394 BE901426 BI918039 BG750842 BE883026 BI254740 &E732690 
BG430761 BE792868 AW328267 ALt35173 AA102674 BF726986 AL564735 BE155962 BE155979 BF741 679 H67776 H59234 H89655 AW117774 
AW274435 129008 NMJW3104 U 07361 BFO02B24 BI222949 AJ458045 AW951537 BFJ96474 AI66S049 AJ042523 BE4Q3928 AU 160125 AU160SS0 
AW069B77 AW015214 A1948718 BE219706 AJ953605 BE217755 AI9913B2 AI245005 BF431179 AI521843 BE04B908 AJ420449 AW275385 
AA33695D BE501 521 BF740566 AA31 1404 AA384639 Br772535 BG47307G BE891298 BE246928 BM01 2986 BE242693 BE901342 8G745358 
TABLE 67C



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons.
TABLE 68A: 995 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES



Table 68A lists about 995 genes up-regulated in prostate cancer compared to normal adult tissues. These were selected from 9009 probesets on the Affymetrix/Eos Hu03 GeneChip
array that showed some expression in human xenograft tumors and cell lines. The 995 genes were selected such that the ratio of "average" prostate cancer to "average" normal adult
tissues was greater than or equal to 2.5. The "average" prostate cancer level was set to the 85th percentile amongst prostate cancers. The "average" normal adult tissue level was set
to the 85th percentile amongst non-malignant tissues. In order to remove gene-specific background levels of non-specific hybridization, the 10th percentile value amongst
non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
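
The selection rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure actually used to build the table: the function names are hypothetical, the percentile interpolation rule is an assumption (the text does not specify one), the handling of a non-positive denominator is an assumption, and the expression levels are made up.

```python
def percentile(values, pct):
    """Percentile by linear interpolation (interpolation rule is an assumption)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("need at least one value")
    rank = (pct / 100.0) * (len(ordered) - 1)
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def upregulation_ratio(cancer_levels, normal_levels):
    """Background-subtracted ratio of "average" cancer to "average" normal level."""
    # Gene-specific background: 10th percentile amongst non-malignant tissues.
    background = percentile(normal_levels, 10)
    # "Average" levels: 85th percentile within each group, minus background.
    numerator = percentile(cancer_levels, 85) - background
    denominator = percentile(normal_levels, 85) - background
    if denominator <= 0:
        # Assumption: the text does not say how a zero denominator is handled.
        return float("inf") if numerator > 0 else float("nan")
    return numerator / denominator


def is_upregulated(cancer_levels, normal_levels, threshold=2.5):
    """True when the ratio meets the 2.5 cut-off stated in the text."""
    return upregulation_ratio(cancer_levels, normal_levels) >= threshold


# Made-up expression levels for one hypothetical probeset.
cancer = [110] * 10
normal = [10] * 7 + [50] * 3
ratio = upregulation_ratio(cancer, normal)  # (110 - 10) / (50 - 10) = 2.5
```

With these made-up values the background is 10, the two "average" levels are 110 and 50, and the probeset sits exactly on the 2.5 threshold, so it would be selected.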

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
protein kinase C, mu

RNA, U2 small nuclear
ESTs
Homo sapiens prostein mRNA, complete cds

pyruvate dehydrogenase phosphatase

adenosine monophosphate deaminase (isofo

novel C3HC4 type Zinc finger (ring finge

cystathionine-beta-synthase

L-kynurenine/alpha-aminoadipate aminotra

protein kinase C, iota

ESTs

DKFZP5641052 protein
ESTs

Homo sapiens mRNA; cDNA DKFZp434E082 (fr
ESTs

Homo sapiens cDNA FLJ13752 fis, clone PL

homeo box B13

cadherin-like protein VR20

Homo sapiens prostein mRNA, complete cds

ESTs

poly(A)-binding protein, cytoplasmic W
homeo box A9

polyadenylate binding protein-interactin
ESTs

programmed cell death 5

collagen, type II, alpha 1 (primary oste

metallothionein 1E (functional)

ESTs, Weakly similar to I38022 hypotheti

KIAA1255 protein

kallikrein 4 (prostase, enamel matrix, p

UDP-N-acetyl-alpha-D-galactosamine:polyp

ESTs

ESTs

eyes absent (Drosophila) homolog 2

annexin A3

ESTs

chromatin assembly factor 1, subunit A (

hypothetical protein MGC3199

nectin 3; DKFZP566B0846 protein

Homo sapiens cDNA FLJ12540 fis, clone NT

early growth response 2 (Krox-20 (Drosop

hypothetical protein FLJ20287

ESTs

ESTs

ESTs, Weakly similar to G02075 transcrip
ISL1 transcription factor, LIM/homeodoma
ESTs

hypothetical protein MGC4309

Homo sapiens mRNA; cDNA DKFZp564K143 (fr

ESTs
442772 AW503680 Hs.5957 Homo sapiens clone 24416 mRNA sequence 7.1

425211 M18687 Hs.1867 progastricsin (pepsinogen C) 7.1

431958 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta 7.0

408418 AW963897 Hs.44743 KIAA1435 protein 7.0

436063 AK000028 ribosomal protein S24 7.0

412350 AI659306 Hs.73826 protein tyrosine phosphatase, non-recept 6.9

414664 AA587775 Hs.66295 multi-PDZ-domain-containing protein 6.9

428465 AW970976 Hs.293553 ESTs 6.9

436476 AA326108 Hs.338» bHLH protein DEC2 6.8

431585 BE242803 Hs.262623 hypothetical protein FLJ10326 6.8

434808 AF155106 Hs.256150 Homo sapiens, Similar to RIKEN cDNA 2810 6.7

432103 T15803 Hs.272458 protein phosphatase 3 (formerly 2B), cat 6.7

4560BS BE177320 Hs.156148 hypothetical protein FLJ13231 6.6

429084 AJ001443 Hs.195614 splicing factor 3b, subunit 3, 130kD 6.6

422486 BE514492 Hs.117487 gene near HD on 4p16.3 with homology to 6.6

410227 AB009284 Hs.61152 exostoses (multiple)-like 2 6.6

453439 AI572438 Hs.32976 guanine nucleotide binding protein 4 6.5

410037 AB020725 Hs^8009 KIAA0918 protein 6.5

420297 AI628272 Hs^8323 ESTs, Weakly similar to ALU1_HUMAN ALU S 6.4

409648 AW451449 Hs.57749 ESTs 6.4

448148 NM_016573 Hs.20509 HBV pX associated protein-8 6.4

410268 AA316181 Hs.61635 six transmembrane epithelial antigen of 6.4

422134 AW179019 Hs.112110 mitochondrial ribosomal protein L42 6.3

415068 Z19448 Hs.131887 ESTs, Weakly similar to T24395 hypotheti 6.3

404632 NM_022490:Homo sapiens hypothetical prot 6.3

412935 BE267045 Hs.75064 tubulin-specific chaperone c 6.3

453308 AW959731 Hs.323099 ESTs 6.3

449048 Z45051 Hs.22920 similar to S68401 (cattle) glucose induc 6.3

410762 AF226053 Hs.66170 HSKM-B protein 6.3

436032 AA150797 Hs.109276 latexin protein 6.3

456608 AA502764 Hs.123469 ESTs, Weakly similar to AF208855 1 BM-01 6.3

413328 Y15723 Hs.75295 guanylate cyclase 1, soluble, alpha 3 6.2

425071 NM_013989 Hs.154424 deiodinase, iodothyronine, type II 6.2

424036 AA770688 H2A histone family, member L 6.2

418246 AA088767 Hs.83883 transmembrane, prostate androgen induced 6.2

450861 AI523898 Hs.17617 ESTs 6.1

418821 AA436002 Hs.183161 ESTs 6.1

458609 AW972512 Hs.20985 sin3-associated polypeptide, 30kD 6.1

432527 AW975028 Hs.102754 ESTs 6.1

433730 AK002135 Hs.3542 hypothetical protein FLJ11273 6.1

449845 AW971183 Hs.6019 DnaJ (Hsp40) homolog, subfamily C, membe 6.1

420943 AB016898 Hs.100460 myeloid/lymphoid or mixed-lineage leukem 6.1

425704 U79293 Hs.159254 Human clone 23948 mRNA sequence 6.0

458440 AKS546B Hs.135254 Homo sapiens clone 1 thrombospondin mRNA 6.0

419647 AA348947 Hs.91816 hypothetical protein 6.0

455497 AA112573 Hs.276695 Homo sapiens prostein mRNA, complete cds 6.0

428593 AW207440 Hs.185973 degenerative spermatocyte (homolog Droso 5.9

400303 AA242758 Hs.79136 LIV-1 protein, estrogen regulated 5.9

446416 AV656299 Hs.163959 ESTs 5.9

407819 R42185 ESTs 5.9

4303B7 AW372884 Hs. 24 0770 noctearcapfcinding protein subumt 2, 2 5,9 

410102 AW248508 Hs.279727 ESTs; homologue of PEM-3 [dona savlgnyi S.9 

429165 AW0098B6 HG.11825B prostate cancer associated protein 1 5.9 

_ _ 417B73 BE266659 Hb_293659 Homo sapiens. Simlar to RJKEN CDNAA430 5.3 

55 410240 AL157424 Hs.61289 synaptojanln 2 5.8 

421305 BE397354 Hs.324830 dlptharia toxin resistance protaii requi 5,3 

451406 AI694320 Hs.6295 ESTs, Weakly slmKar to T17248 hypotheO 5.8 

418795 AJ497778 Hs.20509 HBV pX associated proten-a 5.7 

^ 427701 AA41H01 HsJ243BQ6 rrariear autoanEgrciic sperm protein (his 5.7 

60 45146B AW50339B Hs.293663 ESTs, Moderately similar to 138022 riypot 5.7 

433852 AI37632B H&126629 ESTs 5.7 

432882 NMJH3257 Hs.279696 seiurr^ucocorfjcoid regulated klnsse-tx 5.5 

419440 AB0206B9 Hs.90419 KJAA0882 protein 5.6 

. ^ 435706 W31254 Hs.7045 GL0O4 protein 5.6 

65 431583 AL042613 Rs.262476 S^denosylmelhfenlne decarboxylase 1 5.6 

442409 BE208843 Ha.129544 hypottiettetf protein MGC1643A 5,6 

453171 R76472 Hs.65646 ESTs 5,6 

437252 At433833 Hs.164159 ESTs, WeaMy similar to ALU1 JJUMAN ALU S 6.6 
422083 NM_001141 Hs.11 1256 arachldonate 15-Bpoxygenase, second typ 5.6 
70 429838 A1916662 Bs.211577 ktnectln 1 (klnesin receptor) 5.5 
403045 NM_005656*:Homt> sapiens transmembrane pr 5.5 

447805 AW627932 Hs.302421 gemln4 5.5 
443162 T49951 Hs.9029 DKFZP434G032 protein 5.4 
__. 418555 A1417215 Hs.B7159 hypothetical protan FU12577 5.4 
75 437162 AWCD5505 Ks.5464 ftyrad hormone receptor coacthratlng pr 5.4 
419465 AW50Q239 Hs.21187 Motno sapiens cDNA: FU23D8B fis. done L 5.4 
440146 AW014231 Hs.90790 Hcmosaplerts cDcM: FLJ22930fls, ctona K 5.4 
427308 D26067 Hs.174905 KOAA0O33 protein 5.4 
409151 AA308105 SEC22, vesicle trafficking protein (S. c S.4 

80 432B40 AK001403 Hs.27952t hypclhellcal protein FU2053Q 5.4 
423242 AL039402 Hs.1 25783 DEME-6 protein 5.3 
453032 H18835 Hs.31608 hypothetlcat protein FU20041 5.3 
441021 AW578716 Hs.7644 H1hbtonefamiry t nierhhBr2 &3 
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415276 U88666 Hs.7B353 SFRS prcten Wnase 2 5.3 

400290 H1S836 Hs.31608 hypottiefcd protein FU2Q041 5^3 

432435 BE21888S Hs.282070 ESTs 5.3 

_ 417318 AW953937 Hs.240B45 ESTs 5.3 

5 429467 NMJJQ4477 Hs.203772 FSHD region gene 1 5,3 

416276 U4106O Hs.79136 UV-1 protein, estrogen regulated 5u3 

410076 T053B7 Hs.7991 ESTs 5.3 

420039 NWL004605 Hs.94581 sulfolransferase family, cytosollc, 2B, 5.2 

^_ 423445 NM.014324 Hs.12B749 alpha-me1hy!acy1-CoA.racema5e 5.2 

10 436420 AA443966 Hs^1595 ESTs ^2 

414085 AA114016 Hs75746 aldehyde dehydrogenase 1 family, member 5.1 

422048 NM_012445 rfe.288126 spcnrfin 2, extracellular matrix proleb 5.1 

427176 AW3B1569 Hs.40334 ESTs 5.1 

450B32 AWg70G02 Hs.105421 ESTs 5.1 

15 448B07 AJ571940 Hs7549 ESTs 5.1 

420568 F09247 Hs.247735 protocadherln alpha 1D 5.1 

421685 AF189723 Hs. 106778 ATPase. Ca transporting, type 2C, member 5.0 

427615 BE41M07 Hs.179817 Cd-82 protein, PSDR1 5,0 

430598 AK0Q1764 Hs.247112 hypothetical protein RJ 109 02 5.0 

20 452578 AB023177 Hs.2S900 KWA0960 protetn 5.0 

431724 AA514535 Hs.283704 ESTs 5.0 

409757 NM-001898 Hs.123114 cystaftiSN 5.0 

432242 AW022716 Hs,162160 ESTs, Weakly simBar to ALU4J1UMAN ALU S 5.0 

. „ 407103 AA4248B1 Hs.256301 hypothetical protein MGC13170 5.0 

25 421154 AA284333 Hs.287631 Homo sapiens cDNA FU 14269 fis.ctone PL 5.0 

400301 X03636 Hs.1657 estrogen receptor 1 5.0 

426006 R49031 Hs.22627 ESTs 5.0 

452679 Z423B7 Hs.83853 transmembrane, prostate androgen induced 4.9 

^ 446BB0 AIB11807 Rs.108646 HomosapienscDNAFU14934fis,ctonePL 4.9 

30 444108 R55784 Hs.140942 ESTs 4.9 

452259 AA317439 Hs.28707 signal Eequence receptor, gamma (transb 4.9 

419033 A1479560 Hs.98613 Homo sapiens cONA FU12292 tts, done MA 4.9 

416B54 W0164 Hs.80296 Putffnje call protein 4 49 

422390 Z43784 ankyrin 3, node of Ranvier {ankyrln G) 4.9 

35 428330 L22524 Hs.2256 matrix meidloprotdnase 7 (mairitysin, 4.9 

419168 A1336132 Hs.3371B Homo sapIerecDNAFLJ 12641 fts, clone NT 4.9 

452017 AF109302 Ha.27495 prostate cancer associated protein 7 4.9 

456362 AW973003 Hs.179909 hypothetical protein FU22995 4.9 

442501 AA315267 H6.23128 ESTs 4.8 

40 439735 A1635386 Hs,142B46 hypothetical protein 4.8 

451752 AB032937 KIAA1 171 protein 4.8 

423583 AL122055 Hs.129836 WAA1 028 protein 4.B 

446157 BE270828 Hs.131740 Homo sapiens cDNA: FU22562 fe, done H 4.8 

432363 A&534489 gbmf76g11.s1 MCLCGAP_Co3 Homo sapiens 4.8 

45 433313 W2012B Hs.295039 ESTs 4.8 

4051 Oi AW9SB504 Hs.123073 CDC2-refeted protein kinase 7 4,B 

453857 AU080235 Hs.35861 DKFZP5B6E1 621 protetn 48 

451982 F13036 HsJZ7373 Homo sapiens mRNA; cDNA DKFZp5B401763 (f 4.7 

__ 447574 AF162666 HS.188GS tousted-tke kinase 1 4J 

50 429299 AI620463 Hs.347408 hypothetical protein MGC13102 4,7 

420522 AW957137 Hs. 98541 hypothelfeal protein 4.7 

417379 AA196390 gb:zp99b1Q,sl Stratagene muscle 937209 H 4.7 

411031 W37943 Hs.34B92 K1AA1 323 protein 47 

. 431725 X65724 Hs.2839 Norrfa disease {pseudogRama} 47 

55 441224 AU0769G4 Hs.7753 calumenin 4.7 

407B13 AL1 20247 Hs40109 KIAA0872 protein 4.7 

440074 AAB63045 Hs.10669 ESTs, WeeJdy similar So T0005O hypothec* 4.7 

430294 AI53B226 Hs.32976 guanine nucleotide binding protein 4 47 

415082 AA1 60000 Hs.137396 ESTs, WeaMy similar to J C523B galaclosy 47 

60 414922 DO 0723 glycine cleavage system protein H (amino 47 

430945 U8Q569 Hs.55999 NK hamecbox {Drosophlla), family 3. A 47 

43B825 BE327427 HsJ9953 ESTs 47 

425174 087450 Hs.154878 KJAA0261 protein 47 

420360 AA640891 Hs.102406 ESTa 47 

65 410193 AJ132592 Hs.59757 zmcfinger protein 2JB1 47 

440300 N397B0 Hs.8859 Homo sapiens. Similar to RIKEN cDNA 5630 46 

436761 A1B17776 Hs336S57 ESTs 46 

410275 U85658 Hs.6179B transcription factor AP-2 gamma {actrvat 46 

404922 NrVL003071:Homo SBplens SW1/SNF related, 46 

70 436556 A1364997 Hs.7572 ESTs 4.6 

429302 AU076674 H3.198B99 eukaryohc translation inifMon factor 46 

447595 AW379130 Hs.1B953 phosphodiesterase 9A 46 

433006 BE242758 Hs.190223 ESTs, Moderately similar to T292B5 hypot 4G 

__ 407894 AJ278313 Hs.41143 phospholi»osilio^[}6drTcphospholipase 4.5 

75 418818 AA22BB99 Hs.101307 Homo sapiens HUTU protein mRNA. partial 45 

433556 W56321 Hs.1 11460 catelurr^m«!uRn-dependent protein km 45 

443123 AA09453B Hs.272808 putative transcription regulation nudea 45 

419991 AJQOD098 Hs.94210 eyes absent (Drosophila) homolog 1 4.5 

_ . 419594 AA013051 Hs.91417 topoisomsrase (DNA) II binding protein 4.5 

80 453390 AA86249& Hs.28482 ESTs 4.5 

417061 AI675944 Hs.186691 Homo sapiens cDN A FU12033 Ms, clone HE 4.5 

419239 AA46B163 Hs.184598 Homo sapiens cDNA: FU23241 Us. clone C 45 

43GB73 N23874 Hs.50477 RAB27A, member RAS oncogens family 45 
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417958 


AA7673B2 


Hs.1 93417 


ESTs 


4.4 




Arid i «J4 




HcnrtD sapiens clona 25194 mRNA sequence 


4*4 




AVYOUQU/o 


u. mf ace 
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4*4 
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4,4 
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AI923627 
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ESTs 
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426023 
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Hs.1 72028 


a <f sintagrin and mataJBoprotetnase doma 


4.3 


462747 BE153855 Hs.61460 Ig superfamily receptor LMIR 4.3

429269 AA4HU4bu RakophUh 4.3

452092 BE245374 Hs.27842 hypothetical protein FLJ11210 4.3

437367 BE277414 Hs.5947 mel transforming oncogene (derived from 4.3

469284 AF155660 Hs.300496 mitochondrial solute carrier 4.3

417067 AJ001417 Hs.81086 solute carrier family 22 (extraneuronal 4.3

423453 AW450737 Hs.128791 CGMH protein 4.3

407137 T97307 gb:ye53h05.s1 Soares fetal liver spleen 4.3

448826 AI589252 Hs.29324o ESTs, Weakly similar to putative p150 [H 4.3

419713 AW9G805B Hs.92381 nudix (nucleoside diphosphate linked moi 4.3

427359 AWU2G762 Hs.79881 Homo sapiens cDNA: FLJ23005 fis, clone L 4.3

450402 Bb2iBQ27 Hs.69969 ESTs 4.3

428055 AA420564 Hs.10T760 ESTs 4.3

429441 AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 4.3

432676 AI791655 ESTs 4.2

439963 AW247529 Hs.5793 platelet-activating factor acetylhydrola 4.2

445707 AJ248720 Hs.114390 ESTs 4.2


419297 


A A A JR74 A 

AA148710 




kimtean 


42. 


416737 


AF154335 


Hs.79691 


L1M domain protarn 


4.2 


430255 


AKO 00703 


Hs .323822 


Homo sapiens mRNA for KtAAl551 pratetn, 


4.2 


421829 


AB018330 


H5.10Q708 


caldijrn/calmodunn-dependeiit protein kin 


4>2 


447620 


AW290951 




ESTs 


4.2 


426647 


A A Ai« If* A 

AA243464 


Hs. 294101 


pre-E-cell laukem'a (ranscriptkn factor 


42 


432885 


BE159028 


Hs .279704 


chromatid accessibility comptox 1 


42 


446237 


AW270515 


Ks.1 49596 


Homo sapiens, SknBar lo RIKEN cONA 2310 


4.2 


421878 


AA299652 


HS.1 11496 


Homo sapiens cDNA FU11643 fis, dona HE 


4.2 


-423337 


till ftrtjccr" 

NMJQ04655 


Hs. 1273 37 


axai 2 (ccflductin, axil) 


4.2 


435854 


AJ27B120 


Hs-4996 


putative ankyitn-repeal conlaring prole 


42 


427723 


A) 355 260 


Hs.279789 


filstone deace^rlase 3 


42 


430122 


N&L013342 


Hs.233765 


TCF3 (E2A) fusion partner (m chikJhnwf 


42 


416140 


A1918035 


Hs.301198 


roundabout (axon guidance receptor, Dros 


4.2 


428231 


HI 7969 


Hs. 183105 


nuclear autoantfnen 


4.2 


447966 


AA340605 


Hs.105887 


ESTs, Weakly similar b Homolog of rat Z 


41 


432274 


AK000382 


Hs.274251 


rrypothebcal protam FU20375; KJAA1797 


4.1 


432621 


A) 298501 


Hs.21192 


ESTs, VVeaWy simHar to T46428 hypotheH 


41 


409063 


BE0BQ548 


rte,42346 


cafclnsu in -binding protein catearcuvl 


41 


407192 


AASO9200 




gfa^fl2s02^1 SoaresJeeSsjyHT Homo sap 


4.1 


426716 


ft II MMTA 

NMJXJ6379 


Hs. 17 1921 


sema tomato, fmrnunoglobuRn doma^i (ig), 


4.1 


427479 


BE410092 


Hs.176471 


WAA07B8gene product 


4.1 


434194 


AF1i9B47 




Homo sapiens PR0 1550 mRNA, partial cds 


4.1 


422072 


a r\A4orvrr > 

ABO 18255 


Hs.111138 


K1AA0712 gene product 


41 


411145 


BE439553 


HS.25U52B 


Homo sapiens, done IMAGE:4098694. mRNA, 


41 


449459 


BE546846 


Hs.1 95048 


ESTs 


4.1 


421662 


NM.014141 


Hs.106652 


ceQ recognition molecule Gaspr2 


4.1 


445309 


AU57474 


Hb.12504 


likely orthobg of mouse Arkadia 


41 


412520 


AA442324 


Hs.795 


HSAhlstonelamEy, member O 


4.0 


448663 


BE614599 




hypothetical protein MGC14797 


4.0 


425689 


W16480 


hte.24283 


ESTs, Moderately similar b reduced expr 


40 


433647 


AA603367 


HSw222294 


EflTs 


4.0 


426030 


BE243933 


Hs.106642 


zinc finger protein 22 {KOX 15) 


40 


416838 


AW385224 


Hs.35196 


ectonideotkie pyTophospnatase^osphodf 


40 


439B20 


AL3o02D4 


Hs. 233853 


Htwno sapiens mRNA lull length insert cON 


4.0 


422634 


NMJJ16D10 


Hs.118B21 


CGV62pro!ein 


4.0 


453469 


AB014533 


H&33O10 


KWA9633 protein 


4.0 


418827 


BE327311 


Hs.47166 


HTO21 


4.0 


446791 


A1632278 


Hs.195922 


ESTs 


40 


443884 






ianfln HfflwiiW 

wpvn reccpiur 


a n 


416857 


AA188775 


H3^92453 


ESTs 


40 


401519 


AA428202 




ClsrjG0476*:gIJ12737Z79|reflKP_012l63.1l 


4.0 


407846 


Hs.40403 


Cfcp/p300-1ntaracting transazlivator, wit 


4.0 


431689 


AA3Q5688 


Hs.267695 


UDP-GdibetaGIcNAc beta 1.3-galaclosyl(r 


3£ 


406784 


AW971350 


Hs.63386 


ESTs 


3L9 


435655 


AW1056G3 


Ks.6947 


HSPOQ69 protein 


3.9 


432615 


AA557191 


HS.5S028 


ESTs, Weakly staiHar to 154374 gene MF2 


3.9 


448804 


AW512213 


Hs-342849 


ADP-rlbosylaflon factor-like 5 


3.9 


430389 


AL117429 


Hs_24flB45 


DKFZP434D146 protefn 


3.9 


432278 


AL137508 


Hs.274256 


hypothetical protein FLI23563 


3,9 


414869 


AA157291 


Hs.21479 


ubinuddn 1 


3.9 


429163 


AA884766 




glram2Da10£l SoaresJ^FUJjGBC.Sl Homos 


a9 
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444190 


A1B7B918 


Hs. 10526 


cysteine and glydne-rfcft protein 2 


3.9 


44o»y 1 


VII JE jyVMrA 

NNLw225u 


Hs. 10082 


potassium I ntsrmediata/smafi conductance 


3.9 


433339 


AFU192Z6 


H5.B035 


glioblastoma ovorex pressed 


3.9 


435021 


AA922192 


Hs.73982 


ESTs 


3.9 


412834 


R77123 


Hs.79881 


Homo saplsns cDMA: FLJ230QB fc, done L 


3.9 






Hs.Zflaool 


rtomo sapiens cUNA: rLJZZBsb tis, clone H 


3J& 


4Uoti33 


AW512232 


Hs.254835 


ESTs 


3.9 




AABIoao/ 


Ks.looUo 


Homo fiepfens cDNA FUi2187 fis r ctons MA 


3.9 


4561 77 


NML012391 


Rs.79414 


prostate epittie rum- sped fie Ets transcr 


3.6 


433a4b 


AICD4£iC 

AIG61545 


H3.1 52982 


hypothetical protein FU13117 


3.8 




AIAI4C4 jTrn 

AWlbi45D 


rfS,iU32Ui 


CGI-8G protein 


3.8 


434614 


A1249502 


1 1_ one 

HS.2SG69 


ESTs 


3.8 


425910 


AA630797 


HS.1B4760 


C CAAT-bo*-bfti ding transcifpiion factof 


3.8 


432370 


AA30B334 


Hs.274424 


N-ace^lneuraminic add phosphate syntha 


3.8 


413010 


AA393273 


Hs.75133 


transcriph'on factor 6-tlke 1 (mltochond 


3.B 




AAr/o4oo 




mStocnondrfa ri bosom a protein 19 


3.6 


439 D24 


R96096 


H835596 


ESTs 


3.8 


426891 


AK001536 




Homo sapiens cDNA FU10B74 fis, done NT 


3.8 


427871 


AW992405 


Hs.59622 


Homo sapiens, clone IMAGE: 3507281, mRNA, 


3.8 


439671 


AW162B40 


Hs.6641 


kJnesJn femHy member 5G 


3.8 


418113 


AI272141 


H3.83484 


SHY {sex datBrninng region Y)-fcox 4 


3.8 


422424 


Al 186431 


Ks296638 


prostate tfffe re n Nation factor 


3.8 


447439 


AA313565 


Hs.145020 


ESTs, WeaWy slrnBar to KIAA1 205 protein 


3.8 


414178 


AW957372 


Hs.46791 


ESTs, Weekly similar to I3B022 hypothefl 


3.8 


427099 


AB032953 


Hs.173560 


odd OzAerMH homotog 2 (Drosoriiila, mans 




4O790B 


BE379758 


Ha. 11 0853 


unchsfactarized hem&tDpoie!fc stem/proge 


a8 


440 585 


AWDB8363 


Hs-246240 


ESTs 


3LB 


429586 


AI071613 


Hs^B538 


Homo sapiens cDNA: FU210S5 fe, done C 


18 


447533 


NM_004786 


Ha .16792 


tirtcfodoxm-Jike, 32kD 


3.6 


411190 


AA306342 


Hs.69171 


protein kinase (Mike 2 


3.6 


426126 


AL1 18747 


Hs.26691 


ESTs 


3.8 


434263 


N34395 


Rs44648 


ESTs 


3.8 


41440-7 


AA147026 


Hs.76704 


ESTs 


3.7 


432426 


AW973152 


Hs.31050 


ESTs 


3L7 


425770 


NMj014363 


Hb.159492 


spasfc ataxia of Ch artevoix-Seg u e nay {s 


3.7 


432579 


ARM3244 


HbJ278439 


nucfeolar protein 3 {apo ptosis repressor 


3.7 


444101 


R19175 


Hs. 1697 93 


ribosomal proteki L32 


3.7 


432908 


AJ861896 




ESTs 


3.7 


432600 


BE391045 


Hs.278962 


AIM-1 protein 


a7 


425966 


BEDB1342 


Hs, 283037 


HSPC039 protetn 


3.7 


419972 


Ai rtA-4 a 

AL041465 


Hs.182982 


golgliv-67 


17 


410001 


AB041036 


Hs.57771 


kalMceln 11 


3.7 


432205 


AIB065B3 


Hb.1 25291 


ESTs 


3.7 


401197 






EMSP00000229263*^SPC213. 


3.7 


420061 


AW024937 


Hs.29410 


ESTs 


3l7 


431103 


M57399 


Hs.44 


pfekrtrophTn {heparin hlnding growth toe 


3.7 


433577 


AW0070BO 


Hs.284132 


ESTs 


3.7 


435017 


AA336522 


Ks.12854 


angiotensin II, type 1 receptor-assoclat 


3.7 


420 B0 5 


LI 0333 


Hs.99947 


tBfculonl 


17 


401666 






Target Exon 


17 


457183 


i IA4 nan 

H918B2 


Hs.1 18569 


Drt-Nndfog protein 1DAX (InlilbrikJii o( 


17 


4279 82 


Nf4JD16156 


Hs.1 81326 


WAA1073 protein 


17 


405367 






M^_022170*:Homo sapiens V/illams-Beuren 


17 


449933 


14114 * 'jflnn 

AW157098 


Hs.324104 


Human DMA sequence from done RP1-&3M2 o 


17 


432388 


X15218 


Hs-2969 


v-ski avisn sacoma vral oncogene homo! 


17 


438157 


AW137011 


Hs .49676 


ESTs 


17 


451945 


BE504055 


Hs. 21 1420 


ESTs 


17 


452721 


AJ2E9529 


Hs.301871 


solute carrier famly 37 (rjrycerol-3pho 


16 


454229 


AW957744 


Hs.276469 


lacrimal proline rictv protein 


3.6 


418974 


Ami IWOQ 


HS.B0667 


RALBP1 assooatsd bps domain containing 


16 


421637 


BE3B34SB 


Hs.1 05547 


neural profiferation, diflarenfiatioa an 


16 


426738 


NNL.0Q03BO 


Hs. 192803 


xeroderma pigmentosum, cornplemernscon 9 


16 


440484 


BE616768 


Hs.7232 


acetyl-Coenzyme Acarboxyiase alpha 


16 


40774B 


ALD7yw9 


H&3B176 


WAA0506 protein; SCN Chcarfan OscfM 


3.6 


453049 


BE537217 


Ks.30343 


ESTs 


16 


422)89 


AA523172 


Hs.103135 


ESTs, VUeaWy slmfer to SFR^HUMAN SPUG 


16 


408096 


8E250162 


Hs.63765 


dhydrofblate reductase 


16 


407204 


JU1933 


Hs.140237 


ESTs. Weakly ssnilarto AUU1_HUMAN ALU S 


16 


450649 


NMJ001429 


Hs^5272 


E1 A binding protein p300 


9.6 


436489 


AJ 272269 


H5.1 21429 


zkic-blndJng protein Rbcc728 


16 


A Aim a 
44/tfjO 


Vr/9940 


Hs.21906 


Homo sapiens done 24670 mRNA seqMence 


16 


424265 


BE207168 


Ha.144630 


nuclaar receptor subfamily % group F, m 


16 


452260 


AA453208 


Hs.330994 


RAB9, member RA8 oncogene famly 


16 


420344 


BE463721 


Hs.97101 


putathre G protoimcoupted receptor 


16 


427715 


BE245274 


H5.1B0428 


WAA1181 prdem 


3.6 


426793 


XB98B7 


Hs.172350 


HSR (hfetcne cell cycle regulation deiec 


16 


419B79 


Z17605 


Hs.93564 


Homer, neuronal Immediate eariygene^, 2 


16 


403752 






NMJX)2753^Homo sapiens mitogervactivate 


16 


416559 


AA225048 


Hs.104207 


ESTs 


16 


436523 


H66220 


Rs.276177 


ESTs 


16 


438705 


AI049624 


Hs.283390 


ESTs, WeaWy simHar to 2109260A B cell 


3.6 


443960 


AI093577 


Hs.255416 


hypothetbal protein RJ21966 


15 


412513 


AA3225S9 


Hs.5163 


ESTs, Wea%sirniJarto AF151840 1 CGt-8 


15 
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428695 A1355647 Hs.189999 punrarux receptor {family A group 5) 3.5 

419517 AF052107 Hs.90797 Homo sapiens clone 23620 mRNA sequence 3.5 

446494 AA463275 Hs.288905 WW Domflto-Conlaining Gere 3.5 

417515 124203 Hs.B2237 ataxfe-teJangtedasla group D-assoclated 3.S 

5 433891 AA613792 gb:no97h03.s1 NCLCGAP_Pr2 Homo sapiens 3.5 

4273t5 AA17S949 Rs.175563 Homo sapiens mRNA; cONA DKFZp5G4N0763 fF 3.5 

445525 BE149866 Hs.1483t Homo sapiens, Slmiar to ztoc&nger^ 3.5 

434672 AW294O20 Hs.1 17721 ESTs 3.5 

453288 AW583292 Hs.274412 similar to yeast Upf3, variant A as 

10 434970 AW272252 Hs.225767 ESTs 3.5 

417193 A192Z189 Hs.2B8390 hypothetical protein FU22795 3.5 

437617 AB2B7Q1 Hs£716 KJAA0310 gens product 3u5 

445943 AW6SB533 Hs.181574 ESTs 3.5 

412828 AI972402 Hs^06O51 hypotbefcal protan MGC2648 3.5 

15 452167 M75238 H5,13075 Homo sapiens cDNA: RJ2301 3 fis, clone L 3.5 

410467 AF102546 Rs.63931 dachshund (DrosophPa) homolog 3,5 

440193 AW902312 Hs.7037 Homo sapiens done 24923 mRNA sequence 3.5 

451B15 AW974911 Hs,184793 HomoaapiatecDNA: FU21880fi5.cton8H 3.5 

^„ 429170 KNL001394 Hs.2359 dual specffidty phosphatase 4 3.5 

20 4100B2 AADB1594 Hs.158311 MusashUDrosophlla) homolog 1 3.5 

436087 BE3D0296 Hs.5054 CGM33 protein 3.5 

437698 W81260 Hs.43410 ESTs 3,5 

451131 A1267586 Hjl266012 fatfy-ackJ-Coortzyme A Dgase, tons-chain 3-5 

439609 AW971945 Hb.293236 ESTs 3.5 

25 443555 M71710 Ha.21398 ESTs, Moderately similar to A Chain A, H 3.5 

418510 AW245993 Hs.223394 hypothetical proteTn MGC2742 3.5 

409960 BE261944 hsxddnasel 3.5 

425263 MM-001197 Hs.t55419 BCmnteracHng WHer (apopWs-induc 3.5 

40421O NM^Ot^36:HornosapfertsrnyeloIa7lymphold 3.5 

30 442323 AW016669 Hs.29190 ESTs 3.5 

452707 AKB3823 Hs.45070 ESTs 3,5 

421437 AW821252 Hs.1 04336 hypothetical proteVi 3.5 

426514 BE616633 Hd.170195 boas morpfioffenefc protein 7 (osteogenic 3,4 

450828 AW382884 H»,204715 ESTs 3.4 

35 446625 BE266822 Hs.344097 filarain A, alpha (acfin-clnding protein- 3,4 

421B67 AM81076 Hs.1 09045 hypotheBcet protein FU10498 3.4 

434954 AF161455 Hs.284295 Homo sapiens HSPC337 mRNA, partial cos 3.4 

4529D8 AB001451 Hs.309S5 neuronal She adaptor homolog 3.4 

429922 297630 Hs.226117 HI Nstonefamly, mamberO 3,4 

40 448734 BE614070 Hs326416 Homo sapiens mRNA; cDNA DKFZp554H19l6 (F 3.4 

434293 NM.004445 HaJ3795 EptiB6 3,4 

414222 AL135173 scrotal dehydrogenase 3.4 

421674 T10707 Hs.296355 hypothetical protein FU2313B 3.4 

Atr 438270 AA805166 Kg.154762 HlV-1 rav binding protein 2 3.4 

45 411078 AJ22202Q Hs.1B2364 CocoaCrisp 3.4 

419749 X73608 Hs.93029 snarctosteonectln, ewev and kszaVIke d 3-4 

432302 AA345B57 Hs.274307 K1AA1442 proton 3.4 

407944 R34Q08 Hs.239727 dssrnocoirin 2 3.4 

431674 AA09B901 Rs.301642 (^protein coupted receptor 3.4 

50 408291 AB023191 Hs.44131 WAA0974 protein 3.4 

'431578 AB037759 Hs. 261587 GCN2 elF2atpha tonase a4 

440789 AB007B57 Hs.7416 KIAA0397 gene product 3.4 

434058 T99949 Hs^03428 Homo sapiens cDNA FU14B32 fis, clone OV 3.4 

452296 BE37993S H&28666 propjarnmed cell death 10 3,4 

55 417229 AA97SOS6 Hs.1 9 522 hypothetical protein PRG2849 3-4 

439593 BE073597 Hs.1 24863 ESTs 3-4 

407136 T64B96 Hs.287420 Homo sapiens cDNA FU11533 fis, clone HE 3A 

421044 AF061871 Hs.101302 Human DM sequence from clone RP1-23BD1 5 3.4 

427461 AA531527 Hs. 33 2040 hypothefcd proteb MGC13Q10 3J3 

60 417315 AI080042 Hs.180450 ribosomaJ protein 824 3.3 

423392 AA195037 Hs. 169341 HTPAP protein 3.3 

412652 AW01777 ESTs 3.3 

415172 AF079529 rte.78106 phosphodiesterase 88 3,3 

^_ 418372 AA311833 Hs.84318 replication profeto A1 (7C*D} 3-3 

65 • 447950 AW796524 Hs.63644 Homo sapiens mTcrosorna) signal pepfldase 3.3 

425010 T16B37 Hs.4241 ESTs 3.3 

420460 AA262331 H&v48376 Homo sapiens clone HB-2 mRNA sequence 3.3 

443181 AI039201 Ha.283316 ESTs &3 

442064 AI422887 Hs.88594 ESTs a3 

70 431GG3 NHL018569 Hs^67182 TBX3hso protein 3.3 

412926 A1B79076 H 3.75051 macrophage myrfctoylsted etanrne-rich C - 3.3 

42QB18 AW969635 Ks. 33032 ESTs, Weatfy similar to 120728QA reverse 3.3 

408829 NMJ06042 tts,483B4 heparan sulfate (glucosamine) 3-Osulfot 3-3 

408681 AW953S53 HsJ292833 ESTs, Weakly sJmlfar Id I3B022 hypoOiefi 3.3 

75 436278 BE396290 Hs.5097 synaptogyrin 2 3.3 

424560 AA158727 Hs. 150555 proteii predicted by clone 23733 3.3 

417215 BE253181 Hs.81687 non-metastalk; ce5s 3> protoln Bxpresse a3 

418166 AI764416 Cdc42 effector protein 3 3-3 

_^ 434629 AA7890B1 Hs^02S glloma-amplifed sequence-41 3,2 

80 4273S8 AW3S002O Hs.20415 cliromosorne 21 open reading trame 11 3.2 

457489 A1693B15 Hs.127179 crypffegena 32 

434747 AAB37085 ESTs $2 

413125 BE244588 Hs.75207 glyoxalasel 3.2 
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435561 AA351978 Ha.4943 hepatowIularcatinomaassoctaledpTOl 12 

447210 AF035269 Hs.17752 phoBphaUdyteerine-speclflcphosphollpaB 3.2 

422522 ATQ23428 Hs.34549 ESTs, HgMy similar to S94541 1 done 4 3.2 

423396 A1382555 Rs.1 27950 bromodwnaiw:ontrfn1iig 1 3.2 

5 40t785 NH,002275^Homo sapiens ksratin 15 (KRT1 12 

441345 AWD68579 Hs.7780 Romo sapiens mRNA; cDNA DKFZp564A072 (fr 3.2 

451404 AA46077S Hs.6235 ESTs, Weak!/ similar to T17248 hypothefl 3.2 

422173 BE3B582B Hs.250619 phorboMke protein MDS01 9 12 

452099 BE612992 Hs.27931 hypothetical protein FU10607 similar to 3.2 

10 452826 BE245256 Ha.301E36 peroxisomal biogenesis factor 6 3.2 

41 9829 AJ924226 Ms.115185 ESTs, Moderately similar to PC4259 ferrl 3.2 

407688 W25317 Hs.37616 Human 09 spflce variant B mRNA, complete 12 

406214 NNL01637l:Homosapiens hyrirojcysieroid (1 12 

416292 AA179233 Hs.423S0 nasopharyngeal carcinoma sosceptiblffty 3.2 

1 5 438714 AA814859 Hs. 160074 ESTs 3.2 

433213 AW665130 HsJ3719D ESTs 3.2 

412170 D16532 Hs.73729 ray low density lipoprotein receptor 3.2 

409340 BE174629 Hs.321130 hypo1heflcalpiotelnMGC2771 3.2 

413142 MB1740 Hs.75212 ornithine Carboxylase 1 3.2 

20 451338 AWB12322 Hs.19131 transcription fectorDp-2{E2FdimBrizrf 3.2 

437179 AA39350B serological defined coton cancer antlg 3.2 

41B700 AfeESffiB Hs.86970 ESTs. Moderately similar to ALU5_HUMAN A 3.2 

438381 AA80S666 Hs. 146217 Bamo sapiens cDMA: FU23077 fis, dona L 3L2 

450546 AA010200 Hs.175551 ESTs 3.2 

25 409995 AW960597 Hs.129206 ESTs 3.2 

408739 W01555 Ks.44665 ESTs, Moderately sVrflar to I3BQ22 hypot 32 

438021 AAB26425 Hs.192375 ESTs 12 

403532 NMJK4638 tooo sapiens hypomefcal prot 3.2 

_ 452941 AL110347 Hs.31074 W^utfcgUjcosarnlne sulfohyrfroiase (sulfa 3.1 

30 414341 D80004 Hs.75S09 K1AA01B2 protein 3.1 

423044 AA320B29 Hs.97266 protacedharin 18 11 

429716 R25685 H 3^11933 calaaen, type XVII, alpha 1 It 

444357 H54892 Hs.10974 hypothetical protein FLJ22390 3.1 

423551 AA3275S8 Hs.89633 ESTs 11 

453186 AK001708 Hs.3227t hypothetical protein FLJ10B45 3.1 

413550 WD3011 14*306881 MSTP043 protein 11 

431933 AI187057 Hs/t 32554 ESTs 11 

43977B AL109729 Hs.99384 putative transmembrane protein 11 

448044 AM586B2 gbtt13e01.x1 NCLCGAPJJD24 Homo sapiens 3.1 

422017 NM_003877 Hs.110776 STAT induced STAT inhibitor-2 3.1 

421254 AK001724 Hs.102950 coat protein gamma-cop 11 

424339 BE257148 endoglycan 3.1 

445109 AF039916 Hs.12330 ectonucleoside triphosphate diphosphohyd 3.1 

425320 U29344 Hs.83190 fatty acid synthase 3.1 

426170 BE1S1065 Hs.167531 methylcrotonoyl-Coenzyme A carboxylase 2 3.1 

408770 AW270508 Hs.170195 bone morphogenetic protein 7 (osteogenic 3.1 

442202 BE272BS2 Hs.108534 hypothefcal proislti BJ22625 11 

417622 AW298163 Hs.82318 WAS protein family, member 3 3.1 

439864 AI720078 Hs.291997 ESTs, Weakly similar to A47582 B-cell gr 3.1 

50 43303B AA574091 Hs .105954 ESTs 11 

447050 NMJM6314 Hs.17200 - STAM-lfke protein containing SH3 and ITA 11 

436299 AK000767 H&5111 hypothetical protein FU20729 11 

408380 ATO3050 Hs.44532 dnibtquflki 3.1 

407910 AA650274 Hs.41296 fibronectin leucine rich transmembrane p 3.1 

55 445098 AL050272 Hs.12305 DKfZP5fi68183 protefe 11 

431797 BE1CS641 Hil270134 hypothetical protein RJ2O20O 11 

437296 AA350994 Hs. 20281 K1M1700 3.1 

452827 AI122B43 Hs.184319 ESTs, Weakly similar to K1AA1036 protein 11 

44B913 AA194422 Hsi2564 myosin VI 11 

60 415079 R43179 Hs.22895 hypothetical protein FU23548 10 

408157 AA047685 rfc.62345 ESTs 10 

443337 Y076O4 Hs.9235 norvmetastaffc eels 4. protein expresse 3.0 

410310 J02931 H&J62102 coagulation factor III (thromboplastin, 3.0 

437546 AW074836 Hs.173984 T-boxt 10 

65 451134 AA318315 Hs.25999 hypothetical protein FU22195 3.0 

423932 T95633 Hs.189703 ESTs 10 

433496 AW373764 Hs.71 a^ha-2^ycoproteln 1,zlnc 10 

453293 AA382267 Hs.10653 ESTs 10 

428171 AA469323 Hs.182825 ribosomal protein L35 3.0 

428293 BE250944 Hs.183558 solute carrier family 1 (neutral amino a 3.0 

428857 NMJI15B65 Hs.171731 solute carrier family 14 (urea transport 3.0 

406S27 T64904 Hb.163780 ESTs 10 

413259 AA215404 ESTs 10 

426110 NM_002913 Hs.168563 replication factor C (activator 1) 1 (U 3.0 

411089 AA458454 cell division cycle 2-like 1 (PITSLRE pr 3.0 

431416 AA532718 ESTs 10 

431631 AA546908 Hs.122244 ESTs 3.0 

409251 R10723 HsJ20573 ESTs 10 

434974 AA778711 eukaryotic translation initiation factor 3.0 

80 447147 AA910353 Hs.75432 ESTs, WeaWy similar to T234B2 hypotheB 10 

450937 R49131 Hs.26257 ATP-dependant interferon response protei 3.0 

432728 NnA_008979 Ha_27a721 HLA class II region expressed gene KE4 3.0 

433037 NM_0141S8 Hs^79938 KSPC0B7 protein 10 
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H3 hfetone family, member A 3.0 

hypothetical protel n FU2D730 3.0 

growth arrest and DNA-damage-inducible, 3.0 

gb:EST38330i MAGE resequences. MAGL Homo 3.0 

ESTs, Weatdy sVnflar to ALUI^HUMAW ALU 8 3,0 

eryMiyrfrgcgrlwii receptor nudear transJ 3.0 

Homo sapiens cONA; RJ23435 lis, cbne H 3.0 

ESTs 3,0 

TIA1 cyiohafcgrariule-asTOciaferJ RKA-bi 3.0 

vesicle Eraffcting protein 3.0 

hypolheffca] prolan MGC13170 3.0 

ESTs 3w0 

ESTs. Weakly similar to A35559 krueppet- 3.0 

Homo sarians cONA FU2Z785 fia, cbne K 3.0 

microtubule-associated protein 7 2.9 

breast carcinoma amplified sequence 1 2.9 

claudin 8 2.9 

gb:yc1 9d03 s\ Stratagerie lung (937210) H 2.9 

GCN1 (general control of amJno^cW synt 2.9 

ESTs 2.9 

raAA021Sgeneprodud 2.9 

H2A histone family, member L 2.9 

CGI-63 protein ZO 

DKFZP5B6P0123 pitrfein 29 

collagen, type VII, alpha 1 (epidermolys 2.9 

KIAA0129 gene product 2.9 

ESTs, Weakly similar to ALU1 J4UMAN ALU S 2.9 

ESTs, Weakly similar to A43932 mucin 2 p Z9 

hypothec protab FU10B28 Z9 

HMK-1 suHMransferase Z9 

transmembrane 7 superfamily member 2 2.9 

ESTs 29 

ESTs. Moderately sirilflr 1o ZN91 _HUM^NZ 29 

DKFZP564G2022 protein 2.9 

ESTs 2.9 

Homo sapiens, done 1MAGE:41 39775, mRNA, 29 

hypothetical protein 29 

guanine nucleotide binding protein (G pr 29 

ESTs 29 

gb:nz24cOBj:1 NCLCGAF.GCai Homo saptens 29 

hypottieCcal protein FU20397 2.9 

hypoflvetteal protein MGC2487 29 

hypothetical protein MGC15435 29 

SRY (sex determining region Y)-box 22 2.9 

eukaryotic translation initiation factor 2.9 

Homo sapiens cDHA RJ 10071 fis, clone HE 29 

B4 pistons family, member H 29 

H8CARG protein 29 

IMP (inosine monophosphate) dehydrogenas 2.9 

ESTs 29 

type I transmembrane lecepkx (seizures 29 

ESTs, Highly similar lo B34DB7 hypofliet! 29 

ESTs 29 

MAD (mothers against decapentaplegic, Dr 2.9 

Homo sapiens, Simlter to epsftr 3, ckwa 29 

nban protean 29 

Homo sarienscONA FU12978 fis, clone NT 29 

hypothetical protein FU10774; KIAA1709 29 

ESTs, WeaHy similar to I38022 hypothefi 28 

Homo sapiens, clone IMAGE3502329, mRNA, 28 

Homo sapiens mRMA; cDNA DKFZp58B12022 (f 28 

ESTs. Moderately similar to CLC3_HUMAN C 2.8 

ESTs 28 

Target Exon 28 

intermediate filament protein syncoilin 2.8 

ESTs, Moderately similar to 135937 DNA/R 28 

vacuolar protein sorting 35 (yeast homol 2.8 

divalent cation tolerant protein CUTA 2.8 
SWI/SNF related, matrix associated, acti 2.8 
GKD03 proten 28 
ESTs 28 
ESTs 28 
spindle pole body protein 28 
KJAA17C8 protein 2.8 
hypothettca) protein P15-2 28 
hypolhefcal protein FU10369 28 
FK506-binding protein 4 (59kD) 2.8 
gb:yd78d06Ls1 Soaies fetal livar spleen 28 
SWI/SNF related, mamx associated, acti 26 
fatty acid amide hydrolase 2.8 
cold inducible RNA-binding protein 2.8 
protein tyrosine phosphatase, receptor t 28 
hypothetical protein PR02955 28 
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407255 AA012992 Hs.256301 hypolhfilicat protein MGC1 3170 2.B 

431472 AK001023 Hs.256549 nucleotide Wn*ig protein 2 (Exoll Mbi 2.B 

423319 AW467064 Rs.5740 ESTs 2.8 

454144 BE260478 Hs.182695 hypoflielfcal protein MGC3243 28 

5 452827 AI571B35 Hs£546B ESTs 28 

452040 AW973242 H<l293690 ESTs, We^Jy similar to B8G22 hypothec 26 

418830 BE513731 H&.BB959 fiypctheScal protein MGC4B1 6 2.a 

417517 AF00117B Hs.82238 POP4 (processing of precursor , S. cerev 28 

_ rt 419286 AA236005 Hs. 128764 ESTs 28 

10 430287 AW182459 Hs.125759 ESTs. Weakly sfrttiar to LEU5 JtUMAN LEUKE 28 

420911 U77413 Hs-10G293 Olinked N-acerytgbcosemlns (GfcNAc) V 2B 

428054 AI9488B8 Hs.266519 ESTs 2B 

409504 AA304961 Hs.699 peptidylprolyl isomerase B (cyclophilin 2.8 

.. _ 44008O AW051597 ESTs 28 

431609 AW792792 Hs.264330 N-acylsphingosine amidohydrolase (acid c 2.8 

4478SS AM36124 Hs.294009 Homo sapiens cOMA RJ133B4 fe, done PL 26 

431525 AA5C6S56 Hs.8185 KIAA1557 protein 28 

442995 AA532511 Hs.2BB455 Homo sapiens cDNA: FU23270 fls, clone C 28 

433101 AW572317 Ha.120B2 Homo sapiens mRNA; cONA DKFZp566L203 (fr 28 

20 449052 AW029507 Hs.161102 ESTs 28 

416663 AK0O1100 Hs,416SQ desmocoh%i3 26 

442013 AAS0647B Hs.10500 Human DMA sequence from ctane RP11 -353C1 2.8 

449051 AW9G1400 Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 2.8 

453D3B AW292415 Hs.20509 HBVpX associated prote!r>-8 28 

25 419175 AW270037 K1AA0779 protein 28 

426788 U66S15 Hs.172280 SVWSNF related, matrix associated, acti 2B 

421091 W22821 ribosomal protein L26 2.8 

413435 X51405 Hs.75360 carboxypeptidase E 2.8 

„ 432651 AW973744 H&.293100 ESTs 28 

30 452700 A! 659390 Hs.2B8940 five-span transmembrane protein MS3 26 

454148 AW732837 Ks.42390 nasopharyngeal carcinoma stisceplibJRty 2.8 

419452 U33635 Hs.00572 PTK7 protein tyrosine kinase 7 28 

425491 AA883316 HsJ255221 ESTs 28 

414556 AWS75053 Hs.343443 ribosomal protein L36 2.8 

35 435126 A1393666 Hs.42315 plO-Wnding protesi 28 

435 602 AF217515 Hs.283532 uncharaclerfzed bone marrow protein BMD3 27 

441128 AA57C256 ESTs, Weakly slmlar to T23273 hypolheii 2.7 

440340 AW695503 Hs.125276 ESTs 27 

410339 AI916499 Hs.296256 ESTs 27 

430542 AI557466 Hs.119122 ribosomal protein L13a 2.7 

4067B9 AI041403 ribosomal protein L29 27 

424130 AL05O136 Hs.1 40945 Homo sapwns mRNA; cDMA DKFZp5B6L141 <fr 27 

449518 BE395253 Hs.30861 EST* 27 

Mtm 410099 AA081630 WAA0038 gene product 27 

422000 M30599 Hs.110S37 homeobox A10 2.7 

407110 AAO1B042 H&-252QB5 Prader-Willi/Angelman syndrome-5 2.7 

415443 707353 Hs.7948 ESTs 27 

432745 AJ821926 gfcnl76f05.xS NCLCGAP_Pr3 Homo sapiens 27 

409299 AA045S50 Hs.53125 small nuclear ribonucleoprotein D2 polyp 2.7 

50 424369 R67622 Hs*28714 K1AA1631 protein 27 

417831 H16423 Hs.82685 CD47 antigen (Rh-related antigen, integr 2.7 

424723 BE409B13 Hs.152337 protein arginine N-methyltransferase 3(h 2.7 
451428 AW063364 Hs,11057 ESTs, Highly slmlar to T46395 hy potheB 27 
400263 Eos Control 27 

55 435750 AB029012 Hel4990 K1AA10B9 protein 27 
443804 AL135352 Hs.255883 ESTs, WeaWy slmlar to 136022 hypolheii 2.7 
421709 AA159394 Hs.107056 CED-6 protein 27 
421077 AK0D0OS1 Hs.1 0159 0 rrypcihsfcat protean 27 
^ 416359 AUD42210 Hs.1 6493 hypothetical protein DKFZp762K23lB; K1AA 27 
442104 120971 Hs.168 phosphodiesterase 4B, cAMP-specific (dun 2.7 
411605 AW006631 ESTs 27 

437133 AB019319 Hs£460 KWA0776 protein 27 
407236 W79485 Hs.1 739 B0 nuclear matrix pretah NMP200 related to 27 
432125 AW972667 Hs.183006 Homo sapiens cONA FU12300 fls, done MA 27 
65 427657 AV652249 Hs.1 6 01 07 polymerase (DNA directed), beta 27 
445636 AW105401 ribosomal protein L29 27 

443601 AW20B942 Hs. 253594 Intron of^dTorttnophdangeaJ syndro 27 
420223 NZ7807 riosomfi] protein L4 27 

„ 427515 T795Z6 Hs.179516 Integra] type 1 protein 27 
70 447673 AI8239B7 Hs.182285 ESTs 27 
411960 R77776 HM81C3 ESTs 27 
433212 BE21B049 Hs,121B20 ESTs 27 
448244 BE613416 Ks.336425 Homo sapiens, dona MGCr17296, mRNA, com 27 
__ 441551 AA318224 Hs.296141 ESTs 27 
421863 X55079 Hs.1437 glucosidase, alpha; acid (Pompe disease, 2.7 
411562 AUD50201 Hs.70769 hypothetical protein DKFZp5B6E1 923 27 
434351 AA806184 Hs.116502 ESTs 27 
442875 BE623Q03 H6.23625 Homo sapiens done TCOCTAOT142 mRNA sequ 27 
421109 L32832 Hs.101842 AT-binding transcription factor 1 2.7 
80 432450 At990739 Hs.158184 ORF 27 
426174 AA547959 Hs.1 15838 ESTs 27 
445919 T53519 Hs.334692 hypotheccaJ protein MGC14141 27 
423115 AA421973 H&169119 ESTs. WaakJy ginnar to T25731 hypothati 27 
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431222 X56777 Hs.273730 zona peJJucWa fllyooprotelci 3A (apwrn ra 2.7 

46155B NMJ01O69 Hs.26630 ATP-Nndlng cassette. siiMarnilyA(ABC1 27 

431615 AW295B59 Hs_235860 ESTs 2.7 

418649 AKJ96465 Hs.169341 ESTs, Moderately similar to S65657 alpha 27 

5 431122 Al 267593 Hs.250535 Homo sapiens mRNA;cDNA DKFZp434N 2412 (f 2.7 

438682 AA3544B9 Hs.222103 EBPSO-PDZIftteractoTof 64 kD 2.7 

424954 NM_000546 Hs.1846 tumor protein p53 (Li-Fraumeni syndrome) 2.7 

458062 AW978811 H&314451 ESTs, WeaWy simitar to ALUIJiUMAN ALUS 27 

427067 BE073913 Hs.173515 ^characterized hypothalamus protein HTO 2.7 

10 412524 AA417813 Hs.44208 hypolhsfical protein FU23153 27 

435176 AA744375 Hs.189413 ESTs 27 

419390 AI701162 Hs_902u7 hypolhellcalprotanMGC1113B 27 

428013 AF151020 Hs.181444 hypothetical protein 27 

441094 1133819 Hs.7647 MYC-associated zinc finger protein (puri 2.7 

15 451356 AA74B418 H».164S77 ESTs 27 

408461 AB037756 Hs.45207 hypdhellcalprotBinKIAA1335 27 

418516 NM_C06218 Hs.85701 phcsphoirraflkk^foase. catalytic, a! 27 

423476 AL035633 Human DMA sequence from clone RP5-1 046G1 27 

444034 AL1619S7 Ha. 10177 ptectelrin homology domain Interacting p 27 

20 404913 NMJ)2440B*iHomo sapiens Ngfch (EXtisophH 27 

407378 AA299264 Hs.57776 ESTs, Moderately similar to 138022 hypot 27 

441054 AA913591 ESTs 2.6 

428500 AI815395 Hs.184641 fatty acid desaturase 2 2.6 

430512 AF162294 Hs.241578 U6 snRMArassodated Sm-Jfce protein LSmfl 26 

25 433301 AW296280 Hs.152016 Homo sapiens cDNA: FLJ22140 fij, done H 26 

444381 BE387335 Hs.283713 ESTs. Weakly similar to S64054 hypotheH 26 

443666 AW58179 Hs.41628 ESTs, We akty efonter to neuronal thread 26 

427376 BE515037 Hs.177556 melanoma antigen, family D, 1 2.6 

439414 NMJM183 Hg.6551 ATPase, H transporting, lysosomal (vacuo 26 

421866 M24470 Hs.1435 guanosine monophosphate reductase 2.6 

421605 BE44M06 Hs.106127 RNApdJymerasol IGkDastibunlt 26 

428797 AW93625B Hs.342849 ADP-ribosylalionfactar-llke5 26 

410835 BE067395 to.68851 dynein, cytoplasmk:, Intermediate polype 26 

407BB7 AA579668 H&41072 serine {or cysteine) proteinase InNHto 2.6 

35 430341 NMJJ06348 Hs.239631 gctfi! transport complex 1 (90 kDa subtinl 26 

424956 AW19B103 Hs.18724 ESTs, WeaWy similar to granute cell mar 26 

452743 AB011128 M5.30512 HomosaplensmRNAforKIAAOSSe proteto, 26 

408212 AA297557 Ks.43728 hypothetical protein 26 

407893 BE408359 Hs<43621 Homo sapiens, Similar to hypcthefcal pr 26 

40 425976 C75094 Hs.334514 NG22 protein 26 

448424 AW009892 Hb31924 ESTs 26 

420164 AW339037 Hs24908 ESTs 26 

443444 AW952619 Hs.1 7235 Homo sapiens clone TCCCIAD0176 mRKA sequ 26 

412774 AA120865 Ws,23l36 ESTs 26 

45 445922 A1337316 Hs.147998 ESTs 26 

429925 NM_000786 cytochrome P450, 51 (lanosterol 14-alpha 2.6 

419222 AO001528 HS.B9718 spermine synthase 26 

417327 NWL0M922 Hs.019S4 SEC24 (3. cerevisiae) relErted gene Jamil 26 

420337 AW295840 Us. 14555 Homo sapiens cDNA: FU21513 fis, done C 26 

50 408743 AL110246 Hs,47367 K1AA1785 protein 26 

434011 AW953437 Hs,5485 done FLB5214 26 

434976 A1963821 ESTs 2.6 

417454 NM_000202 Hs.172458 iduronate 2-sulfatase (Hunter syndrome) 2.6 

448789 BE5391W Hs.22051 hypothetical protein MGC15548 26 

55 441049 W58320 Hs.29341 hypcUieecal protein FU 22376 26 

437271 AL137445 Hs26846 Homo sapiens mRNA; cDMA DKFZp5660134 (fr 26 

425017 AL11S305 Hs2$409 ESTs 26 

440333 AE37B424 Hs.288761 hypothetical protein FU21749 26 

434551 BE387162 Hs.2BQ&5B ESTs, Highly strrtlar to A35661 DNA exds 2.6 

60 413132 NNL006623 Hs.75209 proteb kh3^MWP^e pendent ceWyti 26 

409619 AK001015 Hs.55220 BCL2-associated athanogene 2 2.6 

444550 BE250716 Hb.67614 ESTs 26 

446258 AI283476 K&263478 ESTs 2.6 

440191 AI990417 tubufri r beUl5 26 

65 4531B7 A1181383 Hs.34549 ESTs, Highly similar to SS4541 1 done 4 26 

412133 U8346D Hs73614 solute carrier famSy 31 (copper transpo 26 

426841 A1052358 Hs.193726 ESTs 2.6 

435937 AA830B93 Hs.1 19769 ESTs 26 

426960 AW410646 Hs.164649 hypothetical protein DKFZp434H247 26 

70 456161 BE264645 H5J2B2093 hypoUw&al protein FU21 91 8 2.6 

456236 AF045229 Hs.82280 regulator of G-protein signalling 10 2.6 

435025 T08990 Hs.4742 anchor attachment protein 1 (Gaalp, yeas 26 

416751 T48130 rls.6B97 Homo sapiens mRNA; cDNA DKFZp5a6P1 622 (f 26 

„ 458946 AAO09716 Hsl42311 ESTs 26 

75 424762 AL1 19442 Hs.1 83684 euteryotfetranslalkxi InWon factor 26 

449458 AW05076 Hb.208261 ESTs 26 

432409 AAS06538 Hs.130732 WAA1575 protaii 26 

425164 BE27828B Hs.155049 Lutheran blood group (Auberger b antigen 2.6 

442336 AW34095B Hs.7572 ESTs 26 

445622 AF1Q6697 Hs.12971 thioredoxin reductase beta 2.6 

440409 AW294316 ESTs 26 

444610 AI174783 gb:HA250l Human fata! Bvar cDMA library 2.6 

425863 U43604 Hs.159901 Human unidentified mRNA, partial seqtienc 26 
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436972 AA284679 Hs^5640 rfaudw 3 2.6 

42BS09 BE313114 Hs.2970B ESTs 26 

431163 NIOQ6855 Hs.250696 KDEL (Lys-Asp-Glu-Leu) endoplasmic retic 2.6 

45763B A1792670 Rs.144405 EST* 2.6 

442821 BE391929 Hs.8752 transmembrane protein 4 2.5 

435522 N64214 Hs£774 synovial sarcoma translocation gene on c 2.5 

405764 AA429B25 H&343443 ribosomal protein L36 25 

422032 AA476966 Hs.110857 polymerase (RNA) III (DNA directed) poly 2.5 

. _ 409125 R17268 Hs.343587 agonal transport of synaptic vesicles 2.5 

10 416801 X9B834 Hj.79971 sat{Drosoptilla)-llte2 25 

446091 AVTO22192 ESTs 25 

419848 AA251242 Hs.103238 ESTs 25 

415015 NM_002109 Hs.7779B histidyl-tRNA synthetase 2.5 

431836 AF17B532 Hs.271411 betefiHe APP-deavtng erpzyme 2 2.5 

15 414SB7 AA524394 Hs^94022 hypothetical protein FU 14950 2.5 

447087 AW4C3B70 Hs.301872 hypothecs] protein NIGC4840 2.5 

456050 R79445 Hs.76230 ribosomal protein S10 2.5 

42OTB AA3714C9 Ha.105664 hypothetical protein FLJ22294 2.5 

414528 AA14B950 Hs.1BB836 ESTs 2.5 

20 426494 AL1 19528 Hs.170098 KJAA0372 gene product 2.5 

407656 AW7479B6 H&37443 Homo sapiens mRNA; cDNA DKFZp434B2119 (f 2.5 

437330 AL3S3944 Hs^0115 Homo sapiens mRNA- cDNA DKFZp761Jl1 12 (f 25 

449139 BE26B315 HbJ23111 phanylaJanin&JRNA synttietase-Ske 26 

_ _ 44B153 Y1Q8Q5 Hb.20521 HMT1 (hnRNP methyl transferase, 8. cerevl 2.S 

25 417386 AUQ37228 Hs.82043 Dl23gana product 25 

409420 Z15006 rte.5445t lamiiu'n, gamma 2 (nteeln (100WD), kaflnt 2.5 

406795 AA6SB292 Hs.170345 ESTs 2.5 

407627 AI419020 Hs.62B20 chromosome 6 open reading frame 1 2.6 

^ 421535 ABO 02359 Hs.10547a pfiosphodbosTtfomrytglycl Wxiine syntha 2.5 

30 415852 R51034 Hs.144513 ESTs 25 

426418 M90464 Hs.169625 collagen, type IV, alpha 5 (Alport syndr 2.5 

436557 W15573 Hs.5027 ESTs, Weakly similar to A475B2 B-cctl gr 2.5 

413517 N76712 H^44829 ESTs, Weahly similar to 138022 hypothett 2.5 

449500 AW955345 Hs.1 292 B ESTs 25 

457090 AL060243 Hs.180920 ribosomal protein S9 2.5 
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^ISl? 7 !? 81 "?? 4193 AlW3494AI363a77 AK68B55AA121979 BFgB3131 AA491795 B (31 52645 AJ 350401 Bf 939121 AM79401 BJ 493099 BFD57693 
AW0550 BF11 1919 BF448282 B 1 493 100 AWM376B AWOOS202 AI564010 BF433292 AI458202 A1954746 B^B2^M11 yS&48 

AW976442 AA74331 9 AA76S420 

BG18O0O3 AW975741 AW49B65 AA731828 AA731829 

AW974812 AI821B22 AJ821820 A1821075 AIB21073 AAB51643 AA551662 

^i^ 24 8,956849 6,965735 AW973032 BI962894 BI963048 AA548765 A1928504 AA041551 AW043754 AIOB6702 AW008106 AA974849 
W0136Q N94710 HB7967 

AW051597 AT733052 AH67287 AI732999 AI5B69t8 AI476787 AI791542 AA887204 AW025394 AA36333B AI2402B5 AJ791393 
AB0ie322 B0012480 BfS24S73 AW665554 AJ934469 AT479916 BRH8179 BF096162 BF096132 AA744972 AI95196& A)B5a339 BE076331 
AABB699B A1570585 A19166BB AI678B11 AI693109 AI308135 AAB68046 AA961064 AI01B0B2 H80B1B BE221942 R52809^915164 AA365B2S 
244671 BI052776 BF882486BG2B6184 AI589558 AA931663AA534979 AI275392AI273455 R52553AA829920 H80652 AA36O72BF10B1B 
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441128 



40B789 
410099 



411605 



445B36 

420223 
423476 

441064 
429925 



434976 
440191 



440409 
444610 
446091 
431843 



2093^.1 
16732.1 



432745 112643J 
400263 18977J 



10026.3 



8561.5 

19164BL1 
32437.1 

2641 490 J 
33135.1 



121716L1 
l»,790_2 



588375J 
21452BL1 
515091J 
445334.1 

1B977_1 



TABLE 68C 



AW953666 AW176773 W5527 AA765570 AA081 927 BP093262 B6743753 Ai.037576 AA534314 BE614964 BE973713 M49493 BEOOGfiM 
BED06630 AW270037 M234765 AI334004 BF0S7179 AI857450A1341191 A1&H143 A1917449 AVV517207AaS5424 AVWQ^^4 aIS47CT2 
M994211 AA861901 AA581873 AJ580i57 AI3S4363 AW242357 AW235291 ^645AA31S8®R369IW^51 ^SSSSSm^Lm 
F02090AI187299AI609544Z40516AW952314 ftmm.0OAi*raT59 N4S403 

BC015B89 BE857108AL526926 A1527438 BG913023AJ6B4B67BE8584S1 AI885227A1935218 BE645596 Afe224«?6AA77ai6t 
BF345973 AW195853 AJBB7121 A1336147 AKJ91354 AI769203 AW627BS9 BE677432 BF439954 BG8202»W^K^SbsaSSb 
AA932022 BE222881 AW016109 BM3526B7 BF52MJ99BG8t8618 AI935916 AA912295 BmSmf^SSSm^^S^ 
BG819-ffSBF835B50AA323718AA86^ 

BF835954 AL573997 A15B1252 BF9576B7 AL580170 AW952324 A15693B2 AW14B695 *Us£SS^Smmm^maS^ 
H435GBBF94518BAL559686AJL539326T15481AW895092AL582684BI519896 wi*M«/UHRilWIJ»AL5B13Z7AUB5B42 
££!J£? Z B «28850 AI356567 AI148171 AW221S5BG1 49561 BFD0C671 AA233101 AA573721 AA447991 AW01685S AI005068 AA554071 

AHMJ403Z4914S 

AK0S5674 AW965247 AV75153B AA2B0B26 RE3043 AA3313B? AKD56148 BI917678 BGB19395BG91 1971 BGB20167A11742S4 AA*W7W\ 
AA364503BG714279AWa93230AAM1?74H24222AW»176BFa^ 

A1439658 BE551237 AA2B3724 BF109530 AI457096 AI805992 B£AmXM^^ Hm^A^^^Sf^S^S^ 
BE702179AA484S49T23611BE327043AA7^ 

A1422304 AI204B99 AIO41169AA211402AW827O81 AA7BB593 T32736 AI7W935 M74m?T0^4 fl^S^ VlSbkS^^ 
AWa3370AW9Q1OTAV^71W6«TON7a^^ 

AA6S8B28AI821926A179U91AA635129AAS64492 ' 6/ " / **"" n3 
Z1 1692 X5146S N&L001B61 M19997 B1224253 BG83047B AU122147 AU123437 BG113591 BG752624BE888804 81868689 BGM7»« iwnoaav: 
BM916S2S AJS80409 AL682B66 AI909178BFB49555AA371735 BF03B641 BF727115 Ba06WB^^SsMSK™S^^ 
S»SSS M1a!a AI31323S BG831724 BFSS986:! 66998318 B ""«^B™8^SF^mASf F01241 



BE166758 BM452445 A193780B AW02612B K236B4 AWD06011 AI337621 F33111 BF344301 BG105450 BG387^BF^S47HF1wSV' 
8M00736B BF6693B5 BE7720D7 BI19MB7 BF761700BI261519 BF944452 BF898506 A^TbMO^^Siw^LTs If^r 
H^3BIM6047BF8B8479BF7B13S0BE79!m»B^^ 

8F089«OBI000274BG25S503BGfi74499BG774174B«)15084 i«ioruo9i»»aM«S9BBFlw»8B6»IW84 
BG2S68S2 H10532 N46B14 R52610 AW377696 BM4804BB W58819 B1042ia3 BG97749B BE787451 BFB70009 BG477473 US1137 RHWi mmra 
WBB88 BI838226BFTO4289 AA42S173 BE741829 AWB674S5 A/123BB3AW008B31 ^^AW^AVl^m^Xm^^ 
BF921562 AA09423O BE0925B7 W88151 AA526153 AI672166 BF1M4496 R12579 BF852352 S^Tb^OZ » aE? 
B^AB45raB6381BWBB08967BGOaBB34BEK1703 H5B7ttH8S^ 

N27B07 AA2S6634 BE276324 

SSE"""" 51 H29379R19493 H1B042 AL133985AW134660 AB99437AA057405 AA917450 AI002692T09262 
AI2212D7AI659BS6AA913591 AI220M2 

W7727BG773392H5a531 AA226450BOB79564AA292178 W56883 BE813131 C03B46 BsKmSSSi Sro^^aTO^™? 
BKKKM21 BG502073 BG425943 W^7290W31353 BE3Q4451 BF20S311 era48717 N78122 AA^ 7 A525334 
AI953821 AA657925 AA935436 AW97B0B8 

BC000222 AL136871 W&UXJ2261 BC009497BM461705 BG470749 66826905 BG2851 27 BI2S3235 BE397026 BG704967 AW9S122S rimwrit 



BE260391 AA5999 12 BF686779 AJ755222 AM52272 AW241 170 AU 156555 F28259T16319 AA36250S NG4153 BtMlteUWAtf^F^m?? 
AV7295B5B^SBE667433 BM011110BG285856AI922439 BE270975 AU19338 BRSOTW A^5178 ISbSb SSSS? 



^!1??SS 2 A" 26014 AA778101 AA243218 AW49BB37 AA748311 AI754395T15728 AA7763"69A^195ii^625^ 
AJ0B1 112 AI783806 AL569622 AAO70466 BF229938 N581S9 HW288 M32598 W0293 H^S A^^361^^Sl^^ 



AAK3197 D51888 AA382S27 BG05666B AW118162 AW073071 BE293B6B BF792321 BF792258 BG107178T566M B^^m^aa^HnAt; 
AAS08004 B1085686 AA564586 AI221630 D52045 C14510 AA029390 W60153 H99743 S MT^W AJMiraDAAfi^ 



^«X 5 ?.^T^ R76BSBW * 22MB AW770291 f«»34AA2089l8 R97811 H40328 RK252 AA935029 A1B13809 AA933BOTMI29S5 
AJ619676AW1903C6 BF035010AVTO87B97AI8649B9T57243 R48211 AA1 13880 R265S4C14467 C14444 ai9&J^ RPHUMifi rfSSi^ 



AJ174783R12271 R83569 
AJ274757 AIS59500 AVWJ22192 

cSl 1 D60W 1 D^O^ 060181 M10WW ^ C1e87 «N9^Ct48l5C15068 D80763C14B18 

^?^JS\l2SI?ilr 0 ? T HL M1999r7 61224253 BG830478 AU122147AU123437 BG113591 BG752B24BE866804 BI868669 0G3372ieAW82S995 
BM016525 A15804O9 A1562865 AJ909178 BF849556 AA371735 BF03B841 BF727116 BC006547 ^T^^^UB^^mf^ 
BI03348G Bl5f 7580 BG87S486 B 1011028 A1313235 BGB31724 BF8S9B62 BG99834 B B lOll^l B FW8^^R%^ B^^roAVraS^5TO1241 
^S^^SIS^ 65 BG938849 AA248724BG829202 B^BE^^^^mxS^SSt 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of 
human chromosome 22." Dunham, et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 
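
The nucleotide-position column holds one or more exon ranges written as `start-end` pairs separated by commas. A minimal Python sketch of how such a field could be read is given here; the function name `parse_nt_positions` is hypothetical and not part of the source, and entries that are garbled or truncated are rejected rather than guessed at.

```python
def parse_nt_positions(field):
    """Split a nucleotide-position field into (start, end) integer pairs.

    Assumes the layout seen in the table: comma-separated 'start-end'
    ranges. Raises ValueError for anything that is not two integers
    in ascending order, so damaged entries are flagged, not repaired.
    """
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma
        start_text, sep, end_text = part.partition("-")
        if not sep:
            raise ValueError(f"missing '-' in range: {part!r}")
        start, end = int(start_text), int(end_text)  # int() rejects non-digits
        if start > end:
            raise ValueError(f"range start exceeds end: {part!r}")
        exons.append((start, end))
    return exons


# Example using a field copied from the table (Pkey 400860).
print(parse_nt_positions("151830-152104,152649-152744"))
# prints [(151830, 152104), (152649, 152744)]
```

Rejecting malformed ranges keeps scanning damage visible instead of silently turning it into plausible-looking coordinates.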



Pkey Ref Strand Nt_position 
403047 3540153 Minus 59793-59968 
401424 8176894 Plus 24223-2442B 
4U1401 Minus iiyy/D-tiiii/z 
401747 9769672 Minus 118596-118816,119119-119244,119609-11976 
404632 9796868 Plus 45096-45229 
403046 3640153 Minus 55707-55859,56369-56511 
404922 7341893 Plus 13248-13426 
401519 6649315 Plus 157315-157950 
401197 9719706 Plus 176341-176452 
401866 8018106 Plus 73126-73623 
405387 6567915 Minus 3769-3833,5708-5895 
403752 7678857 Plus 33704-33828 
404210 5006246 Plus 1B9326-170121 
401785 7249190 Minus 165778-166996,166189-166314,166406-16656 
406214 7342036 Plus 66320-86523 
403532 8076842 Minus B17S0-81S01 
400860 9757499 Minus 151830-152104,152649-152744 
404913 7341740 Plus 97717-S7B76 



Table 69A lists about 2G0 genes that are upregulated in human umbilical vein endothelial cells (HUVEC) compared to normal body tissues. These genes were selected from 59680 
probesets on the Eos/Affymetrix Hu03 GeneChip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized 
value reflecting the relative level of mRNA expression. 

Table 70A lists about 148 genes that are downregulated using the mean of vessels (veins and arteries) compared to the mean of HUVEC. These genes were selected from 59680 
probesets on the Eos/Affymetrix Hu03 GeneChip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized 
value reflecting the relative level of mRNA expression. 

TABLE 69A: 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of the mean of HUVEC AIs to the mean of the normal body tissue AIs 
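
As an illustration of the R1 column, the ratio of group means can be sketched in a few lines of Python. This is only a sketch under stated assumptions: the function name `r1_ratio` and the sample AI values are hypothetical, and the source does not describe how the AI values themselves were normalized.

```python
from statistics import mean


def r1_ratio(huvec_ai, normal_ai):
    """Return mean(HUVEC AIs) / mean(normal body tissue AIs) for one probeset."""
    if not huvec_ai or not normal_ai:
        raise ValueError("each group needs at least one AI value")
    denominator = mean(normal_ai)
    if denominator == 0:
        raise ValueError("mean normal-tissue AI is zero; ratio undefined")
    return mean(huvec_ai) / denominator


# Hypothetical AI values for a single probeset (not taken from the table).
huvec = [300.0, 340.0, 320.0]    # mean 320
normal = [200.0, 220.0, 180.0]   # mean 200
print(round(r1_ratio(huvec, normal), 2))  # prints 1.6
```

A value above 1 indicates higher mean expression in HUVEC than in the normal body tissues, which matches how the rows of Table 69A are reported.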



Pkey ExAccn UnigeneID Unigene Title R1 
424806 AA382523 Hs.105889 MSTP031 protein 1.51 
418036 Z3797B Hs.63337 latent transforming growth factor beta b 1.94 
444381 BE367335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 1.60 
442802 AL133035 Hs.8728 hypothetical protein DKFZp434G171 1.73 
417944 A1J077196 Hs.82965 collagen, type V, alpha 2 1.46 
410480 R97457 Hs.639B4 cadherin 13, H-cadherin (heart) 1.91 
452516 AA05B630 Hs.29759 RNA POLYMERASE 1 AND TRANSCRIPT RELEASE 1.59 
425139 AW630486 Hs.25338 protease, serine, 23 1.63 
402463 NM_014624: Homo sapiens S100 calcium-bindi 1.50 
423798 AF047033 Hs.132304 solute carrier family 4, sodium bicarbon 1.31 
412568 AI87B826 Hs.74034 caveolin 1, caveolae protein, 22kD 1.53 
419948 AB041035 Hs.93W7 NM_016931: Homo sapiens NADPH oxidase 4 ( 2.06 
447526 AL048753 Hs.303649 small inducible cytokine A2 (monocyte ch 2.10 
445500 U78093 Hs.15154 sushi-repeat-containing protein, X chrom 1.67 
441457 AW996651 HS/43S3B ESTs 1.30 
408296 AL117452 Hs.44155 DKFZP586G1517 protein 1.61 
418994 AA29B520 Hs.89546 selectin E (endothelial adhesion molecul 1.87 
413795 AL040178 Hs.142003 ESTs 1.64 
441689 AI123705 Hs.2890B8 ESTs 1.44 
412567 AI760979 Hs.74034 Homo sapiens clone 24651 mRNA sequence 1.75 
424432 AB037821 Hs.14685B protocadherin 10 1.65 
432673 AB028B59 Hs.279605 DnaJ (Hsp40) homolog, subfamily B, membe 1.31 
418683 U90906 Hs.87241 hypothetical protein from clones 23549 a 1.42 
453085 AW954243 KIAA0251 protein U7 
438BB7 R66867 Hs.265499 ESTs 1.49 
436729 BE621807 transmembrane 4 superfamily member 1 1.91 
400494 E(4SP0000023a97O*^IG3O (Fragment). 1.34 
442506 BS66411 ESTs 1.54 
425023 AW956889 Hs.154210 endothelial differentiation, sphingolipi 1.55 
414476 AA301867 Hs.76224 EGF-containing fibulin-like extracellula 2.65 
452933 AW391423 Hs.288555 Homo sapiens cDNA: FLJ22425 fis, clone H 1.62 
422389 AF240S35 Hs.115897 protocadherin 12 1.33 
417124 BE122762 Hs.25338 ESTs 2.13 
433681 AJ0D4377 HsJ200360 Homo sapiens cDNA FLJ13027 fis, clone NT 1.31 
419983 W55956 Hs.94030 Homo sapiens mRNA; cDNA DKFZp586E1624 (f 1.54 
414776 AA155598 Hs.212839 hypothetical protein FLJ14195; KIAA17f4 1.51 
450534 AI570189 Hs.25132 KIAA0470 gene product 1.43 
449618 AI076459 Hs.15978 KIAA1272 protein 1.42 
446098 AW072215 Hs3G8470 ESTs 1.53 
413281 AA861271 Hs.222024 transcription factor BMAL2 1.40 
448674 W31178 Hs.154140 ovary-specific acidic protein 1.47 
407061 X97748 gb:H.sapiens PTX3 gene promoter region. 1.33 
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434846 AW295389 Hs.119768 ESTs 1.29 

40B570 AL04640S Hs.103463 K1AAT798 protein 1.47 

436772 AW9756B8 metallothionein 1E (functional) 1.81 

_ 453789 AAB28S17 Hs.118502 ESTs 1-41 

5 424259 AKQ01776 Hs.143954 hypolhefcal protein FU10914 1.35 

441020 W79283 Hs*35Q62 ESTs 162 

401234 mitogen-activated protein kinase 8 inter 1.46 

420174 A1B24144 Hs.199749 ESTs 1.75 

4456B4 AK001696 Hs.13109 Ran binding protein 11 1.49 

418693 AI75QB76 Hs.87409 thrombospondin 1 1.69 

426535 AUD77012 HB.2885B2 ESTs, Wee% similar to ubiquitous TPR m 1.77 

412330 NM_005100 Hs.788 A kinase (PRKA) anchor protein (gravin) 102 

412646 NMJ306825 Hs.7436B tmn a membrane protem (63*D), endaptesmi 164 

434523 AA7O3709 Hs.23410 translocase of Inner mitochondrial membr 1.26 

433376 AI249361 Hs.74122 caspase 4, apoptosis-related cysteine pr 1.49 

422099 AA156022 Hs.1 11518 hypothec* cat protein 1.56 

449394 AAD0436B Ha.18160 Homo sapiens cDNA FU115S0 lis, clone HE 1.54 

445119 AF035121 Hs.12337 kinase insert domain receptor (a type II 1.40 

„ 438142 T90309 Hb.269651 ESTs 145 

437478 AL390T72 Hs.317432 branched chain aminotransferase 1, cytos 1.44 

422809 AK001379 Hs.121028 hypdhetfcal prolan FU10549 142 

453459 8E047032 Hs.2577B9 ESTs 1.51 

442711 AF161073 Hs.8645 hypothetical protein 142 

426428 AUJ37544 Hs.1 84298 cyd In-dependent kinasa 7 (rwmotog of Xe 132 

400286 X06256 Hs.149609 integrin, alpha 5 (fibronectin receptor, 2.67 

417640 D30B57 Hs.82353 protein C receptor, endothelial (EPCR) 2.47 

407252 AA65S037 Hs.163780 (A8075B32) KJAA19S2 protein {Homo saplen 176 

418825 AA226861 Bs.22394 hypottietical protelft RJ 10893 1-40 

417805 U3B545 Hs.82587 phospholipase D1, phosphatidylcholine-spe 1.35 

426B26 NM_0M02fi H6.172670 activin A receptor type II-like 1 1.55 

415714 NMJW2290 H&7B672 !OTrmn t abha4 1.90 

446964 ABQ20722 Hs.16714 Rho guanine exchange factor (GEF) 1 5 171 

421776 AA428Q0O Hs.283D72 scHn related protein 20 oonvlex, subun 188 

. ^ 42121B NML000499 H8.72&12 cytochrome P450 ) subferrffl/I (arorneOc c 175 

35 462298 AI039243 Ha£7B5B5 MM_024756; Homo sapiens hypothetical pro 1.86 

434596 T69538 gb:yb65gtZs1 Stratagene ovary (937217) 152 

412351 AL135950 Hs.73828 T-cell acute lymphocytic leukemia 1 1.61 

433374 A1B21409 Hs*04471 EST 1-45 

429113 D28235 Hs.196384 prostaglandin-endoperoxide synthase 2 (p 2.14 

423099 NM_002837 Hs.123641 protein tyrosine phosphatase, receptor t 1.50 

435705 AA7821U H&28Q43 ESTs 1.53 

456999 AA319795 Hsl2985B1 eukaryotic translation elongation factor 1.62 

453983 H94997 Hs.16450 ESTs 1.94 
406506 Target Exon 197 

424503 NM_002205 Hs.149609 integrin, alpha 5 (fibronectin receptor, 2.32 

452924 AW5B0339 Hs.97199 complement component C1q receptor 2.36 

426596 AW363332 Hs.1 71 8 44 NMJH6505 Homo s apfens pdtovlrus recent 161 

418941 AA452970 Hs.239527 EtB-55kD&^ssoclated protein 5 1.41 

450152 AI13863S Hs22968 intcon cf VEGFR 170 

50 437269 AA334384 Hs.1 43420 ESTs 160 

445279 R41900 Hs.22245 ESTs 159 

406509 ALD36923 Hs.322710 ESTs 1.52 

410240 AL157424 Hs.61289 synaptojarrin 2 161 

424711 NM_005795 Hs.152175 calcitonin receptor-like 1.96 

55 449316 AW236021 Hs .78531 Homo sapiens, Slmlar to RIKEHcDNA 5730 181 

407862 BE548267 Hs.337986 Homo sapiens cONA FU10934 ffe, done OV 1.91 

410057 R66634 Hs,26BlQ7 mdfineiln 185 

446727 AB011095 Ha.16032 WAA0523 protein 167 

_ ^ 412564 XB3703 cardiac anVyrin repeal protein 1-51 

60 414786 AI246462 Hs. 243010 Kama sapiens ras homotog gene family, me 171 

410276 AI554545 Hs.71B32 angiopoietin-2 1.91 

4DB6Z7 T64904 Hs.163780 ESTs 1.76 
405025 Bone morphogenetic protein 5 1.69 

422648 DB6983 Hs.1 16693 Melanoma associated gene 2.33 

65 452652 AKC01972 H s^O 922 rr/pothefcal protein FU11110 1.76 

427912 AUO 22310 Hs.1 81097 tumor necrosis factor (I lgand)syperfeml 179 

432004 BEO19302 Hs.2B94 placental growth factor, vascular endoth 2.02 

430129 BE30170B Hs.233955 hypothetical protein FU2O401 193 

429276 AFO56065 Hs.1 9861 2 G protein-coupled receptor 51 1.65 

70 407704 BE315072 Ha.78768 maljgrwnt cell expresslorvenhanced gen^ 146 

446694 AA4767S6 Hs.1 04477 ESuWquifri UgaseSWURF2 2.01 

434396 AA12109B Ha.3838 serum4nducfcle kinase 1.83 

416851 AW963951 Hs.85618 ESTs 165 

412420 AUQ356G8 Hs.73853 bone mnrphonenetlc prclain 2 1.69 

75 416179 R19015 Hs.78067 MAD (mothers against decapentapleglc, Dr 171 

450589 AI369275 Hs.243010 Homo septana cOMA FU14445 fe, clone HE 166 

416658 U03272 Hs J9432 fibrillin 2 (congenital contrarian* as 1 65 

425235 AA353113 Hs.1 12497 Homo sapfens cDTW: FU22743 6s, clone H 183 

428266 AB020643 Hs.1S3005 WAA0836 profdn 177 

80 41Z755 BE144306 Hs.T73891 ESTs, Weakly similar to P4H\_HUMAN PRQLY 2.00 

456977 AK000252 Hs.169758 hyoothetcal protein FU20245 1.61 

456977 AKD0D252 Hs.1 60753 rrypothetlcal protein FU20245 161 

4Z78S7 WML017413 Hs.303084 apdln; peptide Ifeand fbrAPJ receptor 1^4 

425234
444409 
421340 
445701 
444009 
409220 
416940 
416922 
431546 
434927 
434558 
436163 
428438 
453365 
418058 
439410 
453467 
420209 
417933 
436420 
414430 
412140 
442104 
425387 
410506 
413745 
405121 
425811 
415765 



421937 
40B669 
416815 
416203 
406243 
416054 
407891 
407891 



419172 
442006 
447560 
432126 
417426 
434217 
439265 
442923 
433001 
418741 
442573 
422544 
449722 
413794 
422603 
406954 
448231 
408989 
429663 
418738 
418185 
417308 
407975 
420256 
400666 
431726 
440086 
413132 
451979 
416136 
414577 
444330 
422424 
400289 
416007 



AW152225 Hs.165909 ESTs.Weakly similar to 138022 iiypolhetJ 1.74 

AI792140 hts.49265 ESTs lBa 

F07783 tfs.1369 decay accelerating factor for complement {74 

AF055581 Hs.13131 lymphocyte adaptor protein 218 

AI3B0792 Hs.135104 ESTs t90 

BE243323 Hs,51233 lunw necrosis factor receptor superfeml 216 

N7S620 Hs.43157 ESTs 203 

AW956580 Hs,42699 ESTs 2.09 

A1834273 Hs.9711 novel proton 1 61 

H46612 Hs.Z93615 Homo sapiens HSPC285 mRNA, partial cds 2L14 

AW264102 Hs.39168 ESTs ^ 

BE263252 Hb£101 hypothetical protein MGC3178 2!()5 

Nfc*_C01955 HsJ2271 andolhelln 1 2,45 

AAD35211 Hs.17404 SOX7 SRV (sex detemiining rerfon Y)-box Z20 

AW161552 Hs.83381 guanine nucleotide binding protein 11 2.26 

AA632012 H8.1B8746 ESTs 103 

AI535997 Hs.30089 ESTs 2*39 

AA256444 Hs,1264B5 hypolheiicaJ protein FU 12604; KJM1692 m 

X02308 Hs>82962 thycrfdylate synthetase 170 

AA443966 Hs.31595 ESTs > 1 97 

AI346201 Ks.76118 ubrquifoi carboxyl-termlnal esterase L1 1 1.93 

AA219691 Hs.73625 RAB8Vtteracfln flt Wnesfn-Bke<fBbldnes 1.80 

120971 Hs.t88 phosphodiesterase 4B, cAMP-spedfic (dun 203 

J040A8 Hs.156346 topofeomerase (DMA) Halp)ia(170kDJ 1,58 

AA084248 Hb.85339 G protein-coupled receptor 39 161 

AW247252 H975514 nucfeosldB phnsphcuyfase l]gg 

. t n^en-BCtivsted protein kinase 8 Inter 299 

ALO39104 Hs,159557 karyo^snna^haa^RAGoonoftl, impor 1,76 

NM_005424 Hs.78a24 tyrosine tinase wtth fenmunoglobulin and 213 

BE041395 Hs.156110 ESTs, Weekly sMar to unknown protein 271 

AIB78857 Hs.109706 hematologbfll and neurological expressed 165 

AM93591 «s.78146 platetet/endolhellal cell a^eslon molec 239 

U415I4 Hs.80120 UDP^aoety^atoha^alactosarrfnKpolyp 1.77 

X54942 Hs.63758 CDC28 proteui kinase 2 157 

Y0O797 Hs,624 inferieukfn8 209 

NM-002316 Hs.83354 lysyloxtdB&e-llke2 242 

AA48B620 Hs-41135 endomira'n-2 234 

AA486620 Hs^1135 endomucliv2 234 

AA1SD797 Hs.109278 latexin protein 224 

BE247684 Hs. 103070 ESTs 207 

AVW3B625 HsJ22120 E5Ts**rt1ar to TRANSMEMBRANE 4 SUPERF 2.00 

AW97S183 ESTs, WeaWy similar to S724B2 hypoiheti 233 

AF065214 Hs.18858 pnosphahpase A2, group IVC (cyfosolfc, 2.11 

AA127221 Hs296502 ESTs 2 27 

NIO02291 Hs,B2124 laninfn, teta 1 2 'l1 

AWD14795 HS23349 ESTs 203 

AL134430 Hs.6906 Homo sapiens cDNA: RJ23197 f s, clone R 2!o3 

AW248322 Hs,95835 ESTs* Weakly simiter to unnamed profan 1.57 

AF217513 Hs2799QS clone HQ0310PRO03l0p1 leo 

H63265 Hs£861 ESTs, WerfdyslmSar to 841044 chrornosom 237 

H93366 Hs.7567 branclied cham aminotransferase 1, cytos 255 

AB018259 Hs.118140 WAAD7169ene product 209 

BE260074 Hs53960 cycltn B1 V76 

AF234532 rfe.61638 myosin X £fl1 

BE242587 Hs. 118851 heraatopoedcally expressed homeobox 215 

5^305 gb:H«nana|phasBtelEteandsateSifa3 108 

AI701916 HS202509 ESTs £77 

AW381666 Hs.49500 KIAAD746 protein \A3 

M68374 Hs21l567 phospho!fpase A2, group IVA (cytosoffe, 267 

AW3B3S33 H&66B2 sdutecairter family 7 f (clonic amino 216 

AWS58272 Hs347325 tntercelluter adbeston motecufa 2 260 

H6O720 rk81892 K1AA0101 gene product 1.86 

X69426 HU1716 endofhatel eel l^pecific molecule 1 292 

U84722 Hs.76206 cadherin 5, type 2, VE-cadherii (vasoila 292 

NMJX>242S:Homosaplen& matrix metaKopra 259 

NM_007351 Ra2S6107 mulbmerfn ^^72 

NML005402 Hs.268757 w^rd simian letiketnla viral oncogene hom 225 

NM. .006823 Ks,75209 protein kfnBss (cAMP-tlependent catdyfi 176 

BJ6972 Ks,27372 endotheVal tyrosine Wnase (E^ (8MX), 3.37 

«8355 Hs,295B44 tissue factor pathway inhfetor 2 ail 

AI058548 Hs.72116 hypothetical protetn FU20992simterto 3.11 

AI597G55 Hs.49265 ESTs |q6 

A1166431 Hs.296638 prostate dHferentiaHon factor 296 

X07820 Ks.2258 matrix metsltoprotelnase 10(stromely5ln 4,84 

M13509 Hs.83169 mainxmetaJtoprotainase 1 {TntefSlrfel 1191 



TABLE 69B 

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT Number Accession
453085 10017 1 BC01 7336 BG71 6430 BG501286 BI458528 AJ582Z23 N98532AI338138 AI273442AW102617 AA831177 AA745642AA412583 AA355375 

BG547492AW954243 BI7B6546 BG057641 AI192435 AI338935AI312651 AI7G8679AI191125 AI205832AA676899 AHJ78010 A188B71&AA452830 
BF445542 N69930 M715017 8F446713 BE046652 AW771909 AA907729 A11 43749 AI76129D AA890233 BF925759 A178371 3 AI7S7267 AA81 4638 
W5677B AA918461 BG743526 BE645242 A102S32B AJ296436 AT290445 H27710 AI475D34 BG740023 Al 090348 AI340003 BI602481 W38495 
Al 183314 A192741 8 BG397181 AAS76310 W1 9359 W56S07 C05751 AW380760 AW360770 AW360790 BFB30729 H28425 AA037326 AA375B05 

436729 6624 J X75684 AL673167 AM4S461 AI453743 A1963655 AI564644 AA9771B0 AI694U1 A1691355 AW071625 AI678712 AI720939 AB27769 BE439796 

AIS63432 AA292956 AW192S93 AI66SB38 A16969D5 A1424384 A116131 2 AJ91 1921 A1S97B01 BI494959 A12409B8 A1492554 AW2SZ737 BEC44G33 
AWO0857QAW62S505BI494958 AA0fl8439 AA7O6057BF 222B20 BFS936C8 B E5 01 957 AAS 24526 BE 0441 34 AW572531 AW015724 BE349166 
AA043217 BE219784 AI799814 AA129575 AI671727AM70033 BE64Q195 AVm9725 AA903050AA1 47228 AA404570AI075B78VV3B1B1 AI972739 
AW6731 52 AA723200 C0B123 BF057147 AA6276B6 AA157944 AI990245 AA66251 7 T32487 AI8001Q6 A1333170 A1B591 60 W45410 AI990S27 
AW27504B AA182640 AA478328 AI298935 AW08S1 58 AW471421 AW1 03470 AW30O456 AW191997 A1823466 AAS62397 AA136658 AI251817 
AW339104M724739AM111OTM191349AA757735AA037696^^ AI692846 AI0610S5 HBG9B3 R79S33 AI950693 

A1245632 AI349390 M1482B4 AT798502 AA487B93 AI621320 AW194272 C06365 AA9538B3 BE856936 A1916523 A1872628 A1927217 A1453463 
Al 1 89366 AW338678 AI261359 AI500576 BF477735 AKJ32569 A1972893 A1985583 Z28771 A1363829 AJ693030 AA603566 BE773488 AW3393M 
BE773489 BE773462BE773495AI65D33B BE773499 AI745717BE811475 BE811470BE811464BEB1141BBE611415 BEB11400BEB11396 
BE81 1368 BB811352 BE773501 BE773494 8E773486 BE773474 BE773473 BE773470 BE773461 BE6U350 BE811337 6F593B47 BGO55071 
AW675302 BF003Q6B AA71 9173 BE81 1348 A1582462 A16B6240 BE773500 AI244845 A156S439 A1918463 AI472527 AM46740 AA035576 AA1 9141 4 
AW6741 45 C057B2 A1589264 D5755B AI46B237 AJ432033 AA989682 R217S2 BF0024S7 AA98S297 AL574095 AL576200 AL571074 A1574525 
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BE773483 AI567995 WBQ075 BF941163 AJ738844 8E8 11458 BE7734S1 Ai262fl30 AA548565 BE706942 BE1SB360 T6SQ26 AW242B58AW1 97954 
BESQ51 84 AA722206 A1344943 AI348877 At33466Q BE621 857 BE156280 AA454099 AA037722 BF843B97 AW806183 AAQ4321 6 BG482886 
AA1 82734 AA877242 AW372926 H 27252 R381 14 BF851 858 BE166214 AA1 90427 79 1762 AAO350S7 AA837326T10930 BFS0S587 B (75 5027 
BG506731 BCMB442 BC010166 AL550134 AL553096 AL54B700AL550751 AL547978 AL545286 AL5 406 43 AH 11 8527 AL601379 B1259821 
BG74178G B1868522 AU135866 BI552770 BI259210 B1256520 BI255669 BG485098 B125822B BG496501 BM04451 2 AU 133984 AL55G586 
BE745111 B1222633AU133917BG288151 B1260715 BI550560 BG500773 B1551761 BG707601 B181 8593 Bf 6913B3 BG72 11 29 BG 541 578 
BE906666 BG7S10SB BI224135 BG400746 BG476QB5 BE79D43G AVY080238 All 1 37549 BG429B9B BE392485 AW961686 BG721055 BE906365 
BE546656 BG541235 AW5 83735 BG528290 BI260895 AW651591 BM048974 BM043B05 BG1 42185 AA315188 A1446515 CQ63O0 BG497E44 
AA888544 A1815987 BG52B631 BE619182 AW2391B5 AWO 62910 AWQ62S02 AA347236 F1 1933 AA4880D5 AA301631 AA376800 056120 
AA343532 AA308B36 P00242 AA3760B6 AA316968 AA343799 B1870221 BE910282 BG538746 AW960564 AV732879 D1 6854 AA192519 BFS22148 
AA216013 BG624091 BE544387 BG507Q0B AW17G446 BF79Q033 BE08B925 BE08B854 AA921 353 R21800 AA011222T97525 

442S06 29197J MM.Q02B40 L40377 BG227962 A1925897WG0100 BG9S0023 AAB53496 AA360401 AW956168 W61351 A1074B46 AA37379B BG193973 BG499416 

BFB16267A1675832BG611734 BG777123 AW449485 BF912016 BE566411 BM068728 AW976385 AL121194AJ767324 BM054718 AW366882 

436772 1239464_1 AA166151 Z251O9CO5177AW07568BAA731O63N67O84 

434596 14701J AF1473T4T59538 T59589 T59598T59542 

412564 18571 1 X83703 WM_014391 BC018667 AU139209 6E924924 BE924899 BE924977 BE924985 BE924947 BF229Q55 BE924S17 BE924938 BE812B21 

BE924937 AA969184 BE924898 BF229QB6 BF229054 BFW1177 BE924934 BF22904O BE925037 BE92501 1 BG 943776 AA48B072 AA486364 
236249 AW455806 AW5B2753 BF0S1131 BE924910 BF229058 BF0811 72 BE924956 AW378093 BF0B1 132 BF08116B BEB 12736 BE812738 
BF081165 004160 C04483 BF229048 

442006 1239046J AWB75183AA973583A1365103AI699495 W301787 



TABLE 69C 

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey
402463



481234 
406506 
405025 
406121 
400666 





Ref Strand Nt_position
9796895 Minus 8B1B-B952
9714719 Plus 169845-170272
9929642 Plus 120173-120337
7711374 Minus 6843-8077
7107727 Plus 105267-105343,106184-106294
6102330 Minus 35616-38004,3656746684
811B4S6 Plus 17982-18115,20297-20456
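The Nt_position column encodes each predicted exon as a `start-end` pair, with multiple exons separated by commas. As a minimal sketch of how such a field could be read programmatically, the Python below parses one field into integer pairs and sums the exon lengths. The function names are hypothetical, and treating both coordinates as inclusive is an assumption; the table itself does not state the coordinate convention.

```python
def parse_nt_position(field):
    """Parse 'start-end[,start-end...]' into a list of (start, end) integer pairs."""
    exons = []
    for chunk in field.split(","):
        # Tolerate stray whitespace or a trailing semicolon left by extraction.
        chunk = chunk.strip().rstrip(";")
        if not chunk:
            continue
        start_text, end_text = chunk.split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start {start} is after end {end}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_nt_position("17982-18115,20297-20456")
    print(exons, total_exon_length(exons))
```

Fields whose digits were damaged in extraction (for example a range with a missing hyphen) raise `ValueError` here rather than being silently guessed.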



TABLE 70A 



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of the mean of the vessel AI's to the mean of the HUVEC AI's



Pkey ExAccn UnigeneID Unigene Title R1
428928 BE409538 Hs.194657 cadherin 1, type 1, E-cadherin (epitheli 0.99
439180 AI393742 Hs.199067 v-erb-b2 avian erythroblastic leukemia v 1.05
412836 NM_004415 desmoplakin (DPI, DPII) 1.25
426158 NM_001982 Hs.199067 v-erb-b2 avian erythroblastic leukemia v 1.36
414320 U13616 Hs.75893 ankyrin 3, node of Ranvier (ankyrin G) 1.50
417678 U90916 Hs.82845 Homo sapiens cDNA: FLJ21930 fis, clone H 1.55
414572 AU077174 Hs.288181 cathepsin H 1.64
415314 N68802 Hs.5422 glycoprotein M6B 1.70
431103 M57399 Hs.44 pleiotrophin (heparin binding growth fac 1.75
406973 M34996 Hs.198253 major histocompatibility complex, class 1.88
456974 M12529 Hs.169401 apolipoprotein E 1.90
430560 Z28942 Hs.243960 N-myc downstream-regulated gene 2 1.91
406828 AA419202 Hs.84298 CD74 antigen (invariant polypeptide of m 1.97
422046 NM_012445 Hs.288128 spondin 2, extracellular matrix protein 1.99



4U0D£D 


AWoJtrtJUo 


B6.84298 


CD74 anBgen (Invariant polypeptide of m 


2.02 


453023 


AWU2a733 


Hs.31439 


serine protease Inhibitor, Kunftz type, 


ZQ9 


41AJ31 


AI22JJ20 


Ho. 7 5 335 


glydna Brn'cdinotransferase (L-arginkieig 


2.12 




AvfOIOttDl 


Hs.84298 


UJ/ 4 antgon (mvanEnt paypeptKte of nn 


2.15 


41 8299 


AA27S53Q 


H5.8396B 


ffrtognn, rets 2 (antigen LUlo (p95), yf 








Hs.144465 


ESTs 


2.21 




■Jit iVrnnc^ 


Hs.B272 


.i ■ nrtn nlnn rICr. |W in ...ft IH|,n L . J„ % 

prostaglandin U2 synuiaso (21 KU, oral nj 




44W40 


An 931 15 


Hs.16511 


i....... ,t. a. nn lit,—. 4 

kmor proteii D52mke 1 


tLlf 


4Z2t>JH 


AI328377 


Hs.l73724 


creatma ktnese, Irafn 


2.29 




Alabuyito 


na.i/IJlo/ 


u&nsducef of bKoof, l 


2.31 




HZflj/O 




riypotneuca protein FU2QD93 


2.37 


418283 


S79B95 


Hs.83942 


cathepsln K (pycnotfysostosis) 


Z38 


424247 


X140O8 


Ks.234734 


hjoozyme {renal amytoidosfe) 


239 


4221U3 


AA9B4330 


Hs.1 11576 


protein kinase H1 1; small stress protein 


2.41 


415994 


Lift* 

NM-002923 


Hs.76944 


regulator of G-protetn signalling 2, 24k 


243 


427393 


AB 02901 8 


Hs. 177635 


KfAA1095 protein 


247 


AA AlAI 

414217 


Ald(J92SS 


i i. nnmo 

Hs.27SB93 


Homo sapiens cDNA: FU23166 fis, clone L 


2.49 


414w<* 




Hs.75736 


BpoSpopfoteta D 


2.50 


426716 


NP4JD0o379 


Hs.171921 


sema (fomafri. immiffiogtobulin domaii (tg), 


2.51 


414603 


X031DD 


H3.914 


Human mRMA for SB dossil hlstocompatKii 


2S1 


410532 


153055 


HS.lbboTo 


hemoglobiii, beta 


2.56 


431369 


DTifl MA writ 


Hs.251754 


secretory leukocyte protease mhitntor ( 


2,58 


437952 


D63209 


H8.5944 


salute carrier family 1 1 (prolon-coupled 


2,58 


453655 


AW960427 


H £.342274 


transtorrnlng growth factor, beta recepto 


2.59 


453655 


AWWU42/ 


Lin <9JOa"7jf 

re.342a74 


IrenBforTFiVig growth factor, beta recepto 


259 


453665 


AW9oU427 


Hs.342874 


transforming gio\A/4h factor, beta recepto 


2.59 


427SB0 


AA418305 


Hs.303205 


EST 


2.59 


442695 


AC002425 


H 5x503 


pB protan (candidate of metaslasfs 1) 


2.62 


425751 


T19239 


Hs.1940 


crystatJn, alpha B 


2.S3 


414194 


BE175494 


Rs.75811 


N-acylsphingosine amWohydrolasB (add c 


2.63 


443907 


AU076484 


Hs.9963 


TYRO ptoWn tyrosine teiase binding pro 


Z6B 


453464 


Ai8B49f1 


Hs.32989 


receptor (calcitonin) activity modifying 


2jB7 


410577 


NM_pD3278 


Hs.65424 


tetranectin (plasmlnog en-biding protan 


2.67 


452685 


llfln JAM 

A1634651 


HSv30250 


v-maf musoubaponeurotk: ttbrosarcoma (a 


2.67 


425669 


AA524547 


Hs,16031S 


FXYO domain-con talnlrrg Ion transport reg 


273 


406643 


M77976 


H3.347939 


hemoglobin, alpha 2 


2.73 


453767 


AB011792 


Hs.35094 


Brtraceluter matrix protein 2, female o 


2,74 


424614 


X54486 


Hs.1 51 242 


serine {or cysteine) proteinase InhJbito 


2-flO 


444195 


AB0Q2351 


Hs, 10587 


K1AA0353 protein 


ZB1 


41 /Zoo 


WZ5UU5 


Hs.24395 


small inducible cytokine subfamily B (Cy 


2.86 


442496 


U54617 


Hs.6364 


Homo saptens pyruvate dehydrogenase Una 


2.92 


412810 


M21574 


Hs.74615 


pfeteiefderived groiwtS) factor receptor, 


Z93 


440320 


ALD31846 




platophflln4 


Z94 


429123 


A6011099 


Hs.1 96847 


K1AAQS27 protein 


2.97 


411529 


AA430346 


hfe317595 


Homo sapiens cDMA HJ12927 tls» done NT 


3,f)1 


413305 


kit # fttuiinr 

NM^_000426 


Hs.323511 


Homo sapiens cDNA: FU23176 fls, clone L 


3.01 


425522 


AW330847 


Hs.18578 


ESTs 


3.01 


420195 


N44348 


H&2S1385 


Homo sapiens cDNA FU1 1177 fis, clone PL 
downregidafed in ovarian cancer 1 


3l04 


446584 


U53445 


Ha 1543 2 


3.04 


407B69 


A (327 976 


Hs.24391 


hypothetical protein FIJI 361 2 


3.06 


423915 


AP039018 


Kg.135281 


aipba-acfinb-2-aB6odatEd L1M protein 


3.07 


410036 


R57171 


Hs.57976 


cabeqoestin 2 (cardiac muscfe) 
ESTs 


3.10 


412047 


AA9345B9 


Hs.49696 


3.12 


414840 


R27319 


H&23823 


hafryVBnfcancerofcpIrt related with YRP 


3.14 


424651 


AH932UQ 




ESTs 


3.17 


443B32 


AVYot»222 


ns.yy To 


tensta 


3.18 


420659 


AW468397 


Hs.1 00000 


5100 caciurn-bmding pratem A3 (cagran 


120 


434025 


AF114264 


H 3,21 6381 


Homo sapiens done HH409 wiXnown mRNA 


3.20 


421834 


BE543205 


H&288771 


DKF2PSB6AD522 protein 


&20 


423951 


D13666 


Ks.136348 


parkss'n(C&P-Zos) 


3.24 


4473B4 


Al 377221 


Hs.4052fl 


ESTs 


3^4 


425383 


D83407 


Hs. 156007 


Down syndrome critical region gene 1-Hk 


3.25 


447023 


AA356764 


Hs.17109 


integral membrane protein 2A 


3,25 


420136 


AW8010SQ 


Hs.1 95851 


actvn, alpha 2, smooth muscle, aorta 


3.25 


406972 


M32053 




gtKHuman Ht9 RNA gene, complete cos. 


328 


410132 


Lit J fVHJOn 

NMJ0O3460 


H3.300946 


Mterofibriha53oclaI.ed glycoproteln-2 


3.27 


430998 


AF128847 


Hs. 204038 


Irtdolelhylamine N-metfyilransferase 


3.2S 


456896 


NB4.Q01928 


Hs.1 55597 


0 component of complBment(adlpsln) 


3.29 


443623 


AA345519 


Ks.9641 


complefner.tcornponerrt 1, q siiticoinponeni 


a33 


410614 


At09tl95 


Hs.65029 


growth errest-spcd^c 1 




451529 


AW170[VI 

nis I # 3U I 




ESTs 




430310 


U60115 


HsJ23SQ69 


four and a half UM dornans 1 


3.37 


424897 


D63216 


Hs.153684 


frtezteo^reletBd protein 


3.37 


427111 


AA351D26 


Ks.173594 


serine (or cysteine) proteinase tahlbtto 


3.3B 


443604 


C03577 


Hs.9615 


myosin regulatory light chain 2, smooth 


3.41 


421913 


A1934365 


Hs.109439 


ostsogjycln (osteoinducfive factor, mime 


3.43 


451331 


AK0O2C39 


HSJ51395 


Homo sapiens cDNA FU11 177 fis, clone PL 


3.44 


416908 


AA333990 


Hs^0424 


coagulation factor XiD. A1 pcrtypepWe 


3.4S 


407938 


AA905097 


Hs.65050 


phospbolamban 


3.48 


418005 


AI1B6220 


his.83164 


odlagen, type XV, alpha 1 


3.fi1 


452877 


AC50789 


Hs.32478 


ESTs 


a52 


419577 


L36531 


HS51296 


integrin, alpha 8 




404277 






N^019111^HorT»B3|)ien6rnaJorrfetocompa 


as4
407815


AW3738BQ 


Hs.1 83860 


liypo'liBOcs' protein FLJ 20277 


421653 


AL1 17472 


Hs.10B924 


SH3-ckxnain protein 5 (pepsin} 


447111 


AI017574 


Hs.1 7409 


cysteme-rfch protein 1 (mtes&iai) 


426935 




Hs.172928 


collagen, type I, alpha 1 


418371 


M13560 


K3.64296 


CD74 anJSgen (Invariant polypeptide of m 


440274 


R24595 


Hs.7122 


scrapie responsive protein 1 


427818 


AW51 1222 


Hs.193765 


ESTs 


433465 


AV857776 


Hs-3314 


selenoprotein P. plasma, 1 


45B568 


At769067 


Ks. 127824 


ESTs, Weakly similar to T2B770 hypotheti 


459290 


NNUM1546 


Hs.34853 


inhfc tor of DMA binding 4, dominant neg 


4242DS 


NM_003734 


Hs.198241 


amine oxidase, copper contain fug 3 {vase 


453874 


AW591763 


H 3.36131 


collagen, iypa XIV, alpha 1 (widulin} 


42B957 


NhA_0038B1 


Hs.194679 


WNT1 inducible signaling pathway protein 


434B6B 


R50032 


Hs.159263 


collagen, type VI. alpha 2 


417043 


NM_004369 


HsJ&0S88 


collagen, type VI, alpha 3 


417512 


X76534 


H&B2226 


glycoprotein [transmembrane) nmb 


452669 


AA216363 


Hs.262958 


hypothetical protein DKKZp4 348044 


421314 


BE440002 


Hs.180324 


Komo sapiens, clone IMAG&4153312, mRNA, 


449925 


Al 342493 


Hs_24192 


Homo sapiens cDNA FU 20767 fis, done CO 


453500 


A1478427 


Ks.43125 


esophageal cancer related gene 4 protein 


430572 


U33114 


rte.245168 


tissue Inhibitor of metal to proteinase 3 


421823 


N40850 


Hs-28625 


ESTs 


422550 


BE297626 


Hs^96049 


rnlcrofibriflar-assodated protein 4 


443060 


D7&874 


HS.B944 


procollagen C^doperjCrJase enhancer 2 


450912 


AW939251 


Hs.25647 


v-ft» FBJ murine osteosarcoma viral onco 


432306 


Y182D7 


Hs.303090 


protein phosphatase 1,regutetory (lnrtfb 


419938 


AU076772 


Hs.1279 


complemen t component 1, rsybcamporient 


415165 


AW8B76M 


Hs78G65 


conipfemei]tconiporiQrit7 


414176 


BE140638 


Hs.75794 


endothelial differentiation, lysophospha 


452114 


M226B7 


H&.B23& 


ESTs 


406650 


AW24300 


Hs.172928 


collagen, typel,atpha1 


428411 


AW291464 


Hs.10338 


ESTs 


412802 


U41518 


H&J4602 


aquaportn 1 (channel-forming Integral pr 


406849 


AA454809 


Hs.172928 


collagen, &pe 1, alpha 1 


453064 


R40334 


Hs.89463 


potassium large conductance edefurn-actj 


429500 


X78565 


H&289114 


hexebracWon {teciasein C, cytotacbn) 


433336 


AFO17906 


Hs,313B6 


secreted fttzzted-rdaled protein 2 


421814 


L12350 


Hs.108823 


IhrombospomSn 2 


AZfifd 


AB0D7972 


HS.1 SwW 


myosin pnospnaiase, imjjhi suuunii 4 


41429D 


AJ568801 


Ha.71721 


ESTs 


415274 


AF001548 


Hs.78344 


myosin, heavy polypeptide 11, smooth mus 


417130 


AW276858 


Hs.81256 


S100 calclura-WrKSng protein A4 fcatetum 


4167B4 


AA334592 


Hs.79914 




453355 


AW295374 


Hs^H12 


Homo sapiens cDMA FU1 1 422 fe, clone HE 


408221 


AA912183 


Hs47447 


ESTs 


430223 


m.002514 


Hs^35935 


nephroblastoma overexpressed gene 


41658S 


X54162 


K&79386 


laornaSn 1 (smooth muscle) 


422737 


M26939 


Hs.119571 


cofegen.type III. alptial (Ehters-Oanl 


426406 


AI742501 


HS.1B9756 


corrpk^enlcornponsntl, s subcomponent 


448429 


D17406 


H&21223 


eatporOn 1, teste, smooth muscle 


413624 


BE177019 


Hs.76445 


SPARC4fte1(rnast9,rrevin) 



3.58 
3.64 
3.68 
3.70 
3,71 
3.73 
3.73 
3.75 
3.77 
3.80 
3.83 

ass 

3.89 
3.90 
3.95 
3.95 
3.97 
4.01 
4.03 
4.07 
4,09 
417 
4.25 
4.26 
4.29 
434 
4.37 
4.37 
4,41 
4.43 
4.43 
4/45 
4.51 
4.54 
4.67 

4.ea 

4.73 
4.76 
4,98 
5.13 
S.14 
5.20 
5.50 
5.65 
5.87 
6.47 
6.65 
7.04 
9,90 
9.91 



TABLE 70B 

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey CAT Number Accession 

412636 143SL1 M7783Q NML004415 AF139065 BG681115 BG740377 BI7125&4 BG000655 AA12B470 BM3B324 H274Q8 BS3163D BE167165 AW370827 

AW370S1 3 J05211 BG668B65 BG740734 BG680618 BG739778 8(765807 BM353403 BM353248 AW177784 AW2057B9 AW951576 AW8485S2 
BE1&2164 BF149266 BE940187 B K3 60445 BKBC444 BF350 983 BE720095 BE720069 B E7 151 54 BE082564 6^)82576 BED 04047 AA8573 16 
B1039774 BE713818 BE713548 AW170253 BE160433 BKJ39776 AWB86475 BM462504 BE931734 BF149264 AA340777 BF361 183 BG621737 
AU127260 AW3S4859 BF993352 BG223489 BEB19009 Bf 381184 BE7 15956 R58704 AA852212 AW36S5S5 81090356 BF087707 BE819046 
BE81 9005 AA377127 BE073467 BE819059 BEB1S048 BI036308 BG990973 BTtM09o4 BF91891 1 AU1401 55 A1951766 AI43451 8 AW804874 
BF752SB9 BE637009 BE92582B BF149265 AW995615 BEB1 4264 BU39762 AUM04O7 BE144243 8E709863 BF9&5642 BK»1923 BF933510 
AW26S32BBG436319BE1B2166AW36S175 AWB4768&BG8t82MAW1 77933 BFB73679 AW178000 BE082526 BF476B68 BF085994 BF592276 
BE082507 BE082514 BE0B2505 BF873693 AW06B840 AW847678 BFB04153 AW3G5157 BE813B30BE002030AW365153BE1 84941 BF74S421 
BE1 84920 BF839552 BE1 84933 BF842254 BE69B470BE931048 BF999889 BF368B16 BE1 84924 BE159646 BE714632 BE1B4948 BG986845 
AA131128 AA099891 W39488 C0471 5 BF096124 BE8B5341 AW7S9304 ALB03116 BE1 49760 BE70S967 BE705966 BE705968 AWB4B723 
AW376699 AW376817 AW376607 B GO 050 97 BF761116BE698084 AW84S371 AW376782AW848789 AW849074 AW381413 BF927725 BF094211 
AW997139 BE865474 BE1 851 87 BE1 56621 BE7150B9 BE7132&7 BE71329B BE179915 AW7B9309 BFH72345 BP088676 BE705939 AW752599 
BG005197 BF3500B6 BE71 5196 BE715155 BF752396 BFD93617 BFB31190 BF752409 BE006551 BG959922 BP094833 BF094748BF094583 
AW377699 AW60723B BED 6 251 9 AW37 7700 BF349467 AI190590 A1554403 AI39292H AU158477 BM67252 AU153919 AI760S16 BFQ82516 
AM39101 AA451923 A1340328 AI590975 BI791 553 AJ7C0963 AI1 42682 AA039975 AA946936 AA644381 BM314884 AA702424 A1417612 
AW190555 AI220573 AI304772 AI270345 A1627383 AA5S2300 AJ9117Q2 AW166807 A1346078 W95070 AA149191 AA026884 AI830049 /VW780435 
AI078449 AI8199B4 AIB58282 BI4685B8 A1860584 A10Z5932 AA026047 AA703232 AA658154 AA51 5500 AW1920B5 AA91 8281 T77661 A1927207 
AC05263 BFG82491 AW021347 A(58fi096 BE939662 AA08B866 D12062 AA056527 AA782109 W 19267 WO 21 56 AW1 50038 AA022701 T87181 
H44405 A191D434 BF0B2513 AI494069 AI270027 AW35878 AA128330 BG681425 BE705078 R20904 BG580059 BG676647 BF764409 AA026B54 
AV745530 BI762796 BG287391 AW79B780 BE706045 BE926470 AW799118 BF067996 BE002273 AW879451 A1 571 075 BE0677BS AV721320 
AI022862 N2W54 C03378 N84767 AA131 077 H30146 BE714290 AI6B6B69 AI568B92 AI91559B AW105514 A1BB7258 AJ538577 BE926474 
BE067737 BG3194B6 AA247685 AW79B883 AW183521 BF98M73 AW8B0B76 BE939707 8E1 85750 BE714064 BE713903 BE713868 BE713763 
BG950164 BE713810 AW3651S1 BG955489 BE005272 BF91 5937 AW365148 A1905927 BF992780 AW853812 86954443 B1770853BG679408 

BG740832BG681087 B(^B430AA455lOOT87267BE696209BE6962lOB!Qa94B3BE0062738EB^ 

BG741970 AA026480 BE705999 BG877157 BE00909G BG6B1378 BE712291 BG96H98 BG678934 BI04Q941 AA337270 AW364371 AYV847442 
B105B659 BE813GB5 W95D48 W25458 AW17778B AA025651 BE 93 1733 BF154S37 BG949393 BE714441 AW9 95245 BE711601 AI2MOS0 
BE064323 BE719390 BE940148 BG991212BF375714 BF349522 BG996267 T48793 BI013292 BE001925 AW365156 AW365154 AW606653 
BF7631Q9 BE931637 BE167181 BE713879BF3540O8 BF878726 H90899 AW365145 W38382 AI4B8487 

464042 30254.1 AJ42Q458 AI018523 AA7086B8 BF949633 AL1 19553 BF945960 AI081305 AAQ41432 AI921013A1684910 A1654847 AW874199 AI2D6120 

AW241 428 R43Q3S T66767 AW103715 W2847B BF963052 H45926 BFB0756B AW903943 BE170143 B1040435 BF931989 BI6OOO00 AV722350 
W27787 H45331 B1549781 R53955 BI549855 BG991 583 BI491 075 AW020049AW1 29293 H45263 AA41O309AA340613 R42410 AA707199 
A1431587 BE858679 AW2922S7 AI421678 AA041 135 BE466753 AJ243913 AI358894 AW1 37298 AI36848B N 64350 AA779107 AW0259S9 R4905S 
AA347011 RS5722 AW771106 F04969Z383B1 F01659H17396 BI493714 AiS80lu3AW77l447Al2025S1 AA788851 AI494438 BF856114 H22570 

440820 3091 1 AW954381 BG007409 BM314056 AA465642T30661 T331I1 Z42B34C04542BF948152BF944325BF85BB95AA935284AI267360M64249 

W67500 F079B2 AA322394 BI489987 BE644965 BM313782 AA91G364 AI809246 AA83B7S0 BF115228 AAB29730 N27413 BM141766 AI742325 
AA455261 AA9387QB AT420241 A1130O39 BF222341 BF941301 AA771807 Z41Q34 BF447457 BF447426 AW515347 11)3874 F02360 FO2302 
N34898 AI678SB6 AA807824 AA348556 AW2Q4638 BM142045 BF446776 AV64B364 AI80136B AA971739 AI017351 AA76Q722 AM60007 AI4563B3 
AI694152 AA226536 BE467282 M47B0B BE34B825 BE8305B1 BE830583 N53O09 N59351 N41056 W67501 AAQ15245 H43293 F21282 R59121 
R56280 

424661 46Q29_5 BG435302 BMD83687 AA904U35 AA488889 W76175 AA761874 H26767 AA910081 AA8370BSAI521825 BG96S378 AI478562 AA743152AA746092 
HBB8B3 BG9B6375 AA635644 AM93205M669979 BE245127 BQ986430 BG9B&529 BF685873 BG0301 57 BG622575 M766495 

TABLE 70C 

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position
404277 1834458 Minus 91665-91946



TABLE 71A: 774 GENES UP-REGULATED IN EWING SARCOMA COMPARED TO NORMAL ADULT TISSUES

Table 71A lists 774 genes up-regulated in Ewing Sarcoma compared to normal adult tissues. These were selected from 35371 probesets on the Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" Ewing Sarcoma to "average" normal adult tissues was greater than or equal to 2.1. The "average" Ewing Sarcoma level was set to the 75th percentile amongst numerous Ewing sarcomas. The "average" normal adult tissue level was set to the 85th percentile amongst various non-malignant tissues. In order to remove gene-specific background levels of non-specific hybridization, the 10th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was evaluated.
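The selection rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the probeset identifiers in the usage example, the linear-interpolation percentile, and the handling of a non-positive denominator are all hypothetical choices, not details taken from the specification. The cutoff is left as a parameter.

```python
def percentile(values, pct):
    """Linear-interpolation percentile (pct in 0..100) of a non-empty sequence."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("values must be non-empty")
    rank = (pct / 100.0) * (len(ordered) - 1)
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def background_subtracted_ratio(tumor_levels, normal_levels,
                                tumor_pct=75, normal_pct=85, background_pct=10):
    """Ratio of 'average' tumor to 'average' normal after background removal.

    The background is the 10th percentile of the normal tissues and is
    subtracted from both numerator and denominator. Returns None when the
    denominator is not positive (an assumption; the text does not cover it).
    """
    background = percentile(normal_levels, background_pct)
    numerator = percentile(tumor_levels, tumor_pct) - background
    denominator = percentile(normal_levels, normal_pct) - background
    if denominator <= 0:
        return None
    return numerator / denominator


def select_upregulated(expression, cutoff):
    """expression maps a probeset id to (tumor_levels, normal_levels)."""
    selected = {}
    for pkey, (tumor_levels, normal_levels) in expression.items():
        ratio = background_subtracted_ratio(tumor_levels, normal_levels)
        if ratio is not None and ratio >= cutoff:
            selected[pkey] = ratio
    return selected


if __name__ == "__main__":
    normal = list(range(0, 101, 10))  # hypothetical normal-tissue intensities
    data = {"probe_a": ([100] * 4, normal), "probe_b": ([310] * 4, normal)}
    print(select_upregulated(data, cutoff=2.0))
```

Subtracting the same background from both terms means a probeset with a high non-specific signal in every tissue does not appear up-regulated merely because its raw intensities are large.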



Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: 75th percentile of Ewing sarcoma to 85th percentile of body map




Pkey 


ExAccn 


UnlgenelD 


Unlgene Trite 


R1 


104659 


AW960769 


Hs.105201 


ESTs 


703 


101447 


M213Q5 




giv:Human alpha sataBte and setellte 3 


64.7 


105782 


H09748 


Hs.57987 


S-cell CLUlymphoma 1 1 B (zinc finger pro 


50.5 


113861 


NMJH5756 


Ha.184942 


G protdrvcoupted receptor 64 


48.8 


121792 


AW96972B 


Hs.96381 


ESTs, Weakly sfmBar to serine protease 


46.7 


101104 


AW862258 


Hs.169268 


neuropeptide Y receptor Y1 


48.6 


110278 


AF081573 


Hs,19492 


piotocadherinO 


46.1 


126645 


AA316181 


Hs.61635 


six transmembrane epHheOat enflgen of 


41.4 


116752 AL008583 Hs.91622 neuronal pentraxin receptor 40.5 

119717 AA91B317 Hs.57987 B-cell CLL/lymphoma 11B (zinc finger pro 39.6 

104691 U29690 Hs.37744 Homo sapiens beta-1 adrenergic receptor 36.9 

110728 AA7371D6 Hs.32250 ESTs, Moderately similar to I78885 serin 35.5 

121362 AF050147 Hs.97932 chondromodulin I precursor 34.7 

131291 NM_004350 Hs.170019 runt-related transcription factor 3 33.0 

101063 D54745 Hs.80247 cholecystokinin 31.7 

121619 AA528339 Hs.178062 ESTs, Weakly similar to phosphatidylseri 28.7 

122651 AW975398 Hs.293836 ESTs 28.0 

100299 D49493 Hs.2171 growth differentiation factor 10 26^ 

129977 NM_000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 26.2 

123619 AA802964 a b3io97c0ls1 NCI_CGAP_Pr2 Homo sapiens 26.1 

124006 AI1471K Hs.279727 ESTs; homologue of PEM-3 [Ciona savignyi 23.1 

116301 AW9G97C6 Hs.33332 ESTs 22.6 

121231 AAB1494B Hs^6343 ESTs, Weakly similar to ALUC_HUMAN !!!! 22.3 

106533 AL134708 Hs.145998 22^ 

109166 AA219B91 Hs.73625 RAB6 interacting, kinesin-like (rabkines 21.3 

131313 R96290 Hs.75874 ribosomal protein L44 20.8 

116790 AW161357 microtubule-associated protein tau m? 

105316 AT671245 Hs.24835 hypothetical protein FLJ14534 18.2 

102123 NM_001809 Hs.1594 centromere protein A (17kD) 17.8 

126218 AL049B01 Hs.13649 Novel human gene mapping to chromosome 13 17.8 

119791 AA554907 Hs^8291 ESTs 16J 

113003 AW292315 Hs.7215 ESTs 16.3 

102636 U9432D Hs.158330 neuropeptide Y receptor Y5 16.3 

126799 AW753B65 Hs.74376 olfactomedin related ER localized protei 16.3 

10S29B BE3a77S0 Hs.28369 hypothetical protein FLJ20287 15.5 

107160 AA314490 Hs.27669 KIAA1563 protein 14.8 

115313 AAB08001 Hs.184411 albumin 14.6 

123308 C141B7 Hs.103538 ESTs 14.5 

126077 M7B772 Hs.310836 ESTs 14.0 

100693 AAS84407 Hs.211695 protein tyrosine phosphatase, non-recept 13.9 

108821 NM_00S594 Hs.28298 adaptor-related protein complex 4, beta 13.9 

110288 H40665 Hs.31564 ESTs 13.8 

106498 AI221919 hypothetical protein FLJ10582 13.8 

112134 R41823 Hs.7413 ESTs; calsyntenin-2 13.7 

115265 AA334274 Hs.18368 DKFZP564B0769 protein 13.4 

12663& AW582962 Hs.102697 CGI-47 protein 13.2 

12SB9B AF078B47 Hs.191356 general transcription factor IIH, polype 12.8 

105200 AA328102 Hs.24641 cytoskeleton associated protein 2 12.3 

119456 AA248897 Hs.48784 ESTs 12.3 

135155 AI20795B Hs.165556 Homo sapiens, Similar to TEA domain fami 12.1 

108614 AA64B459 Hs.335951 hypothetical protein AF301222 12.0 

131941 BE252983 Hs350fl6 i^unin specific protease 1 12.0 

132968 AF234532 Hs.61638 myosin X 11.7 

132725 NM_006276 Hs.184167 splicing factor, arginine/serine-rich 7 11.7 

111353 W20090 Hs.6616 ESTs 11.4 

123289 AA495904 Hs.103316 ESTs 11.3 

104968 A1249502 HsJ9669 ESTs 11.1 

123532 AA608733 gb:ae56f06.s1 Stratagene lung carcinoma 11.0 

104173 AA084273 Hs.76561 ESTs, Weakly similar to S47072 finger pr 10.7 

125556 AB033064 Hs.236463 KIAA1238 protein 10.7 

127240 AJ005683 Hs.86998 nuclear factor of activated T-cells 5 10.6 

123049 QEO478B0 Hs.211869 dickkopf (Xenopus laevis) homolog 2 10.5 

132315 AF091086 Hs.44563 hypothetical protein 10.5 

134321 BE538082 Hs.8172 ESTs, Moderately similar to A46010 X-lin 10.5 

128609 NM_003616 Hs.102455 survival of motor neuron protein interac 10.4 

131045 NM_002916 Hs.35120 replication factor C (activator 1) 4 (37 10.4 

110730 M67655 Hs.26411 ESTs 10.3 

119186 A1970147 Hs.101265 hypothetical protein FLJ22593 10.3 

115909 AW872527 Hs^9761 ESTs, Weakly similar to DAP1_HUMAN DEATH 10.3 

121916 AW117207 Hs.98523 ESTs 10.2 

131402 Y06890 Hs.113503 karyopherin (importin) beta 3 10.1 

101B79 AA176374 Hs.243886 nuclear autoantigenic sperm protein (his 10.0 

133181 X91E62 Hs.66744 twist (Drosophila) homolog (acrocephalos 10.0 

131307 NM_000025 Hs.2549 adrenergic, beta-3-, receptor 10.0 

113849 AA457211 Hs.8B5B bromodomain adjacent to zinc finger doma 9.9 

103446 X9Q834 Hs.79971 sal (Drosophila)-like 2 9.7 

105091 AA148859 Hs.179909 hypothetical protein FLJ22B95 9.6 

106146 W072B6 Hs.118392 ESTs, Weakly similar to LNHUER IgE Fc re 9.5 

127003 AWB16515 Hs.173540 ATPase, Class V, type 10D 9.5 

115414 AA662240 Hs^83099 AF15q14 protein 9.4 

117B29 BE160952 Hs^47117 ESTs, Moderately similar to AUIFJttJMAN I 9.4 

12JB098 W79088 gb:EST01236 Subtracted Hippocampus, Stra 9.3 

121910 ACO4600 Hs.96978 hypothetical protein MGC10764 9.2 

130625 AF176012 Hs.260720 J domain containing protein 1 9.2 

129755 R42216 Hs.7759 Homo sapiens clone 24538 mRNA sequence 9.1 

100335 AW247629 Hs.6793 platelet-activating factor acetylhydrola 9.0 

111109 AJ940675 Hs.20914 hypothetical protein FLJ2305B 9.0 

120217 AAB62257 Hs£6035 ESTs 9.0 

112891 T03927 Hs,293H7 ESTs, Moderately similar to A46010 X-lin 8.9 

131601 NM_007115 Hs.29352 tumor necrosis factor, alpha-induced pro 8,6 

126262 AI085153 Hs.143764 ESTs, Weakly similar to unknown [H.sapie 8.8 

113903 AJ368034 Hs.36679 ESTs, Weakly similar to A46010 X-linked 8.7 

105253 AW9974B4 Hs.5003 KIAA0456 protein 8.6 

102581 AU07722B Hs.7725S enhancer of zeste (Drosophila) homolog 2 8.6 

105172 AA1&75S4 Hs.300496 mitochondrial solute carrier 8.6 

100380 082343 Hs.74376 neuroblastoma (nerve tissue) protein 8.6 

129229 AF01375& Hs.109543 polyadenylate binding protein-interactin 8.5 

10Z725 AB026187 Hs.159156 protocadherin 11 8.5 

130296 A1347487 Hs.132761 class I cytokine receptor QA 

132294 AB023191 Hs^4131 KIAA0974 protein 8.2 

118644 AA443241 ribosomal protein L44 8.2 

106575 AW970602 Hs.105421 ESTs 8.2 

133142 AW952412 Hs£5»74 ESTs, Weakly similar to A40348 Etev/Sex- 8.1 

119499 A19189Q6 Hs.55080 ESTs 8.1 

100248 NMJJ1515Q Hs.78398 KIAA0071 protein 8.1 

114837 BE244930 Hs,166B9S ESTs 8.0 

107098 A1B23593 Ha^TeaB ESTs 8.0 

123960 AW0&2B62 Hs.2B7733 hypothetical protein FLJ23189 7.9 

105280 AA894638 Hs.1460O ESTs 7.9 

115147 AA745781 Hs.38399 hypothetical protein MGC2454 7.9 

134570 U66515 Hs.172280 SWI/SNF related, matrix associated, acti 7.9 

133063 A1654133 Hs^0212 thyroid receptor interacting protein 15 7.8 

105730 AW377314 Hs.5364 DKFZP5M052 protein 7^ 

126426 AA1259B4 gb:zn27h06.r1 Stratagene neuroepithelium 7.8 

107254 Z45996 Hs.22543 Homo sapiens mRNA; cDNA DKFZp761H912 (f 7.8 

134367 AA339449 Hs.62285 phosphoribosylglycinamide formyltransfer 7.8 

11SC82 AF262297 Hs.91546 cytochrome P450 retinoid metabolizing pr 7.7 

125400 AL110151 Hs.128797 DKFZP5B6S30824 protein 7.7 

129413 AW37761D Hs.11123 DKFZPS646092 protein 7.7 

116766 A1608657 Hs.95097 ESTs 7.7 

129075 BE250162 Hs.83765 dihydrofolate reductase 7.6 

105143 AI36BB36 Hs.24808 ESTs, Weakly similar to I38022 hypotheti 7.6 

132438 AW363687 Hs.82916 chaperonin containing TCP1, subunit 6A ( 7.6 

118036 AI471862 Hs.1960GB Homo sapiens cDNA FLJ11723 fis, clone HE 7.6 

131170 NM_014253 odz (odd Oz/ten-m, Drosophila) homolog 1 7.5 

104548 R3939B Hs.91559 ESTs 7.5 

131495 AA812434 SMC2 (structural maintenance of chromoso 7.5 

132543 BE588452 Hsi44037 protein regulator of cytokinesis 1 7.4 

124040 U23752 Hs.32964 SRY (sex determining region Y)-box 11 7.4 

127695 AA714731 Hs.291457 ESTs, Weakly similar to heterogeneous ri 7.3 

132559 AF119B48 Hs.270B63 hypothetical protein PRCK5B0 7.3 

101050 ALI077324 Hs.1832 neuropeptide Y 7.2 

111023 R3S252 Hs.13055B ESTs, Weakly similar to 2109260A B cell 7.2 

116401 AWH 93940 Hs.59698 ESTs 7.1 

105127 AA045648 Hs.301957 nudix (nucleoside diphosphate linked moi 7.1 

128478 AA70B205 Hs.100343 ESTs 7.1 

110456 H5234B HsJ5636 ESTs 7.1 

118846 AW29359B Hs^DB95 homeo box C4 7.1 

120934 AA22619B gb:nc26a07.s1 NCI_CGAP_Pr1 Homo sapiens 7.1 

126132 AA225632 gb:nc08a07.r1 NCI_CGAP_Pr1 Homo sapiens 7.0 

127431 AW7713S8 Hs.175437 ESTs, Moderately similar to PC4259 ferri 7.0 

106157 W37943 HsJ4892 KIAA1323 protein 6.9 

117852 AW877787 Hs.136102 KIAA0B53 protein 6.9 

106213 N45018 Hs.B789 hypothetical protein DKFZp7B1J17121 6.9 

118D13 AI674126 Hs.94031 ESTs 6.9 

120147 AK17116 hemoglobin, beta 6.8 

116267 34905 Hs.44653 Homo sapiens cDNA: FLJ22669 fis, clone H 6.8 

132581 AK000631 Hs.52256 hypothetical protein FLJ20624 6.8 

120325 AA195B51 AP-2 beta transcription factor 6.8 

133276 AW978439 Hs^9S04 ESTs 6.8 

127742 AW293496 Hs.tB013B ESTs 6.7 

127664 AA806164 Hs.116502 ESTs 6.7 

108776 AF133123 Hs.90647 general transcription factor IIIC, polyp 6.6 

121553 AA4124B8 Hs.46820 TATA box binding protein (TBP)-associate 6.6 

107059 BE614410 Hs.23044 RAD51 (S. cerevisiae) homolog (E coli Re 6.6 

133656 AA319146 Hs.75426 secretogranin II (chromogranin C) 6.6 

115197 R18656 ESTs 6.5 

112263 W39609 Hs.22003 solute carrier family 6 (neurotransmitte 6.5 

103346 XS7613 Hs.5464 thyroid hormone receptor coactivating pr GA 

126967 AA2D5976 gb3xj46al0.r1 Stratagene hNT neuron (937 6.4 

118499 N67274 Hs.50141 EST 6.4 

105301 AW352357 Hs.7457 MAGE1 protein 6.3 

113617 AIBB9372 Hs.17207 Homo sapiens cDNA FLJ11922 fis, clone HE 6.3 

127968 AA83O201 Hs.124347 ESTs 6.3 

134719 AA3529B5 Hs.B9232 chromobox homolog 5 (Drosophila HP1 alph 6.3 

110748 AW531S92 Hs.301434 KIAA1387 protein 6.3 

115844 AI373062 Hs33293B hypothetical protein MGC5370 6.3 

121885 AA934683 Hs38467 ESTs, Highly similar to AF257737 1 cIEa 6.2 

102398 U42359 gb:Human N33 protein form 1 (N33) gene, 6.2 

130B62 AA497044 Hs.20887 hypothetical protein FLJ10392 6.2 

1322S6 AI078645 Hs.431 murine leukemia viral (bmi-1) oncogene h 6.2 

106383 AA447453 Hs27BBD Homo sapiens mRNA; cDNA DKFZp5S6M0723 (f 6.2 

13518S U73328 Hs.172648 distal-less homeobox 4 6.2 

105510 Z42047 Hs.263978 Homo sapiens PRO2751 mRNA, complete cds 6.2 

109261 BE466639 Hs.61779 Homo sapiens cDNA FLJ13591 fis, clone PL 6.1 

114357 R41577 Hs.6107 Homo sapiens cDNA FLJ14839 fis, clone Otf 6.1 

128501 AL133572 Hs.199009 protein containing CXXC domain 2 6.1 

109747 A1223M1 Hs.22969 ESTs, Weakly similar to Z141_HUMAN ZINC 6.1 

126361 AW172570 Hs.14600 ESTs 6.1 

115773 AW445044 Hs^6207 Human DNA sequence from clone RP4-530I15 6.0 

104689 AA420450 Hs392911 Rakophllin 6.0 

113575 AW1381BB Hs.15671 ESTs, Weakly similar to KBF3_HUMAN NUCLE 6.0 

121830 AW39405S Hs,9B427 ESTs, Weakly similar to I38022 hypotheti 5.9 

11929B NM_001241 Hs.155478 cyclin T2 5.9 

109841 H01052 gb^32h01^1 Soares placenta Nb2HP Homo 5.9 

115622 AW88691 Hs^OJWM Homo sapiens mRNA; cDNA DKFZp564D0472 (f 5* 

122969 AW821252 Hs.164336 hypothetical protein 5.8 

109872 R65841 Hs^fi653 ESTs 5.8 

114208 AUM9463 Hs.7859 ESTs 5.8 

113494 T91451 Hs.8B53a ESTs 5.8 

127684 AA668631 Hs.159971 KIAA0379 protein 5.8 

129300 W94197 Hs.110165 ribosomal protein L2B homolog 5.8 

127489 AA650Z50 Hsl272076 ESTs 5.8 

133479 W0165B Hs.238797 ESTs, Moderately similar to I38022 hypot 5.8 

105909 AA195191 hypothetical protein FLJ20729 5.8 

101255 BE365864 Hs.149B94 mitochondrial translational initiation f 5.8 

134676 W28051 Hs.87819 Homo sapiens, clone MGC2492, mRNA, comp 5.7 

130695 AA641767 Hs.21015 hypothetical protein DKFZp564L0964 simil 5.7 

126395 AI468004 Hs.278958 hypothetical protein FLJ12929 5.7 

130723 BE247676 Hs.18442 E-1 enzyme 5.7 

107230 AW34467 Hs34650 ESTs 5.7 

102745 AW753865 Hs.74376 olfactomedin related ER localized protei 5.6 

108699 M121514 Hs,70832 ESTs 5.6 

12BOQ0 F12310 gHtSC38O041 normalized infant brain cDN 5.6 

117357 N24829 gb:yxS8h1fcs1 Soares melanocyte 2NbHM Ho 5.6 

12062? AA382525 Hs.132967 Human EST clone 122837 mariner transposo 5.6 

12349+ AW1790t9 Hs.112110 mitochondrial ribosomal protein U2 5.5 

100438 AA013051 Hs.9141? topoisomerase (DNA) II binding protein 5.5 

126165 AI741816 Hs.125B97 ESTs 5.5 

1259-34 AA193325 Hs.32646 hypothetical protein FLJ21901 5.5 

124370 AI243499 Hs.170652 ESTs 5.5 

106135 AL117474 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 5.5 

10666B R49390 Hs.254129 KIAA1678 5.4 

132967 AA3161B1 Hs.61635 six transmembrane epithelial antigen of 5.4 

109606 243371 Hs.7012 ESTs 5.4 

115816 BE042915 Hs.22572 Homo sapiens cDNA FLJ13675 fis, clone PL 5.4 

125166 AA61062O Hs.1B1165 major histocompatibility complex, class 5.4 

131165 BE2B0074 Hs.23960 cyclin B1 5.4 

111227 TOB701 Hs.12268 ESTs 5.4 

105516 AK001269 Hs3073B hypothetical protein FLJ10407 5.4 

103409 NM_004454 Hs.43697 ets variant gene 5 (ets-related molecule 5.4 

116C0B T79153 Hs.485B9 zinc finger protein 228 5.3 

106394 Z42993 Hs^5320 Homo sapiens clone 25142 mRNA sequence 5.3 

115121 AI634549 Hs.325422 ESTs 5.3 

126464 N71807 gb:yz29d09j1 Soares_multiple_sclerosis_ 5.3 

115596 AW29B597 H8-618B4 Homo sapiens, clone IMAGE:4298026, mRNA, 5.3 

119040 RQ2394 Hs.269436 ESTs, Moderately similar to PC4259 ferri 5.3 

128040 AW500486 Hs.180610 splicing factor proline/glutamine rich ( 5.3 

110151 H18835 Hs.31608 hypothetical protein FLJ20041 5.3 

105539 AB040884 Hs.109694 KIAA1451 protein 5.2 

130567 AA383092 Hs.1608 replication protein A3 (14kD) 5.2 

111077 N41367 Hs.173002 ESTs, Weakly similar to I38022 hypotheti 5.2 

120630 A155B170 Hs.96886 ESTs 5.2 

106516 AL137311 Hs.234074 Homo sapiens mRNA; cDNA DKFZp761G02121 ( 5.2 

103318 X83301 Hs.324728 SMA5 5.2 

116129 AF18S011 Hs.49163 putative ribonuclease III 5.2 

131253 R71602 Hs34853 ESTs 5.2 

116680 AW902848 Hs.Z73B29 ESTs 5.2 

123949 AA62166S Hs.20B957 EST 5.2 

105511 AB037829 Hs>3862 regulator of nonsense transcripts 2; DKF 5.2 

12SB47 AW181885 Hs^6974S ESTs 5.1 

108301 AA06972B Hs.1B45B2 ribosomal protein L24 5.1 

110739 AXJSS660 Hs.323401 dpy-30-like protein 5.1 

104899 AA054726 H&J285574 ESTs 5.1 

125972 AI927475 Hs.35405 ESTs, Highly similar to unnamed protein 5.1 

107869 AW97B998 Hs.58595 ESTs, Weakly similar to I38022 hypotheti 5.1 

121309 AA293834 Hs.97312 ESTs 5.1 

125321 T85552 Hs,17a2S4 ESTs 5.1 

102627 AUJ21&18 Hs.158174 zinc finger protein 184 (Kruppel-like) 5.1 

104446 AFDS4555 Hs.7351 cyclic AMP phosphoprotein, 19 kD 5.0 

126020 H79863 Hs.114243 ESTs 5.0 

116B14 H50834 Hs.77899 gb:yp8Ba10*1 Soares fetal liver spleen 5.0 

130622 AI582291 Hs.16846 ESTs, Weakly similar to 04HU1M debrisoqu 5.0 

110818 AL157503 Hs.27552 Homo sapiens mRNA; cDNA DKFZp5B6N2424 (f 4.9 

127765 AA871146 Hs.129074 ESTs, Moderately similar to MEG1 MOUSE M 4.9 

108768 AF181721 Hs.61345 RU2S 4.9 

120464 AA253170 Hs.96473 EST 4.9 

114518 AW163267 Hs.106469 suppressor of var1 (S.cerevisiae) 3-like 4.9 

129012 R81936 ribosomal protein L44 4.9 

124973 A1476066 Hs.102243 ESTs, Weakly similar to I78885 serine/th 4.9 

124648 AA24S086 Hs.125034 hypothetical protein FLJ13340 4.9 

126904 AA948033 30553 ESTs 4.9 

131526 AKJ05169 Hs.2BZ74 Homo sapiens cDNA: FLJ22049 fis, clone H 4.9 

130637 AA3S6764 Hs.17109 integral membrane protein 2A 4.8 

125769 AA083456 flbanOSgQ&r1 Stratagene hNT neuron (937 4.8 

126086 H75B81 gb:yr77g01.r1 Soares fetal liver spleen 4.8 

100169 ALU37228 Hs.82043 D123 gene product 4.8 

130262 D63216 Hs.153664 frizzled-related protein 4.8 

109260 AW97B515 Hs.131915 KIAA0863 protein 4.8 

103120 BE410731 Hs.74050 Mncu^lyrrprrammraiLtrcntofilki 4.8 

111099 R15337 Hs.2195B Homo sapiens mRNA; cDNA DKFZp547D0B5 (fr 4.8 

126142 H86261 Hs.40568 ESTs 4.8 

126802 AW805510 Hs.97056 hypothetical protein FLJ21634 4.8 

105848 AW954064 Hs.24951 ESTs 4.7 

127987 A1022103 Hs.124511 ESTs 4.7 

129706 AA443241 ribosomal protein L44 4.7 

1295BB M30435 Hs.11556 Homo sapiens cDNA FLJ12566 fis, clone MT 4.7 

126629 A1096739 Hs.107260 hypothetical protein DKFZp5Q5H0623 4.7 

111348 AA034922 Hs.9565 ESTs 4.7 

100448 AF2348B7 Hs.57652 cadherin, EGF LAG seven-pass G-type rece 4.7 

127542 AA703664 Hs.179833 ESTs, Moderately similar to ALU5_HUMAN A 4.7 

132452 AW973521 Hs.247324 mitochondrial ribosomal protein S14 4.7 

123778 AW352149 Hs.102314 ESTs 4.7 

126621 A1475110 HsJ03933 ESTs 4.7 

110343 AW136703 Hs.1726B ESTs 4.6 

102963 XD2404 Hs^74534 calcitonin-related polypeptide, beta 4.6 

127207 AA377165 HSv44B33 ESTs 4.6 

113974 

133096 

109920 

1034B7 

125353 

100883 

135047 

111148 

107881 

108480 

132742 

104912 

110223 

113047 

101031 

113002 

105304 

112386 

106489 

122792 

112651 

111346 

123596 

125447 

106178 

102250 

105B6B 

127496 

127315 

110193 

105012 

129445 

114721 

120922 

115167 

102407 

106111 

123629 

103616 

100269 

11272B 

135098 

106034 

133571 

106246 

125724 

132206 

130227 

129650 

130382 

106073 

105403 

116138 

134126 

115041 

10GO12 

118732 

13Q632 

125960 

133916 

106232 

129223 

109929 

123723 

101266 

132369 

124320 

103023 

116133 

129001 

119271 

114767 

126107 

115333 

100571 

134104 

120150 

134470 

129948 

106BB2 

126199 

112727 

118B72 



AW629223 

H05840 

M7436C3 

AB033043 

BE245294 

AL134197 

A9G20690 

BE379594 

AL133092 

AA0254B0 

AAB13192 

H19836 

AI571940 

J0507D 

BE243513 

AW134924 

AA8317S5 

AA452054 

AW188551 

AA595B02 

AW970976 

AA421130 

AI562222 

AUJ49935 

AJ249361 

AA378760 

AIQ31650 

AF116622 

A1004874 

AF09815B 

W52452 



AA481003 

AA74S209 

AW602154 

AWS75398 

AF251237 

NM_002647 

NM_001949 

R91913 

AW274526 

AW952M5 



AL036917 

AL3601SQ 

AA425204 

BE397151 

AF1092S8 

NM_003450 

AL157441 

A1473827 

AA374756 

NM_003747 

AA252457 

A12406B5 

AWi$2225 

AW&36O06 

AJ754693 

AL039185 

AB037828 

U40714 

AA773137 

ALD39779 

L36645 

AA310393 

1195749 

AW50D470 

AW449597 

AA443323 

AJ061118 

A135BB65 

H75477 

BE241812 

L14561 

L35253 

8E005771 

X54942 

AIS37162 

AA149537 

AW0B8276 

T91029 

AK392D1 



Hs.34145 

Hs.64794 

Hs.1 11323 

Hs.172106 

HS.14S377 

Hs.180789 

Hs.93597 

Hs.7762 

Hs.49136 

Hs.66055 

Hs.292812 

Hs.200596 

Hs.31697 

Hs.7549 

Hs.161738 

Hs.7212 

Hs.190325 

Hs.171914 

Hg.119338 

H&.99519 

Hs.33410 

Hs.293653 

Hs.1 12640 

H5.12B686 

Hs.301763 

Hs.74122 

Hs.334642 



Hs.310764 

Hs.9329 

Hs.29797 

Hs.103822 

Htk97128 

Hs.43728 

Vfe.B2143 

Hs.6451 

Hs.112208 

Hs32971 

Hs.1169 

Hs.272104 

Ha277721 

HS.1492B 

Hs.177656 

Hs,288B21 

Hs318501 

Hs.334721 

Hs.153003 

Ha.1 18258 

Hs.155204 

Hs.T7634 

Hs.31793 

HSJ83560 

Hs.131814 

Hs.86543 

Ks.165909 

Hs.20082 

Hs.146966 

Hs.77553 

Hs.15370 

Hs.239307 

H9.294Q27 

H&278672 

Hs.73964 

Ha.190044 

Hs.102342 

Hs.117990 

Hs.313652 

Ha.1'07fi12 

Hs.65328 

Hs.154443 

Hs.93361 

Hs.87860 

H6.7B545 

H8.79107 

Hs.153746 

Hs^3758 

Ks_26358a 

Hs.26994 

H5.125829 

H&.150S9 

Hs.283316 



ESTs, Weakly similar to B49647 GTP-bindi 4.6 

zinc finger protein 183 (RING finger, C3 4.6 

ESTs 4.6 

nucleoporin 88kD 4.6 

hypothetical protein DKFZpT&T L0424 4.6 

S164 protein 4.6 

cyclin-dependent kinase 5, regulatory su 4.6 

paraneoplastic antigen MA2 4.6 

ESTs, Moderately similar to ALU7_HUMAN A 4.6 

hypothetical protein DKFZp434t042B 4.5 

ESTs, Weakly similar to T33466 hypotheti 4.5 

KIAA0547 gene product 4.5 

ESTs 4.5 

ESTs 4.5 

matrix metalloproteinase 9 (gelatinase B 4.5 

hypothetical protein PP1044 4.5 

ESTs 4.5 

Homo sapiens cDNA FLJ14209 fis, clone NT 4.5 

ESTs 4.5 

hypothetical protein FLJ14007 4.5 

ESTs, Weakly similar to T17279 hypotheti 4.5 

ESTs 4.5 

EST 4.5 

ESTs 4.5 

KIAA0554 protein 4.4 

caspase 4, apoptosis-related cysteine pr 4.4 

tubulin alpha 1 4.4 

ESTs 4.4 

gb:Homo sapiens clone FLB4217 mRNA seque 4.4 
Homo sapiens mRNA; cDNA DKFZp434M082 (fr 4.4 

chromosome 20 open reading frame 1 4.4 

ribosomal protein L10 4.4 

ESTs 4.4 

ESTs 4.3 

hypothetical protein 4.3 

E74-like factor 2 (ets domain transcript 4.3 

PR006S9 protein 4.3 

XAGE-1 protein 4.3 

phosphoinositide-3-kinase, class 3 4.3 

E2F transcription factor 3 4.3 

ESTs, Moderately similar to ALU1_HUMAN A 4.3 

membrane component, chromosome 17, surfa 4.3 

hypothetical protein FLJ12903 4.3 

melanoma antigen, family D, 1 4.3 

KIAA1B38 protein 4.3 

Homo sapiens mRNA full length insert cDN 4.3 

hypothetical protein FLJ13391 4.3 

serine/threonine kinase 16 4.3 

prostate cancer associated protein 1 4.3 

zinc finger protein 174 4.3 

downstream neighbor of SON 4.2 

ESTs 4.2 
Homo sapiens mRNA for KIAA1771 protein, 4.2 

tankyrase, TRF1-interacting ankyrin-rela 4.2 

ESTs, Moderately similar to T00256 hypot 4.2 

ESTs 4.2 
ESTs, Weakly similar to I38022 hypotheti 4.2 
zinc finger protein 3 (A8-51) 4.2 
ESTs 4.2 
thyroid hormone receptor interactor 7 4.2 
KIAA1407 protein 4.2 
tyrosyl-tRNA synthetase 4.2 
ESTs 4.2 
membrane component, chromosome 11, surfa 4.2 
EphA4 4.2 
ESTs 4.2 
EST 4.2 
multifunctional polypeptide similar to S 4.1 
EST, Weakly similar to I38022 hypotheti 4.1 
BPOZ protein 4.1 
Fanconi anemia, complementation group F 4.1 

minichromosome maintenance deficient (S. 4.1 
ESTs 4.1 
protein tyrosine phosphatase, non-recept 4.1 
ATPase, Ca++ transporting, plasma membra 4.1 
mitogen-activated protein kinase 14 4.1 
hypothetical protein FLJ22490 4.1 
CDC28 protein kinase 2 4.1 
ESTs 4.1 
hypothetical protein FLJ20477 4.1 
ESTs 4.1 
ESTs 4.1 

4.1 

131253 

134417 

113609 

127153 

124105 

100031 

106897 

128659 

133928 

125955 

103100 

119005 

118751 

128797 

111387 

11B676 

107348 

120528 

115652 

102935 

115399 

133615 

118129 

126522 

131965 

102507 

126029 

10716B 

105808 

134087 

133195 

112996 

129428 

118967 

1Z7335 



AU077002 Hs.24950 

NM_006416 Hs.82921 

T93263 H5.16B75 

A1732303 Hs.186518 

H11484 Hs.79133 



133529 



103080 
131971 
129703 



AF03S023 

AW630G87 

AW403084 

A1470523 

NM_005574 

ALD3B511 

N74210 

NM_002975 

AI244489 

N45312 

AW973753 

AI923511 

BE093589 

BE5S1850 

AF151534 

M62843 

N57493 

W31912 

W792S3 

U52154 

AA704253 

W5757« 

AI133161 

U51166 

AW34760 

BE276112 

AA2569D6 

AK8B670 

AAB30210 

AW958Q37 

W45623 

1129725 

A1678638 

AU077231 

BE5671D0 



110924 

112921 

120820 

106459 

114253 

105476 

1226B2 

130580 

130174 

115577 

129785 

126127 

109793 

103175 

100154 

10514TJ 

129052 

103319 

106319 

102391 

127202 

126872 

107315 

113119 

126962 

116203 

123377 

122798 

107780 

132733 

120649 

131163 

125708 

117417 

106272 

110834 

123663 

124250 

127038 

107711 



AW467000 

AW058463 

R91095 

AA347417 

M789081 

BE149666 

AL117352 

AA984531 



M29551 

AA393167 

H19006 

K9542S 

F13088 



H 6-07 2D 
AB006524 
BE27S031 
X83492 
W22335 
AA2S6674 
AA828125 
AW450979 
AA316241 
T47910 
R12Q14 
AW137166 
AW938163 
AW36628S 
AA018927 
AB020631 
AA687322 
AA099524 
AW962S63 
A1241733 
AW959731 
AW273860 
A1039482 
AA35Q25S 
AA233014 
W96141 



H3.167496 

Hs.103315 

Hs.7766 

Hs.139336 

Hs.184585 

Hs.125316 

H5.50454 

Hs.1Q5927 

HsJ285724 

Hs-46506 

H5.18242B 

Hs.104413 

KSu38178 

Hs.60505 

Hs.92023 

HSJ5236 

rte.2116B 

Hs.35962 

Hs.193044 

Hs.169359 

Hs.237955 

H3.2B6131 

Hs.173824 

Hs379949 

Hb.7165 

Hs.111364 

Hs.216756 

H&J14263 

Rs,286 

Hs.74571 

Hs^080 

Hs.6456 

Hs.62332 

Hs.154936 

Hs.179999 

K6.1069B5 

Ha.12940 

Hs.4276 

Hs.96869 

Hs.4029 

Hs.14831 

Hs.77196 

Hb.1 59293 

Hs.334370 

H6.151531 

Hs,41294 

Hs.184780 

Hs.8040 

Hs79227 

Hs.81892 

Hs.14912 

Hs.15B210 

H*82359 

Ks.7392 

H8.77494 



Hs.90691 

H340976 

Ks.87306 

Hs.271297 

Hs.145696 

Hjl269213 

Hs.123654 

Hs.192843 

Hs.23754 

Hs.136260 

regulator of G-protein signalling 5 4.0 

solute carrier family 35 (CMP-sialic aci 4.0 

ESTs, Weakly similar to S23650 retroviru 4.0 

hypothetical protein MGC2599 similar to 4.0 

ESTs 4.0 

AFFX control -DepX-M 4.0 

RAN binding protein 6 4.0 

trinucleotide repeat containing 1 4.0 

ubiquitin-conjugating enzyme E2E 1 (homo 4.0 

ATP-binding cassette, sub-family C (CFTR 4.0 

LIM domain only 2 (rhombotin-like 1) 4.0 

ESTs, Weakly similar to S33990 finger pr 4.0 

ESTs 4.0 

stem cell growth factor; lymphocyte secr 4.0 

ESTs 4.0 

ESTs 4.0 

ribosomal protein S2 4.0 

ESTs 4.0 

hypothetical protein FLJ23458 4.0 

small nuclear ribonucleoprotein polypept 4.0 

core histone macroH2A2.2 4.0 

ELAV (embryonic lethal, abnormal vision, 4.0 

gb:yy54c08.s1 Soares_multiple_sclerosis_ 4.0 

gb2c76d03*1 Pancreatic Islet Homo sapi 4.0 

ESTs 3.9 

potassium inwardly-rectifying channel, s 3.9 

ESTs 3.9 

RAB7, member RAS oncogene family 3.9 

CGI-101 protein 3.9 

thymine-DNA glycosylase 3.9 

KIAA1007 protein 3.9 

zinc finger protein 259 3.9 

ESTs, Weakly similar to ubiquitous TPR m 3.9 

ESTs 3.9 

ESTs, Moderately similar to ALU1_HUMAN A 3.9 

ribosomal protein M 3.9 

ADP-ribosylation factor 1 3.9 

mitogen-activated protein kinase 7 3.9 

chaperonin containing TCP1, subunit 2 (b 3.9 

cyclin D1 (PRAD1: parathyroid adenomatos 3.9 

hypothetical protein MDS025 3.9 

Homo sapiens, clone IMAGE:34570Q3, mRNA 3.9 

ESTs 3.9 

zinc-fingers and homeoboxes 1 3.9 

KIAA1701 protein 3.9 

EST 3.8 

glioma-amplified sequence-41 3.8 

Homo sapiens, Similar to zinc finger pro 3.8 

Human DNA sequence from clone RP5-876810 3.8 

ESTs 3.8 

uncharacterized hypothalamus protein HBE 3.8 

protein phosphatase 3 (formerly 2B), cat 3.8 

ESTs 3.8 

ESTs 3.8 

gb:zb80d09.s1 Soares_senescent_fibroblas 3.8 

heparan sulfate (glucosamine) 3-O-sulfot 3.8 

myomesin (M-protein) 2 (165kD) 3.8 

KIAA0101 gene product 3.8 

KIAA0286 protein 3.8 

hypothetical protein MGC2555 3.8 

tumor necrosis factor receptor superfami 3.8 

hypothetical protein MGC31&9 3.8 

deoxyguanosine kinase 3.8 

gb:oa71a09.s1 NCI_CGAP_Ov2 Homo sapiens 3.7 

gkUUH-BI3^a-*12<>-Ui.s1 NCUCGAP_Su 3.7 

nucleophosmin/nucleoplasmin 3 3.7 

gb:yb18b1 1*1 Stratagene fetal spleen (9 3.7 

ESTs 3.7 

ESTs 3.7 

ESTs 3.7 

splicing factor (CC1.3) 3.7 

ESTs 3.7 

PCF11p homolog 3.7 

leucine zipper protein FKSG14 3.7 

ESTs 3.7 

ESTs 3.7 

ESTs 3.7 

ESTs 3.7 

ESTs 3.7 
Homo sapiens mRNA; cDNA DKFZp434P0810 (f 3.7 

EST, Weakly similar to 2109260A B cell g 3.7 

ESTs 3.7 

ESTs 3.7 

130850 AB040922 Hs.20237 DKFZP566C134 protein 3.7 

119126 R45175 Hs.117183 ESTs 3.7 

125466 RJM234 Hs.1B9461 ESTs 3.6 

105150 AA631977 Hs.155995 KIAA0643 protein 3.6 

103163 AU077018 Hs.3235 keratin 4 3.6 

104495 AW976687 Hs.292979 ESTs 3.6 

107599 AW6B407Z Hs.60136 ESTs 3.6 

113577 AJ300699 Hs.111334 PRO0470 protein 3.6 

102631 Y08B90 Hs.113503 karyopherin (importin) beta 3 3.6 

109411 R98861 Hs.109655 sex comb on midleg (Drosophila)-like 1 3.6 

127563 ABD35B9B Ha.150587 Wnesin^ prote'm 2 3.6 

131112 H15302 Hs.16B950 Homo sapiens mRNA; cDNA DKFZp566A1046 (f 3.6 

11411B RJ1S98 Hs.175930 ESTs 3.6 

103937 AA934C63 Hs.13836 ESTs, Weakly similar to I38022 hypotheti 3.6 

125174 W51835 Hs.231082 EST 3.6 

104799 AA029703 gb:ze9SiOB.s1 Soares_fetal_heart_NbHH19W 3.6 

125952 AL543463 Hsj675S RaP2 interacting protein 6 3.6 

135191 X16B68 Hs.333497 cytochrome P450, subfamily IID (debrisoq 3.6 

124367 AI6B31&3 Hs.99348 distal-less homeo box 5 3.6 

113560 TS1015 ESTs 3.6 

119232 AW55226 Hs.117659 ESTs, Weakly similar to T46481 hypotheti 3.6 

113988 W87536 Hs.36473 ESTs, Weakly similar to JC5238 galactosy 3.6 

115173 BEB12940 Hs.88252 ESTs 3.6 

126600 AA69S949 Hs.191385 ESTs 3.6 

127256 A!73B6t0 Hs^67967 ESTs, Moderately similar to ALU8_HUMAN A 3.6 

123419 T660B7 Hs.112482 Homo sapiens unknown mRNA sequence 3.6 

102025 U04O45 Hs.76934 mutS (E. coli) homolog 2 (colon cancer, 3.6 

100661 BEH23001 Hs J 32746 Homo sapiens ribosomal protein L39 mRNA, 3.6 

132339 D6003Q Hs.45127 chondroitin sulfate proteoglycan 5 (neur 3.6 

119691 AW1S0215 Hs.62348 hypothetical protein FLJ11753 3.6 

134853 BE263326 Hs.90260 5-aminoimidazole-4-carboxamide ribonucle 3.6 

134075 NM_012201 Hs.78979 Golgi apparatus protein 1 3.6 

126468 T23625 Hs.1505B0 putative translation initiation factor 3.6 

127229 AA316161 Hs.61635 six transmembrane epithelial antigen of 3.6 

108451 AA079195 gb:zm92h12.s1 Stratagene ovarian cancer 3.5 

133421 AF134160 Hs.7327 claudin 1 3.5 

135332 AW393883 Hs.98968 hypothetical protein FLJ23058 3.5 

132520 AA257992 Hs.50651 Janus kinase 1 (a protein tyrosine kinas 3.5 

125562 A1484372 Hs.98968 hypothetical protein FLJ23056 3.5 

126996 BE161065 Hs.167531 imW^^^^^^^^^ 2 3.5 

125553 R53305 gb:yh1fic10.r1 Soares Infant brain 1NIB H 3.5 

117265 AA451966 RAB9-like protein 3.5 

130215 BE301683 Hs.152707 glioblastoma amplified sequence 3.5 

109482 A10O223B Hs.11462 splicing factor, arginine/serine-rich 11 3.5 

133726 AI60318B Hs.252716 oxysterol-binding protein-related protei 3.5 

132317 BE262438 Hs.44592 beta-1,4 mannosyltransferase 3.5 

117613 AW029507 Hs,1611Q2 ESTs 3.5 

100944 107518 mucin 6, gastric 3.5 

105226 R5895B Hs.26608 hypothetical protein MGC15880 3.5 

125032 T74684 gb:yc58d02.s1 Stratagene liver (937224) 3.5 

123720 AA609734 Hs.112755 EST 3.5 

126646 AA730767 Hs.285753 SCG10-like-protein 3.5 

118443 AW9B2195 HS.3398CB LBP protein 32 3.5 

126770 AB015982 Hs.143460 protein kinase C, nu 3.5 

106916 AA076049 Hs.274415 Homo sapiens cDNA FLJ10229 fis, clone HE 3.5 

131244 A1638429 Hs.24763 RAN binding protein 1 3.5 

128765 AF073310 Hs.143648 insulin receptor substrate 2 3.5 

111223 AA652773 Hs.334838 KIAA186S protein 3.5 

104857 A&20902 Hs.190S6 ESTs, Moderately similar to S65657 alpha 3.5 

105305 A1580B60 Hs.28B149 putative methyltransferase 3.5 

133562 BE391579 Hs.75087 Fas-activated serine/threonine kinase 3.5 

128527 AA504583 Hs.101047 transcription factor 3 (E2A immunoglobul 3.5 

106367 AA504747 Hs.136102 KIAA0853 protein 3.5 

103392 X94563 gb:H.sapiens dbi/acbp gene exon 1 & 2. 3.5 

101066 AA362524 Hs.250959 histatin 1 3.5 

133423 T640B4 Hs.196005 Homo sapiens cDNA FLJ11723 fis, clone HE 3.5 

117714 M45226 Hs.46495 EST 3.5 

110966 AL133117 HaJB1376 Homo sapiens mRNA; cDNA DKFZp5B6L1121 (f 3.5 

114093 AF060219 Hs.57007 chromosome condensation 1-like 3.4 

117147 AW901347 Hs.36592 hypothetical protein FLJ23342 3.4 

133347 BE25775B Hs.71475 acid cluster protein 33 3.4 

132456 AB011084 Hs.48924 KIAA0512 gene product; ALEX2 3.4 

133765 M62194 Hs.75929 cadherin 11, type 2, OB-cadherin (osteob 3.4 

109734 AI927212 Hs.3734 ESTs 3.4 

132786 BED83422 Hs.56851 hypothetical protein MGC268B 3.4 

106685 AU076617 Hs.16251 cleavage and polyadenylation specific fa 3.4 

105593 AA279341 Hs.174151 aldehyde oxidase 1 3.4 

131381 M92642 Hs.26208 collagen, type XVI, alpha 1 3.4 

1086B7 BE544475 Hs^4347 ESTs 3.4 

113115 AI141426 Hs.8705 ESTs 3.4 

107234 AA972447 Hs.268833 Homo sapiens mRNA; cDNA DKFZp434K087 (fr 3.4 

123110 AA4862S6 Hs.193510 EST 3.4 

131019 W2B614 chorionic somatomammotropin hormone 1 (p 3.4 

129079 AK0DQ157 Hs.108502 hypothetical protein FLJ20150 3.4 

111122 N63753 Hs.16492 DKFZP564G2022 protein 3.4 

103934 BE273111 Hs.134200 DKFZP564C186 protein 3.4 

12B571 A1885045 Hs.2115B6 phosphoinositide-3-kinase, regulatory su 3.4 

12325B AA430929 Hs.105274 ESTs, Weakly similar to RMS1_HUMAN REGUL 3.4 

128326 Z40313 063 30 Homo sapiens clone IMAGE:23371, mRNA seq 3.4 

120149 AA227609 Hs.94834 ESTs 3.4 

128817 BE395776 Hs.168640 ankylosis, progressive (mouse) homolog 3.4 

110679 AAQ047M Hs.t083t1 ESTs, Weakly similar to T00351 hypotheti 3.4 

122365 AA813546 Hs.99034 GTP-binding protein Rho7 3.4 

134455 BE378152 Hs33530 hypothetical protein 3.4 

115506 AB037766 Hs.452Q7 hypothetical protein KIAA1335 3.4 

134267 AJ174595 Hs.196209 RAE1 (RNA export 1, S.pombe) homolog 3.4 

106891 AA443164 Hs.23259 hypothetical protein FLJ13433 3.4 

105169 BE245294 Hs.180789 S164 protein 3.4 

120120 BE547267 Hs.59791 hypothetical protein MGC13153 3.4 

126638 AAB49257 Hs.86998 ESTs 3.4 

128531 H03721 Hs£3236 ribosomal protein S15a 3.3 

105650 AU049951 Hs.22370 Homo sapiens mRNA; cDNA DKFZp564O0122 (f 3.3 

10B859 AL121500 ESTs 3.3 

112276 R53442 Hs.26038 ESTs, Weakly similar to I38022 hypotheti 3.3 

125693 H23989 Hs.169743 Homo sapiens clone 25121 neuronal drad 3.3 

113474 R50752 Hs.33856 hypothetical protein MGC5297 3.3 

132994 AA112748 Hs.279905 clone HQ0310 PRO0310p1 3.3 

124691 RQ5835 Hs.110153 ESTs 3.3 

112511 AW970420 dynactin 2 (p50) 3.3 

132985 AL045579 Hs£2113 KIAA0717 protein 3.3 

125743 H17151 HaJ416 gb:ym37a05.r1 Soares infant brain 1NIB H 3.3 

133363 AI8662B6 Hs.71962 ESTs, Weakly similar to B36298 proline-r 3.3 

10790B AF087999 ESTs 3.3 

105312 BE613348 Hs.211579 melanoma cell adhesion molecule 3.3 

110256 H63947 Hs.237955 RAB7, member RAS oncogene family 3.3 

132073 AA295052 Hs3B516 Homo sapiens, clone MGC158B7, mRNA, com 3.3 

107090 AWfi092DB Hs.1B3297 DKFZP566F2124 protein 3.3 

105463 AAA25974 Hs.32646 hypothetical protein FLJ21901 3.3 

109592 AJ198D59 Hs.26370 ESTs 3.3 

113649 N94768 Hs.16400 ESTs, Weakly similar to KIAA1435 protein 3.3 

103998 AA24SB84 gb:k35l7.seqj= Human fetal heart, Lambda 3.3 

116439 AA251S94 Hs.43913 PIBF1 gene product 3.3 

129535 AA397972 Hs.112603 chimerin (chimaerin) 1 3.3 

113283 T66813 Hs.12947 EST 3.3 

128992 H04150 Hs.107708 ESTs 3.3 

133160 N54968 Hs.66309 hypothetical protein MGC11061 3.3 

134076 AF086215 Hs.789B0 gb:Homo sapiens full length insert cDNA 3.3 

128301 U9D552 Hs.284283 butyrophilin, subfamily 3, member A1 3.3 

127728 AW404061 protein kinase C, beta 1 3.3 

126516 R95872 Hs.117S72 chemokine binding protein 2 3.3 

127506 T61039 Hs.76067 ribosomal protein L10a 3.3 

104769 AA025887 Hs.293943 hypothetical protein MGC11266 3.3 

50 126666 AA648385 Hs.151999 ESTs 33 

130453 U80735 Hs.173854 PAX transcription activaSoo domain He 33 

107131 AW961605 Hs.21145 liypothetJcal protein RG0B3MD5,2 33 

130422 AW160S14 Hs.1 80034 cleavage sDmuleCan factor, 3* pre-RNA, 33 

121292 AA4O1807 0fczvG5f1U1 Soare3jotaUeti!8jNb2HFB_ 3.3 

55 123284 AA48898B Ks. 2 93796 ESTs 33 

130734 AW137091 H3.1BS24 KJAA1052 protein 33 

105400 AF198620 Hs. 10283 RNA binding rnofif pro tetn 8A 33 

105014 AA121123 Hs^69267 ESTs, Wealthy skrilar fe AF161361 1 KSPC0 33 

^ 114988 AA251089 ob3s04K&s1 NCLCGAPJ3CB1 Homo sapiens 33 

132867 AF226667 Hs.58553 CTP synthase II 3.3
124169 BE079334 Hs.271630 ESTs 3.3
114652 AI521936 Hs.107149 novel protein similar to archaeal, yeast 3.3
113376 AI799751 Hs3635 ESTs 3.3
111520 AI985359 Hs.301134 ESTs 3.3
121748 BE536911 Hs.234545 hypothetical protein NUF2R 3.3
106834 AU044182 Hs.28070 KIAA0753 gene product 3.3
128809 AA788242 Hs.80618 hypothetical protein 3.3
129619 AA209534 Hs.284243 tetraspan NET-6 protein 3.3
125770 AI292320 Hs.61361 heterogeneous nuclear ribonucleoprotein 3.3
116734 AV^00992 HsJ937S6 DKFZP686D2223 protein 3.3
100253 D38024 Hs.157425 double homeobox, 2 3.3
130999 AA326683 Hs.21992 likely ortholog of mouse variant polyade 3.3
114092 H31213 Hs.14825 ESTs, Weakly similar to KIAA1503 protein 3.3
1292M AA318224 Hs^!flB141 ESTs 3.3
119127 AA708035 Hs.12248 ESTs 3.3
123910 AA621262 Hs.179923 ESTs, Weakly similar to S65657 alpha-1C- 3.3
132664 AI740451 Hs.54542 ESTs 3.3
118397 BE139479 Hs.161492 ESTs 3.3
107003 AW13W37 Hs^4790 KIAA1573 protein 3.3
126735 M69113 Hs.225795 glutathione S-transferase pi 3.2
130847 AI672483 Hs.20220 l^sse protein 3.2
101186 AA020956 Hs.175861 core-binding factor, beta subunit 3.2
113628 T94318 Hs.17359 ESTs, Moderately similar to RL44_HUMAN 6 3.2




WO 03/042661 



PCT/US02/36810 






121782 
119863 
10S103 
123008 
133761 
110561 
115239 
117467 
12BB62 
129673 
105154 
12B722 
109966 
129315 
109517 



AA307279 
M66146 
H09103 

«M_014563 Hs.174038 

A1S31874 Hs.155140 



101754 

1103B4 

134118 

134669 

100780 

125728 

129794 

129056 

107128 

103038 

130553 

125384 

132305 

131136 

102983 

117639 

127070 

126153 

132676 

104946 

122110 

127705 

109442 

100474 

115536 

119651 

127211 

134964 

105551 

119750 

125580 

104590 

110724 

116429 

133915 

123721 



AB039670 
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K45282 
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AL157E1B 

BE561958 

AW954565 

AF161399 

A1769958 

ACQQ4770 

AA9269G0 

AF0B2649 

BE178S02 
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AA377165 
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HS.1DB336 
Ks.4756 
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Hs44344 
Hs.23413 
Hs.118638 
Hsj44B33 
Hs.146162 
Ha<40730 
H&261038 
H8.73B4B 
Hs.301240 

Hs.86999 
NMJW0699 H9-30D2BO 
AKD01463 Hs.62180 



129179 
108799 
108807 
133461 
132571 
134937 
125766 
112369 
127002 
128179 
117121 
126566 
120403 
127930 
114250 
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104558 
124777 
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123485 
109416 
132490 
115348 
117297 



AW452957 Hs.63346 Homo sapiens, clone MGC:15203, mRNA, com 3.2
AA081218 Hs.58608 Homo sapiens cDNA FLJ14206 fis, clone NT 3.2
BE620779 Hs.12094 mitochondrial ribosomal protein 130 3.2
AA620552 gkae5Bg1t.s1 Stratagene lung carcinoma 3.2
AF041430 Hs.75922 brain protein 13 3.2
AA379597 H 5^199 HSPC150 protein similar to ubiquitin-con 3.2
BE251328 Hs.73291 hypothetical protein FLJ10881 3.2
N29682 Hs.44071 ESTs, Weakly similar to ALU5_HUMAN ALU S 3.2
BE250742 Ks.106o73 eukaryotic translation initiation factor 3.2
Hs.1191 KIAA0073 protein 3.2
Hs.35947 methyl-CpG binding domain protein 4 3.2
Hs.11125 HSPC033 protein 3.2
Hs.30897 EST 3.2

spondyloepiphyseal dysplasia, late 3.2
casein kinase 2, alpha 1 polypeptide 3.2
ALEX1 protein 3.2
TIA1 cytotoxic granule-associated RNA-bi 3.2
ESTs 3.2
KIAA0116 protein 3.2
PR024B3 protein 3.2
immunoglobulin heavy constant mu 3.2
B-cell CLL/lymphoma 11B (zinc finger pro 3.2
hypothetical protein FLJ13433 3.2
ESTs, Weakly similar to ALUE_HUMAN !!!! 3.2
flap structure-specific endonuclease 1 3.2
CDC28 protein kinase 1 3.2
pituitary tumor-transforming 1 3.2
ESTs, Weakly similar to 178585 serine/th 3.2
hypothetical protein FLJ20534 3.2
KIAA1273 protein 3.2
non-metastatic cells 1, protein (NM23A) 3.2
ESTs 3.2
ESTs 3.2
ESTs 3.2
ESTs, Weakly similar to I38022 hypotheti 3.2
carcinoembryonic antigen-related cell ad 3.2
melanocortin 1 receptor (alpha melanocyt 3.2
gb:AJ003322 Selected chromosome 21 cDNA 3.2
ESTs, Weakly similar to S65657 alpha-1C- 3.2
amylase, alpha 2A; pancreatic 3.2
anillin (Drosophila Scraps homolog), act 3.2
small inducible cytokine subfamily E, me 3.2
hypothetical protein FLJ22004 3.2

hippocalcin-like protein 4 3.1
ribonuclease HI, large subunit 3.1
ESTs 3.1
Homo sapiens, clone IMAGE:3840937, mRNA, 3.1
nuclear receptor subfamily 1, group I, m 3.1
Homo sapiens cDNA FLJ13471 fis, clone PL 3.1
putative nucleotide binding protein, est 3.1
Homo sapiens cDNA FLJ14567 fis, clone TH 3.1
phosphoprotein associated with GEMs 3.1
ESTs 3.1
ESTs 3.1
ESTs 3.1
hypothetical protein FLJ20844 3.1
cytochrome P450, subfamily IIA (phenobar 3.1
suppressor of G2 allele of SKP1, S. cere 3.1

AI251449 Hs.171939 ESTs 3.1
BE174587 Hs.289721 growth arrest specific transcript 5 3.1
AW986243 Hs.4243 hypothetical protein FLH2050 3.1
AL353940 Hfe24S79 hypothetical protein DKFZp761P1010 3.1
AW293689 Hs.127116 ESTs 3.1
H95044 H&32138G EST 3.1
AF255303 Hs.112227 membrane-associated nucleic acid binding 3.1
ABOBQOB Hs.295352 Homo sapiens cDNA FLJ14459 fis, clone HE 3.1
AA809672 Hs.123304 ESTs 3.1
AI314699 Hs.13297 ESTs 3.1
AK001653 Hs.273344 DKFZP5640MB3 protein 3.1
AA21T777 gbsnCTdOZsl Stratagene muscle 937209 H 3.1
N52090 Hs.47420 EST 3.1
R56878 Hs.68959 hypothetical protein MGC4816 3.1
R11933 Hs.140237 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.1
N35568 Hs5245 hypothetical protein FLJ20543 3.1
AW297206 H&1S401B ESTs 3.1
AF109213 Hs.108787 phosphatidylinositol glycan, class N 3.1
AU17S54 Hs.11990& nucleolar protein NOP5/NOP58 3.1
AA120311 H3J9B79 ESTs 3.1
AI308376 HS.103M9 hypothetical protein DKFZp7BlDl1 2 3.1
BE268386 Hb.B5945 ESTs, Weakly similar to A46010 linked 3.1
NM_001290 Hs.4980 LIM domain binding 2 3.1
AA281562 H&-766 ESTs 3.1
AW779829 gb:hn88a05.x1 NOjCGARJQdH Homo sapien 3.1
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AW014396 AW769445 AJ58B869 AW33B296 AJ8 01 548 Al 367702 Al 8 88410 
C141B7 AA496211 D80456 AA484051 D80455 AA52B446 
AW862258L07615 

AA356764 NM_004867 AF038953 W16828 R50284 AI929589R01594 AA010474 W37917 N64212 M78175 R5751 1 AA329322 W77834 
M169Q69 AA285240 AA356670T93146 AA325779 AA357064AA328151 AA306339 AA306BQ1 N31260 AA30S202 AA312530 K242703 
AA364371 N42259 AL021766 H509B5 H3803B AI655S65 Al 565732 A1953952 AW972835 A1077300 A194B482 AI760966 AA64B335 W72197 
N32815 AA768073AW338181 AW001742AI819B96AI015668N58011 AA6775B4 Al 1 83750 AW1 0 55 1 4 W37875 AW301 679 AA775257 

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AA730357 All 2B690 AA63Q149 AW241515 AJ1470Q3 AH 83499 N9G923 A1S68582 AA354145 AJ 147297 BE550502 AIQ85777 AI7O90B7 
AA773947 AA469192 A1199289 A1127085 HB9232 N30719 AW28165 AW029469 AJ709314 AA70O834 AA778057 H89125 A1654762T93061 
N59622 AI360625 AA71S9B2 A1248242 AA233925 AW26251 2 AW43666S AA01Q378 AA4B4Q1 0 AW438790 N22875 AA4S3718 A11 B5407 
AA736720 AA664567 AA648672 AA503406 AA506123 AA503140 H46638 N76812 M730938 C60S48 AA483039 AA507705 AA48Q448 
Al76aS54AA656955 AA730Q03AA33O312AW021127 BE172C31 AW961 052 AA33S07Q AA648172 BE172892 AA174175 AA492D12 AA491S90 
AW118733 AA4fl4130 AA730783 AA730510 DB2577 D62B28 A1469294 AIBQ1343 N53447 AA740226 AI831983 AA51G146 D79799 A1758212 
AA48404G R36039 A103B772 A1039410 AAS2B387 AA730576 AA527231 AA74811 1 D62946 U3S439 AA721708 H22130 AA559229 H51620 
AA2B335S AA730970 RD1595 N72450 AA469123 AA5793B9 AA480477 AA283306 AA347822 AA328013 

AW24S508 A1278454 AI369343 AW291235 AI343946 AI3471B9 AA970152 AI27966B A1971B94 AA664359 AK91039 AAQB459A AAD81641 
AA316t81 BE439545 AC0WO53 AF1 86249 AW3861 01 AC004969 AA730199 AA0G2121 A16B6139 AH67942 AA809228 AI184070AI394674 
AW969977 AA032279 AVW79284 AA51 3174 AA888312 A1453179 AA4B3363 AA528432 AA579511 

AU077228 U61 145 AL044256 AA206185 AL044975 NMJ04456 X95S53 AW8S0905 AA332606 AW965453 BE299226 U52965 BE 2750 09 
AA428252 AA769212 AA312778 AA81 0901 AA46131 5 A! 9692BB AI953668 AW296248 AI291422 AA7Q4747 Z44716 BE 541 235 AW60B8O5 
AA5885B3 AW403788 AW«fl386 AA053521 T87571 AA581163 BE183241 T9790B RB9149 A1808196 BE61B521 AI970278 AI552680 AW130143 
AJ970272AW197254 BE550449 AI828403 AA010143AWOQ5107 AA251B1SAA80SB36 N53459 AAD55309 AA643678 A1345816AJ349B18 
AI635728 At B712SG AI018437 A126261 4 H75287 M033155 AI203388 AA205S02 AI475265 AW594186 AI696D59 AI33331 2 AW264710 AA876764 
AA864892 AA1731SS AE92Q66 T79837 W939B5 AA506427 AAB15135 AI376342 AA01O144 AI310421 AJ223350 AAG70576 K33926 T89364 
A10S0913 N55098 M101673 AA62S665 AA251B40 N68857 A18S0053 H05930 AI474199 AI43S166 AAS28B10N64452 A1205S38AA05531O 
BE244495AI365179T9953SAl2t)308BZ40549AW139858AA999527R91847AM^ AW0689B2H51748A4091291 H13030 AA4S9732 
T99430AI051764 R45991 

054746 AU077189 NMJW0729 R71188 H87563 HBO701 AW95S319 AA376289 R82442 K46452 R22481 R64323 AWD43GSQ L00354 AW005571 
H19813 A1745597 AI7892G2A1205962AW2O5841 RS4324AI201975 H46993 C18C67 R71189 N29729A179983B AJ091209 H67455R82443 
AW970996 AWQ72136 N40841 H13148 AI219052 N30191 T280Q7 AJ817428 AA503602 

AFO50147 NWL007015 A800BDOO AB0Q5999 BE21B114 A1703147AJ985891 BE2220M BE549774 BE32761B A1B51459 BE549987 AI657194 
BE222414 BHD 46984 AW470357 AW339338 C02525 AW590220 A1298082 AW2B6996 AI33313S AI867818 AA730514 BE348513 AA405500 
AW136179 AA327B34 BE32735D BE327366 

AWS69726 AA424fl23 AI2B6636 AJ271 571 AA4239B8 A1266634 AA7027aQ AA452037 AI424866 AA742931 M483361 

AA737106 N357G5 N35779 AFC86045 N301 52 AW374028 A147B237 AA9236B2 AI382236 AA846668 A1627389 AI371 709 H9B083 H95983 

H26576 H95984 AA745021 M639188 H26S75 

AF061573 NM.00259OAA78O13B AW135620 Alt97a84AA483348RB546B AW969724 H29216 
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Table 73 1 Sequences 

Seq ID NO: B1 DNA sequence
Nucleic Acid Accession #: NM_053056.1
Coding sequence: 210..1097

1 11 21 31 41 51

| | | | | |

CACAOGGACT ACAGGGGAGT TTTGTTCAAG TTGCAAAGTC CTGGAGCCTC CAGAGGGCTG 60 

. TCGGCGCAGT AGCAGCGAGC AGCAGAGTCC GCACGCTCCG GCGAGGGGCA GAAGAGOGCG 120 

10 AGGGAGCGCG GGGCAGCAGA AGCGAGAGCC GAGCGCGGAC CCAGCCAGGA CCCACAGCCC 180 

TCCCCAGCTG OCCAGGAAGA GCCCCAGCCA TGGAACACCA GCTCCTGTGC TGCOAAGTGG 240 

AAACCATCCG CCGCGCGTAC CCCGATGCCA ACCTGCTCAA CGACCGGGTG CTGCGGGCCA 300 

TGCTGAAGGC GGAGGAGACC TGCGCGCCCT CGGTGTCCTA CTTCAAATGT GTGCAGAAGG 360 

AGGTCCTGCC GTCCATGCGG AAGATCGTCG CCACCTGGAT GCTGGAGGTC TGCGAGGAAC 420 

15 AGAAGTGCGA GGAGGAGOTC TTCCCGCTGG CCATGAACTA CCTGGACOGC TTCCTGTCaC 480 

TGGAGCOCGT GAAAAAGAGC CGCCTOCAGC TGCTGGGGGC CACTTGCATG TTCGTGGCCT 540 

CTAAGATGAA GGAGACCATC CCCCTGACGG CCGAGAAGCP GTGCATCTAC ACCQACAACT 600 

CCATCCBGCC CGAGGAGCTG CTGCAAATGG AGCTGCTCCT GGTGAACAAG CTCAAGTGGA 660 

ACCTGOCCGC AATQAOCOCG CACGATTTCA TTGAACACIT CCTCTCCAAA ATGCCAfiAGG 720 

20 COGAGGAGAA CAAACAGATC ATCCGCAAAC ACGCGCAGAC CTTCGTTGCC CTCTGTGCCA 780 

CAGATGTGAA GTTCATTTCC AATCGGCCCT CCATGGTGGC AGCGGGGAGC GTGGTGGCCG 840 

CAGTGCAAGG CCTGAACCTG AGGAGCCCCA ACAACTTCCT GTCCTACTAC CGCCTCACAC 90 0 

GCTTCCTCTC CAGAGTGATC AAGTGTGACC CAGACTGCCT CCGGGCCTGC CAGGAGCAGA 960 

TCGAAGCCCT GCTGGAGTCA AGCCTGCGCC AGGCCCAGCA GAACATGGAC CCCAAGGCOG 1020 

25 CCGAGGAGGA GGAAGAQGAG GAGGAGGAGG TGGAOCTGGC TTGCACACCC ACCGACGTGC 1060 

GGGAJCGTGGA CATCTGAGGG CGCCAGGCAG GCGGGCGCCA CCGCCAOCCG CAGCGAGGGC 1140 

GOAGCCGGCC CCAjGGTGCTC CCCTGACAOT CCCTCCTCTC CGQAGCATTT TGATACCAGA 1200 

AGGGAAAGCT TCATTCTCCT TGTTGTTGGT TGTTTTTTCC TTTGCTCTTT CCCCCTTCCA 1260 

TCTCTGACTT AAGCAAAAGA AAAAGATTAC CCAAAAACTG TCTTTAAAAG AGAGAGAGAG 1320 

30 AAAAAAAAAA TAGTATTTGC ATAACCCTGA GCGGTGGGGG AGGAGGGTTG TGCTACAGAT 1380 

GATAGAQGAT TTTATAOOCC AATAATCAAC TCGTTTTTAT ATTAATGTAC TTGTTTCTCT 1440 

GTTGTAAGAA TAGGCATTAA CACAAAGGAG GCGTCTCGGG AGAGGATTAG GTTGCATCCT 1S00 

TTAOGTGTTT AAAAAAAAGC ATAAAAACAT TTTAAAAACA TAGAAAAATT CAGCAAACCA 1S60 

TTTTTAAAGT AGAAGAGGGT TTTAGGTAGA AAAACATATT CTTGTGCTTT TCCTGATAAA 1620 

35 GCACAGCTGT AGTGGGGTTC TAGGCATCTC TGTACTTTGC TTGCTCATAT GCATGTAGTC 1680 

ACTTTATAAG TCATTGTATG TTATTATATT CCGTAGGTAG ATGTGTAAOC TCTTCACCTT 1740 

ATTCATGGCT GAAGTCACCT CTTC3QTTACA GTAGCGTAGC CTGGCCGTGT GCATGTCCTT 1800 

TGCGCCTGTG ACCACCACCC CAACAAACCA TCCAGTGACA AACCATCCAG TQGAGGTTTG 1860 

TCGGGCACCA QCCAGCGTAG CAGGGTCGGG AAAGGCCACC TGTCOCACTC CTACGATACG 1920 

40 CTACXATAAA GAGAAGAOGA AATAGTGACA TAATATATTC TATTTTTATA CTCTTCCTAT I960 

TTTTGTAGTG AGCTGTTTAT GAGATGCTGG TTTTCTAOCC AACGGCCCTG CAGCCAGCTC 2040 

ACGTCCAGGT TCAACCCACA GCTACTTGGT TTGTGTTCTT CTTCATATTC TAAAACCATT 2100 

CCATTTCCAA GCACTTTCAG TCCAATAGGT GTAGGAAATA GCGCTGTTTT TGTTGTGTGT 2160 

GCAGGGAGGG CAGTTTTCTA ATGGAATGGT TTGGGAATAT CCATGTACTT GTTTGCAAGC 2220 

45 AGGACTTTGA GGCAAGTGTG GGCCACTGTG GTGGCAGTGG AGGTGGGGTG TTTGGGAGGC 22 BO 

TGCGTGOCAQ TCAAGAAGAA AAAGGTTTGC ATTCTCACAT TG CCAGG ATG ATAAQTTCCT 2340 

TTCCTTTTCT TTAAAGAAGT TGAAGTTTAG GAATOCTTTG GTGCCAACTG GTGTTTGAAA 2400 

GTAGGGACCr CAGAGGTTTA OCTAGAGAAC AGGTGGTTTT TAAGGGTTAT CTTAGATGTT 2460 

^ TCACACCGGA AGGTTTTTAA ACACTAAAAT? ATATAATTTA TAGTTAAGGC TAAAAAGTAT 2520 

50 ATTTATTGCA GAGGATGTTC ATAAGGCCAG TATGATTTAT AAATGCAATC TCOGCTTGAX 2580 

TTAAACACAC AGATACACAC ACACACACAC ACACACACAC AAACCTTCTG CCTTTGATGT 2640 

TACAGA.TTTA ATACAGTTTA TTTTTAAAGA TAGATOCTTT TATAGGTGAG AAAAAAACAA 2700 

TCTGGAAGAA AAAAAOCACA CAAAflACATT GATTCAGCCT GTTTGGCGTT TCCCAGAGTC 2760 

ATCTGATTGG ACAGGCATGG GTGCAAGGAA AATTAGGGTA CTCAACCTAA GTTCGGTTOC 2820 

55 GATGAATTCT TATCCCCTGC GCCTTGCTTT AAAAAACTtfA GTGACAAAAT AGACAATTTG 2880 

CACATCTTGG CTATGTAATT CTTGTAATTT TTATTTAGGA AOTGTTGAAG GGAGGTGGCA 2940 

AGAGTGTGGA GGCTGACGTG TGAGGGAGGA CAGQQGGGAG GAGGTGTGAG GAGGAGGCTC 3000 

GCGAGGGGAA GGGGOGGXGC CCACACOGGG GACAGGCCGC AGCTCCATTT TCTTATTGCG 3060 

CTGCTACCQT TGACTTCCAG GCACGGTTTG GAAATATTCA CATCGCTTCT GTGTATCTCT 3120 

60 TTCACATTGT TTGCTGCTAT TGGAGGATCA GTTTTTTGTT TTACAATGTC ATATACTGCC 3180 

ATGTACTAGT TTTAGXTTTC TCTTAGAACA TTGTATTACA GATGCCTTTT TTGTAGTTTT 3240 

TTTTTTTTTT ATGTGATCAA TTTTGACTTA ATGTQATTAC TGCTCTATTC CAAAAAGGTT 3300 

GCTGTTTCAG AATACCTCAT GCTTCACTTA GCCRTGGTGG ACCCAGCGGG CAGGTTCTGC 3360 

_ _ CTGCTTTGGC GGGCAOACAC GCGGGCGCGA TCCC3LCACAG GCTGGGGGGG GCCGGCCCCG 3420 

65 AGGCOGCGTG CGTGAGAACC GCGCOGGTGT CCCCAGAGAC CAGGCTGTGT CGCTCTTCTC 3480 

TTCCCTGCGC CTGTGA3X5CT GGGCACTTCA TCTGATCGGG GGCGTAGCAT CATAGTAGTT 3540 

TTTACAGCTG TGTTATTCTT TGCQTQTAGC TATGGAAGTT GCATAATtAT TATTATTATT 3600 

ATTATAACAA GTGTOTCTTA CGTGCCAOCA C3GGOGTTGTA CCTGTAGGAC TCTCATTCGG 3660 

GATGATTGGA ATAflCTTCTG GAATTTGTTC AAGmTTGGG TATOTTTAAT CTGTTATGTA 3720 

70 CTAGTGTTCT GTTTGTTATT GTTTXGTTAA TTACACCATA AIGCTAATTT AAAGAGACTC 3780 

CAAATCTCAA. TGAAGCCAGC TCACAGTGCT GTGTGCCCCG GTCACCTAOC AAGCTGCCGA 3840 

ACCAAAAGAA TTTGCACCCC GCTGOSGGCC CACGTGGTTG GGGCCCTGCC CTGGCAGGGT 3900 

CATCCTQTGC TCGGAGGCCA TCTCGGGCAC AGGCCCAOCC CGOCCCRCCC CTCCAGAACA 3960 

CGGCTCAOGC TTAOCTCAAC CATCCTGGCT GOSGCGTCTQ TCTGAACCAC GCGGGGGCCT 4020 

75 TGAGGGAOGC TTTGTCTGTC GTGATGGGGC AAGGGCACAA GTCCTGGATG TTGlGTGTAT 40 00 

CGAGAGGCCA AAGGCTGGTG GCAAGTGCAC GGGGCACAGC GGAGTCTGTC CTGTGACGGG 4140 

CAAGTCTQAG GGlCTGGGCG GCGGGCGGCT GGGTCTGTGC ATTTCTGGTT GCACCGCGGC 4200 

GCTTCCCAGC ACCAACATGT AACCGGCATG TTTCCAGCAG AAGACAAAAA GACAAACATG 4260 
AAAGTCTAGA AATAAAACTG GTAAAACCCC AAAAAAAAAA AAAAAA 

Seq ID NO: B2 Protein sequence 
Protein Accession #: NP_444284.1
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MEHQLLCCEV BTIRRAYPDA NIjLNDRVI/RA MUtAEETCAP SVSTCKCVQK EVUPSMRKIV €0 

ATWMI#EVCBE QKCEEEVPPL AMNTLDRFLS I,EPVKKSRI.Q LrLGATCMFVA SKMKETIPI/T 120 

AEKLCIYTDN SlRPEBLIiQM ELLIiVNKLKW NLAAMTPHDF IEHFL8BKPE AEENKQIIRJC 180 

5 HAQTPVAI.CA TDVKFISNEP SMVAAGSWA AVQGLWIiRSP WNFI*3YYELT RFLSRVIKCD 240 
PDCLIlACQEQ IEAIiLESSI*R QAQQBJMDPKA AESEEEEEEE VDIiACTPTDV RDVDI 
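
The listing gives the coding region of Seq ID NO: B1 as 1-based, inclusive nucleotide coordinates (210..1097) and the encoded protein as Seq ID NO: B2. As a rough cross-check of how the two relate, the following Python sketch extracts a coding region by such coordinates and translates it with the standard genetic code. The helper names `cds_length` and `translate_cds` and the short toy sequence are illustrative assumptions and are not part of the listing; the sketch also assumes a clean sequence containing only A, C, G and T.

```python
# Illustrative sketch only: helper names and the toy sequence are assumptions,
# not part of the sequence listing.

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range such as 210..1097."""
    return end - start + 1


def translate_cds(dna: str, start: int, end: int) -> str:
    """Translate the coding region dna[start..end] (1-based, inclusive).

    Whitespace (the 10-base grouping used in the listing) is removed before
    the coordinates are applied. Translation stops at the first stop codon,
    which is not included in the returned protein string.
    """
    bases = "".join(dna.split()).upper()
    cds = bases[start - 1:end]
    if len(cds) % 3 != 0:
        raise ValueError("coding region length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]  # KeyError on non-ACGT input
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # Toy example: two flanking bases, a 15-nt coding region, two more bases.
    toy = "CC ATGGAACACC AGTGA GG"
    print(translate_cds(toy, 3, 17))
    # Codons spanned by 210..1097, minus one for the stop codon.
    print(cds_length(210, 1097) // 3 - 1)
```

With these coordinates the coding region spans 888 nucleotides, that is 296 codons, or 295 residues plus a stop codon, which agrees with the length of the protein listed as Seq ID NO: B2.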



Seq ID NO: B3 DNA sequence
Nucleic Acid Accession #: XM_044166
Coding sequence: 1..1576

1 11 21 31 41 51

| | | | | |

CTTTTOTTTC GCCATGCCTA GTCTAGTGGT ATCTGGAATA ATGGAAAOAA ATGGGQGCTT fiO 

15 TGGAGAACTA QGATGTTTOG GGGGAAGOGC TAAGGAGCGA GGGCTGCTGG AAGACGAGCG 120 

CGCCCTTCAQ CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGGAGC CCCCCGCCCC 180 

CAGGGOGGGC GAGGACGGGG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCCCC 240 

CCCGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCGA CGACGGCCCC 300 

CGCAGCGCAG ACGCCCCAGC CCCCCACCGC CCCCAAAGGG BCGAGCGACG CCAAGCTCTG 360 

20 CGCTCTCTAC AAAGAGGCCG AGCTOOGOCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 420 

TCCGGTGCCC ACCTCCQAOC ACGTGGCCGA GATCGTGGGC AOGCAAGGCT GCAAGATTAA 480 

GGCCTTOAQG GCCAAGACCA ACAGCTACAT CAAGACACOG GTGAGGGGCG AGGAACCAGT 540 

GTTCATQGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC GGGCGGGAAA TCATCTCAGC 600 

AaCQSAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG CCTTTGGTGT 660 

25 GGCTCCTGCT CTGCCCGGOC AGGTGACCAT CCGTGTGCGQ GTGCCCTACC GOGTGGTGGG 720 

GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAQCAGCAAA CCAACACATA 7a 0 

CATTATCACA CCAAGCCGTG ACCGCGACCC CGTGTTCGAG ATCACGGGTG CCCCAGGCAA 840 

CGTGGAGCGT GCGGBCGAGG AGATCGAGAC GCACATCGCG GTGCGCACTG GCAAQATCCT 900 

CGAQTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC GACGCAGCAA TCGATAGCCG 960 

30 CTACTCCGAC GCCTGGCGGG TGCACCAQCC CGGCTGCAAG CCCCTCTCCA CCTTCOGGCA 1020 

GAACAGCCTG GGCTGCATCU QCGAGTGCGG AOTGGACTCT GGCTTTGAGG CCCCACGCCT 1DB0 

GGGTGAGCAQ QQCGGGGACT TTGGCTACGG OGGGTACCTC TTTCCGQGCT ATGGCGTGGG 1140 

CAAGCAGGAT GTGTACTACG GCGTGGOCGA GACTAGCCCC CCGCTGTGGG OGGGCCAGGA 1200 

GAACGCCAOG CCCACCTCCG TGCTCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 1260 

35 CAAGGCCCGC GCTGGGCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCOG CGGGACCOQA 1320 

GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 1380 

TGQGGGCGGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 1440 

AGTGACTGCC GCCCTTGTGC CCTGGGGACA CAACCTGTTC TGCATGGAGT GTGCAGTACG 1500 

CATC3GCGAG AGGAOGGACC CAGAGTGTCC CGTCTGCCAC ATCACAGCCA GBCAAGCCAT IS 60 

40 CCGAATATTC TOCPAAGCCC CGTGCCCCAT GGCTOCGGOG CCCACTCCAC TGGGCCCACC 1620 

CTGGACCIGT TTTCCACTAA AGCCTTTTGG AAAGCGGTGA TTTGAGGGGC AAGGTGCTTA 1680 

GAGATACTCG CTCGCTGGGG AAGGGGGGAG GGAGGCAGTG GTGGCTGGAG GGTGCGCCAC 1740 

TTTCAGAGCC TCTGGTCACC CTGl*CCTGGA AAOATTGGGA GGGGGCCAGA CTGAAAATTT 1800 

TACTAGAQTT ACAACTCTGA TACCTCAACA CACCCTTAAA TCTGGAAGCA GCTAAGAGAA 1860 

45 ACTTTTGTTT' TGCCAGAGGT GGCCACTAAG GCATTCTGAC GOCCTCTGCC CACCTCCCCC 1920 

GCTGTGTGTC ACTCCACCCC 2TCTTC0GAG GAGGGGGTGG GTAAAAGGGA GAGGGAGAAT 1980 

TACCACCTGT ATCTAGAGG7 QCTCTTTGCA ATCCCTAAGC CCTCTGGTCC TGACCTCCGA 2040 

CCTCCCAGCT CTGTCTTGXT GCTTGTCTTT GTCTTTCTTC CCTTOCCCCT GOCCCTGCCC 2100 

CIACCAGCOC AGCTTTGGGG ACACC»TCCT TCTGGGGAGA AOTAGGGGGA GGAATATTTG 2160 

50 OATOOTCCCT CCATTCCTCT TCAGGCATCT GGftGGCCCTC TCCCCCACTC CTOCAAAGAA 2220 

ACATCTCAAA TTATTGATGG AATGTATCCC CATTCTCAGT GAAAATGTGA GGAG5GOACT 2280 

AATACTGGGG TAAAGGGTCA AACCCGCACC TTCATCACTA TGGGCATTAT ATTTAGGGAG 2340 

TAGTTCTTGG GCXGGATTTT CTGGTTGTGG AAGTGQGGGC GCCAGAGTAG TQTOTCTGCT 2400 

ATTTAAAGGA GGAGGAAAGQ QCGTGAGGCA GGAGOAGAGA CTGGTGOTGG GAAGAGCTGC 2460 

55 TCCTCCCATG CAQTGCCCGA CTCCCTGCAC CCCTCTCAAC CTGACCTGAA OCTTTATTGA 2 520 

ATCCTTATTA GCTTGAATCC TTATTAGCTT QAAXCCTCCA TGCAAATCAT GGAGTCT3TG 2S80 

TCCCACCTGA TGTGQTTGAG GAGAAOCCAG GTCTTCAAAG AGGGGTCAGC CTGGGGCAAA 2640 

GCAGGACTGG GGGGAQGTGG GCAGCAGGGC CTATtTCTGAG AATCACATAT TGTTACAGGC 2700 

. _^ CTTGCACCCC CTTTGCTGCT TCCCTCCMC TCATTTGGQG CCGCCACCAG CTCTCCACCC 2760 

60 TCCTGGTTOC GCTGGOCGQG CCAAGAGAOG ATGGAGGGAT GGGAGTCGCA GGAGATC3CTT 2820 

GTAAATA3TG GGGTGGGACT GTTCTGAGTG ATCACCCGAG CACTTAAAGC TCCAGAGTCC 28B0 

CATTCTTCCT GGATGQAGGA GGTGGAQGTG CAGAGGGGAT TTCCTCCTCT CCTTCCTCCT 2940 

BTCOAGAATT AACACCTCTC CACAGGCTTC OCCTCCAGAA CACCAGCCAG GGAGGGGTGG 3000 

GGAAGGAGGT CACAGCCAAG AAAACTGGCC TGTGAGGACT TCCCTCCTTC CCGCCCATGT 3060 

65 GASCCATCCT GAGATGTCTG TACAATAGAA ACCAAACCAA ATGGGCACCC TCGGTTQCCG 3120 

GGGGGCAGGT GGGGAGGGGG GTGGGAAGAA GGQATGTCTG TCTGTC3TCC GCCTCCCCCT 3180 

CTCCACTCTT TACCCACAAA GGCAGAAGAC TGTTACACTA GGGGGCTCAG CAAATTCAAT 3240 

CCCACCCTTA CCAATTGAGC CAAACCTAGA AACAAACACA AAACAGGAAT AGTGAGAGAC 3300 

_ AAAATAGAGG AGAOAAAGAG AGCATGAGAG GGAGCGAGAC AGGOGACCAA CACAGftflGAG 3360 

70 AGAAAACAAA AATAGCAAAA AAAAAAAAAA AAAGCAGTTC TTXATAATTT AATATTCTAT 3420 
TTTAATAAAG GCGTTTTATTA CCATATAAAT QTAGCAAAGA ACCTGGGCTA ATATGAA 






Seq ID NO: B4 Protein sequence
Protein Accession #: XP_044166.2

1 11 21 31 41 51

| | | | | |

FCFAMPSLW 30IP4EHMGGF OHIiQCFGGfiA KDRGIiTjEDER ALQIAIjIJQLC IjUGLGEPPAP 60 

RA<3EDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKGASDAKLC 120 

80 AliYJCEAKIiRIj KGSSNTTECV PVPTfiSHVAE IVGHQGCKIK AUIAKTNTYI KIPVRGEEPV 180 

FMVTGRREDV ATARREII3A ABHF5MrRA9 RNKSGAAFGV APALPGQVTI EVRVFYRWG 240 

LWGPKCATI KRIQQQTKTY IITPSRDRDP YFEITGAPGH VEHAftKEIET HIAVKTGKII* 300 

EYNNEMDFIA GSPDAA1DSR Y3DAHRVHQP GCKPI*STFRQ N8IX3CIGECG VDSQFEAPRL 360 

GEQGGDPOTO GYliFPGTCVa KODVYYGVAE TSPPliWAGQE BATPTSVLF8 SASS833S3A 420 
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KARAGPPGAH E SPAT SAG PE LAGkPRRPPG EPLQGFSKLG GGGLRSPGGG 
VTAALVPCGH NLFCMECAVR ICERTDPECP VCHITATQAI EIFS 

Seq ID NO: B5 DNA sequence

Nucleic Acid Accession #: NM_000909.1

Coding sequence: 209..1363



480 



i 
1 

CATTCCCACC 
GTTCAGTTCA 
AATAAGAATA 
ATAATCTATA 
TCATTCAGTC 
TGATTGTCAT 
CATTCTTGGT 
GAGAAATOTT 
CAT GTGT C TC 
GATGTGTAAG 
GGTTCTCATT 
TAATAGACAT 



GTACAAAGAC 
TACCACTCTC 
CTTCAAGATA 
TAAGTACAGG 
ATTTGCAGTC 
GATCATTQCT 
ATCCACTTGT 
GCAGTTCTTC 
CATGTCCACG 
ATTTAAAAAA 
GGTCCOGGAT 
TCTCCCAAQG 
TTACTGCTTT 
TCTTCTGQAA 
AATATAAAGA 
GCTGACTTCA 
TTAGATTAGA 
TAGTGTGTTA 
GAAGTCATTC 
TTTT3TCACC 
TTAAAAATGA 
ACAGGAATOA 
AGAGCATTTT 
TATAT TTATT 
TTGTCAAGCT 
ACAAATATCG 
CATCTTTCAA 
AGGGAAAAAT 
GTGTGACTTG 
GTTAATGTGC 
AAACIATATT 
TGTTTGATTT 
GGAGTGTCTT 



11 

1 

CTTCCTTCTT 
AGGGAATGAA 
AGCTGAACAG 
ACAACCAAAC 
CACTCTAATT 
CTGCOCTTGG 
GTCTCTGBAA 
ACCAACATCC 
CCCTTTACAT 
TTGAATCCTT 
GCTGTGGAAC 
GCTTATGTAG 
ATCTACCAAG 
AAATAOGTGT 
CTCTTGGTGC 
TATATACGCC 
TCCAGTGAAA 
TGCTGGCTCC 
ACCTGCAACC 
GTCAACOCCA 
TTCAACTTTT 
ATGCACACAG 
ATCAACAACA 
GACATCTGTT 
AATGGGGTTG 
TGTTGTAGTT 
ATAGTTTTGA 
CT 1*1 TAT ACT 
GAAGTACCTG 
TTAGATTGTC 
CAATAGTAAC 
AGAAGTGGTT 
TTAAGGGAGG 
ATAAAAAGAC 
AGAGAGAAAG 
AGAGTAATTA 
TGAATTGATG 
TCTGGTCTAA 
TTTTCCATAC 
AGGAAGTAAC 
ACACAAAAAC 
TGGCGTCTTA 
CTAATTTCAT 
TTAAAGAACA 
TAAAAGGGCG 
TCAGTTTCAT 



21 
I 

TAATAAGCAG 



TTGACCTGCT 
CAATCAAAAT 
TCTCAGAGAA 
CCATGATATT 
ACCTGGCCTr 
TGATTGTGAA 
TTGTCTACAC 
TTGTGCAATG 
GACATCAGCT 
GTATTGCTGT 
TAATGACTGA 
GCTTTGATCA 
TGCAGTATTT 
TAAAAAGGAG 
CCAAAAGAAT 
CrrCTTACCAT 
ACAATCTGTT 
TATTTTATGG 
GTGATTTCCG 
ATGTTTCCAA 
ATGATGATAA 
TAAAAACAAG 
AAATCATTTG 
OTCATAATTA 

ccagacatct 
Gtacttattg 

CCATCCAATA 
AACAQATTGG 
AGTATGCAAA 
TQAGGTTTCT 
CTTTGATTTC 

atacttctca 

CAGCTCCCCA 
ATTTAATAAA 
GTCAAGAGAT 
TATGTACTCG 
AGCAGTGCCT 
ACCAAGGTAC 
TGCAGATACT 
TAAATAATGC 
GTATCTTGTA 
AGACATACTT 
GACATTTTAT 

tttttctcat 



31 

I 

GAGCGAAAAA 
TAATTTTGGT 
TTGAAGAAAC 
GAATTCAACA 
GAATGCCCAG 
TAOCTTAGCT 
GATCATAATC 
CCTTTCCTTC 
ATTAATGGAC 
TGTTTCAATC 
GATAATCAAC 
QATTTGGGTC 
TGAGCCGTTC 
ATTTCCATCQ 
TGGTCCACTT 
AAACAACATG 
CAATATCATG 
CTTTAACACT 
ATTCCTGCTC 
GTTCCTGAAC 
GTCTCQGGAT 
AACTTCTTTG 
TGAAAAAATC 
CACAACCTGC 
AAAATGACTA 
CATTTGGAAC 
TTGAAGTGCT 
GAATGAAATT 
CGGTCATTAG 
GOCATCCTTA 
AGCAGCATTC 
GTTTTTTGGT 
CTCCCGACTG 
GCTGCAAATA 
ACTTCAAAAC 
GTAAATTAGT 
TTTCCATTTT 
AAAGACTTTC 
ATATAGTGAC 
AATGTTAAAG 
TCATATAQCC 
ACTGTAAAGA 

atcatgattg 
caatgtatta 

TAAAATCAAT 
CCCATGACTT 



41 

I 

GACAAATTCC 
AAATGGATTC 
ATACTGTCCA 
TTATTTTCCC 
CTTCTGGCTT 
CTTGCTTATG 
ATCTTGAAAC 
TCAGACTTGC 
CACTGGGTCT 
ACTGTSTCCA 
CCTCOAGGGT 
CTTGCTGTGG 
CAAAATGTAA 
GACTCTCATA 
TGTTTTATAT 
ATGGACAAGA 
CTGCTCTCCA 
GTGTTTGATT 
TGCCAOCTCA 
AAAAACTTCC 
GATGATTATG 
AAGCAAGCAA 
TGAAACTACT 
AACATACTTT 
AGATTTTCTT 
AAAAGGTGTG 
TTTTGTGAAT 
TCTTTAAAGT 
ATTGGGTCAT 
CTTTATGATA 
AGGAGCOGAA 
GGTTTTTGTT 
ATTGTCACTT 
TTATGGAGAA 
CATTTTGGTA 
ATTGCTGCAA 
TTTTACAGAC 
CGGTTACAAT 
TGATTTTAAC 
GAATATTCAC 
CATTTTAACT 
TTACTGAATA 
AQCCTCAGAA 
TACAGATAAA 
ATTGTTTTTG 
CCCTCCGATG 



51 
I 

AAAGAGGATT 
CAATATCGGG 
TTTGTCTAAA 
AGGTTGAAAA 
TTGAAAATGA 
GAGCTGTOAT 
AAAAGGAGAT 
TTGTTGCCAT 
TTGGTGAGGC 
TTTTCrCTCT 
GGAGACCAAA 
CTTCTTCTTT 
CACTTGATGC 
GGTTGTCTTA 
TTATTTGCTA 
TGAGAGACAA 
TTGTGGTAGC 
GGAATCATCA 
CAGCAATGAT 
AGAGAGACTT 
AAACAATAGC 
GCCCAGTCGC 
TATAGCCTAT 
GATTACCTGT 
GTCTTGCTTT 
GGCTTTGGGG 
TTATGCATAT 
ATTACGATGC 
CTTGATTAGA 
BGCATCATTT 
AGATAGTCTT 
TGTTTTTTTT 
AAATCAAAAT 
TTGGGCACCC 
CCTGACAACA 
ATAGCTAAAT 
TGTTCAGTGT 
TTGTAGAAAC 
TTTCAATGTC 
TTTACCTAGC 
TGTATAAACT 
GTTGTGTCAT 
TCATTTGGAG 
GTATTACATG 
CTTTTTCTGA 
GT 



Seq ID NO: B6 Protein sequence
Protein Accession #: NP_000900.1



1 
1 

MHSTLFSQVE 
fcUIIMCQKB 
CVSITVfitFS 
DKP FQHVTXiD 
KHNMMDKMHD 
LFLLOHiTAM 



11 

1 

NESVESNFSE 
MRNVTHIIiIV 
LVLXAVBRHQ 
AYKDKYVCFTJ 
NKYRSSBTKR 
ISTCVHPISry 
AFKKINNNDD 



21 31 41 51 

1111 
KNAQliliAFBK" DDCHLPXAHI FTLALAYGAV III1GVSGHI1A 
NLSFSDLLVA IKCLPPTFVY TLMDHWVFGE AMCKLNPFVQ 
lil INPRGWRP NNRHAYVQIA VtWVLAVASS LPFL1YQVMT 
GFPSDSHRfcS YTTLLLVLQY FGPLCFIFIC YFKJYIRLKR 
INIMLLSIW AFAVCWLPLT IF8TVFDHNH QIIATOJHNL 
GFLNKNFQTCD I.QPFFNFCDF 
NEKI 



Seq ID NO: B7 DNA sequence

Nucleic Acid Accession #: NM_002590.2

Coding sequence: 204..3416



1 
I 

GCTTCTGTAA 
CTGGAGGCTC 
TTCCGAGAGC 
GGATCGCACT 
TCCCCTTGCA 
GATACAGCAC 
TGCATAT3AA 
TGCTCCGGGT 
AGCGGCTGTG 



11 
I 

GAGACGGAGA 
GGAGCTGCTA 
CTGAGACTGA 
TGAGGCTGGA 
QCTCTTCAGC 
CTTOGAGGAG 
AGTATCGGGT 
GCGCGAAGGC 
TGGCCAGGCC 



21 
I 

GGCGCAGAGT 
CCCGCAGACT 
CGCCCGACCT 
GGCATGAGTC 
CFCTGCTGGG 
GATGCCOCCG 
GACACAAGCT 
GACGGGCAGC 
CCGCAGTGCG 



31 
I 

GAGGGCGGGT 
TCTCOOGCAC 
GQAAACCAGA 
CTGTGAGGCG 
TQCICTCAGT 
GCACGGTCAT 
TCCGCCTGAX 
TGACCGTCOa 
TGCTQGCCTT 



41 
I 

CCGCGOGTCC 
AGGGCTCGCA 
GAAGACTTCC 
TTGGGGCAGC 
GGCCCAGAGC 
CGGGACCCTG 
GAAGCAATTC 
GGACGCCGGC 
OGATGTGGTC 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
650 
720 
7BQ 
840 
300 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
16B0 
1740 
1B00 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



60 
120 
180 
240 
300 
360 



51 
1 

TCAGAGCCCG 
AAGAGCGTGA 
TTAGCCTTTC 
CCCTGCCTTT 
AAAACAGTCC 
GCCGAGGAOC 
AACAGCTCTC 
CTQGACCGCG 
AGCTTCTCGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 






AGGAGCAGTT CCGGCTGGTG CACGTGOAGG TAGAGOTGAG GGACGTCAAC GACCAGGCGC GOO 
CGCGCTTOCC CAGGGCCCAG ATCCCGGTAG AOGTGTCCGA GGGTGCGGCA GTGGGCACGC 660 
GCATCCCCTT GGAGGTGCCG GTGGACGAGG ACGTGGGCGC CAACGGGCTG CAGACCGTGC 720 
GCCTGGCCGA GCCGCACAGC CCCTTTCGOG TGGAGCTGCA GACGCGAGCG GACGGCGCTC 780 
AGTGCGCAGA CCTGGTGCTG CTGCAGGAGC TGGAOOGCGA GAGCCAGGCC GCCTACAGCC 840 
TGGAGCTGGT GGCCCAGGAC GGCGGCCOCC OGCCGCGCTC CGCCACGGCT GCOCTCAGCG 900 
TGCGCGTCCT GGATGCGAAT GACCACAGCC OGGCCTTCCC GCAGGGCGCC GTGGCCGAAG 960 
TGGAGCTGGC GGAAGACGCG CCCGTGGGCT CCCTGCTTCT CGACCTGGAC GCAGCCGACC 102 0 
CCGACGAGGQ ACCTAACGGC GACGTGGTGT TCTCATTTGG CGCCCGCAJCC CCGCCGGAGG 1080 
CGCGCCGCCT CTTTCGGCTT GACCCGCGAT CAGGCCGCCT CACCX2TGGCC GGGCCCGTGG 1140 
ACTACGAGCG TCAGGACACC TACGAGCTGG ACGTGCGGGC GCAGGACCGC GGACCGGGGC 12 00 
CCCGCGCTGC CACCTGCAAG GTCATCGTGC GCATCCGAGA CGTCAATGAC AACGCACCCG 1260 
ACATGGCCAT CACCCCGCTG GCCGCCCCAG GCGCGCCGGC AACCTCACCC TTCGCCGCTG 1320 
CGGCCGCCGC CGCTGCACTC GGGGGAGCGG AOGCTAGCTC GCCGGCGGGA GCCGGGACGC 1380 
CGGAGGCTGG TGCCACTTCG CTGGTGCCGG AGGGGGCGGC GCGCGAGAfiC CTGGTGGCCC 1440 
TGGTCAGCAC CTCGGACAGG GACTCGGGCG CCAACGGGCA AGTGCGCTGC GCCCTCTATG 1500 
GGCACGAGCA CTTCCGGCTG CAGCCGGCCT ACGCGGGCAG CTACCTGGTG GTGAOCGCBG 1560 
CGTCGCTGGA CCGCGAACGC ATCGCCGAGT ACAACTTGAC GCTGGTGGCC BAGGATCGCG 1620 
GCGCGCCCCC GCTGCGCACA GTGCGGCCCT ACACGGTGCG TGTGGGCGAC GAGAACGACA 1680 
ACGCGCCGCT CTTCACGCGG CCSGGTCTATG AGGTGTCGGT GCGCGAGAAC AACCCGCCAG 1740 
GCGCCTACCT GGOCACJGGTG GCCGCCCGCG ACCGGGACCl 1 GGGCCGCAAC GGOCAGGTCA 1800 
CCTACCGGCT GCTGGAGGCC GAGGTGGGCC GCGCCGGGGG CGCCGTGTCC ACTTATGTCT 1860 
CGGTGGACCC AGCTACCGGA G C CAT CTACG CGCTQCGCAG CTTOGACTAT GAGACGCTGC 1920 
GCCAACTCGA CGTTCGCATC CAAGCTAGCG ACGGCGGCTC CCCTCAGCTT TCCAGCAGCG 1980 
CCCTAGTGCA AGTGCGCGTG CTGGACCAGA ACOACCATGC GCCAGTCCTG GTGCAOCCGG 2040 
CGCCAGCCAA T3GCTCCCTA GAAGTGGCGG TGCCTGGGCG CACCGCAAAG GACACGGTTG 2100 
TGGCCCGTGT GCAGGCCCGG GATGCAGACG AGGGAGCCAA CGGGGAGCTG GCGTICGAGC 2160 
TGCAGCAGCA GGAGCGGOGC GAAGCCTTCG CCATCGGCCG CCGCACGGGG GAGATACTGC 2220 
TCACCGGGGA CCTCTCGCAG GAGCCACCCG GTCGCGTGTT CAGGGCGCTC CTGGTCATAT 2280 
CCGACGGCX3G CGGTCCCCCG CTCACCACCA CCGGAACTGT CAGCTTCGTG GTAACAGCAG 2340 
GGGGCGGGCG TGGGCCGGCT GCGCCTGCCA GTGCAGGAAG CCCGGAGCGT TCCCGCCCGC 24 00 
CTGGCTCTCG GCTCGGGGTG TCCGGGTGSG TBCTGCAATG GGACACGCCG CTGATCGTCA 2460 
TCATCGTGCT GGCCGGGAGC TGCACGCTGC TGCTGGCCGC CATCATCGCC ATCGCCACCA 2520 
CCTGCAACOG CCGCAAGAAQ GAGGTGCGCA AAGGGGGGGC OCTCCGGGAA GAGOGGCCOG 2580 
GGGCGGCGGG CGGCGOAGCC TCGGCTCCCG GCTCCCCGGA GGAGGCCGCC CGGGGAGCCG 2640 
GGCOCAOGCC CAACATGTTC GACGTGCTCA CCTTCCCTGG CACCGGCAAA GCGCCCTTTG 2700 
GCAGCCCOGC GGCGGACGCG CCTCCGCCTG CGGTCGCCGC GGCCGAAGTG CCGGGCTCAG 2760 
AGGGC6QCAG CGCCACTGGG GAAAGOGCCT QTCACTTCGA GGGGCAGCAG OGGCTCCGCG 2620 
GCGCGCACGC CGAGCCCTAC GGTGCCTCOC CGGGTTTTGG AAAGGAGCCG GCGCCCCCTG 2880 
TGGCGGTGTG GAAAGGACAC TGCTTCAACA CCATTTCTGG CAGAGAAGCA GAGAAGTTCA 2940 
GCGGCAAAGA CAQCGGTAAA GGGGACAGTG ATTTCAAOGA CAGCGATTCC GACATCAGCG 3000 
GGGACGCTCT GAAAAAGGAT CTCATCAACC ACATGCAGAG TGGACTGTGG GCGTGCACCG 3060 
CTGAGTGTAA GATCCTGGGC CACTCTGACC GCTGCTGGAG CGCATCCTGC AGCGGGCCCA 3120 
ACGCACATCC ATCGCCTCAC CCAOCAGCCC AGATGTCAAC CTTCTGTAAG AGCACGTCAC 3180 
TGCCTCGGGA TCCTCTGCGC AGGGACAATT ACTACCAGGC CCAGCTGCCC AAGAGAGTGG 3240 
GGCTQCAGAG CGTCTATGAG AAAOTACPGC ACAGAGACTA TGACAGGACA GTCACTCTGC 3300 
TCTCOCCTOC CCGTCCAGGG AGGCTCCCAG ACCTGCAGGA GATTGGAGTA CCCCTCTACC 3360 
AGTCCCCTCC TGGCAGGTAC CTGTCCCCGA AGAAGGGAGC CAATGAAAAT GTGTAATCCC 3420 
ATGCTGCATG TCTTCACACA TATACAGGTC ACTCCGAGAA GCCCCTAAGT TATTGACCGG 3480 
TTTCAGTGTr GTATATATAA ATATGCAAGA TGTGCCTTAC AATGAAGTTG TTGGAA0CTA 3S40 
TTTCCAATCA CATTGGTACT GTTTGGTATT TGCAAACAAA AATGTAGTTA ATGTAATTTT 3600 
TATGAAATGT GTGCAGTATT TAATTTTTCT TATGC1ATTG ACTTTGATTT CAATTQCGGC 3660 
TTGCCATTTT CTTAGTGTTT TTAACCTGIA CATTGTGTAA TGTAATGTTT GTATATAATG 3720 
AAATTTTGTT ATATTTTTAT ATAATAAAAG CTAAAGTGGA AQTTATTGCG AAAGGAACTG 3780 
TCTGTAAGAC AAAAAACAAA ACATGTTGGA ATTACTTAAT TGAAATTTAT CTTTCACCTG 3840 
AAACAACCTA GTOTTTGAGA AATTTTGCCT TGCCAAGTAT AACTTGTATA TCTTGACTCT 3900 
GTGGTAGATT TCAAGTTCAA TGTTATTTAA TTACA^TTGG TTTTCCGTAA ACCGTGTCAC 3960 
TTAtTAAGCAC AGTAATAAAA GATTTGTCAT GTGTTTGAAG AAAAAAAAAA AAAAAAAAAA 4020 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 



Seq ID NO: B8 Protein sequence
Protein Accession #: NP_002581.2

1 11 21 31 41 51

| | | | | |

WSFTORWGSP CLPPLQLFSb CWVLSVAQSK TVRYfiTFEED APGTVlGTLA EDLHMKVSGD 60 

TSFRLMKQFSr 3SLLKVREGD GQLTVGDAGL DRKRLCGQAP QCVLAFDWS FBQBQFRLVH 120 

VEVEVEBVMD HAPRFPRAQI PVEVSEGAAV GTRIPLEVPV DBDVGAKGLQ TVRLAEFHSP 180 

FKVELQTRAD GAQCADLVLX> QELDRESQAA YSLELVAQDG GfiPPRBATAA LSVEVLDAND 240 

HSPAFPQGAV AEVKLAEDAP VGSLLLDLDA ADPDE3HIGD WFAFGARTP PEAHRU?RXiD 300 

P2SGRUELAG PVDYEHQDTY ELDVRAQDRG PGPHAATCKV JVRIMDVNEN APJUAITPLA 360 

APGAPATSPF AAAAAAAALG GADASSPAGA GTPEAGATSIj VPEGAARESL VALVSTSDRD 420 

SGAMGQVaCA LYQHEKFRLQ PAYAGSTIjW TAASLDRERI AKHJIiTLVAE URGAPPLHTV 460 

SPYTVRVGDK NDNAPUTRP VYEV6VHENN PPGAYIiATVA ARDRDLGRHG QVTYRUrEAE 540 

VGRAGGAVST YVSVDPATGA 1YALRSFDYE TLRQLDVRIQ ASDGGSPQL8 S9ALVQVRVL 600 

EQMPHAPVLV HPAPANGSLS VAVPGRTAKD TWARVQARD ADEGANGEIA FBLQQQBPRB 660 

APAIGHRTGE ITJiTGDLSQE PPGRVFRAMj VIGDGGHPPTj TTTATVSFW TAGGGRGPAA 720 

PA3AGSPERS RPPG9HLGVS GSVLQHDTPIl IVirVLAGSC TLI»I»AAIIAl ATTCNRRKKE 7 BO 

VRKGGAIiRSE RPGAAGGGA& APG&PEEAAR GAGPRPNMFD VLTFPGTGKA PFGSPAADAP 840 

PPAVAAAEVP GSEGGSAIGH 9ACHFEGQQR LSGAHAEPYG ASPGFGKBPA PPVAVWKGHS 900 

FMTISGREAE KFfiGKDSQKG DSDFNDSDfiD ISGDAL1CKDI* XMHMQ3GI.WA CTAECKILGH 960 

SDRCHSPSCS GPNAHPSPHP PAQKSTFCKS T9LPRDPLRR DNYYQAQLPK TVGLQ8VYEK 1020 
VLHRDYDRTV TUiSPPRPGR LPOLQEIGVP LYQSPPGRYI> SPKKGANENV 



WO 03/042661 



PCT/US02/36810

Seq ID NO: B9 DNA sequence

Nucleic Acid Accession #: AL121939.12

Coding sequence: 185..1426

1 11 21 31 41 51

| | | | | |

AGCGGAGTST CftGTGGCGAG CTCCGGGTGC TGTGGCCCGG CCTTGGCGGG GCGGCCTCCG 60 

GCTCAGGCTG GCTGAGAQQC TCCCAGCTGC AGOGTCCCCG CCCGCCTCCT GGGGAGCTCT 120 

GATCTCAGCT GACAGTGCCC TCGGGGACCA AACAAGCCTG GCAGGACAAA ATTAGAAGAT 180 

CAAAATGGAA AATATGCTGC TTTGGTTGAT ATTTTTCACC CCTGGGTGGA CCCTCATTGA 240 

TGGATCTGAA ATGGAATGGG ATTTTATGTG GCACTTGAGA AAGGTACCCC GGATTGTCAG 300 

TGAAAOGACT TTCCATCTCA CCAGCCCCGC ATTTGAGGCA GATGCTAAGA TGATGGTAAA 360 

TACAGTGTGT GGCATCGAAT GCCAGAAAGA ACTOCCAACT CCCAGCCTTT CTGAATTGGA 420 

GGATTATCTT TCCTATGAOA CTGTCTTTGA QAATGGCACC CQAACCTTAA CCAGGGTGAA 480 

AGTTCAAGAT TTGGTTCTTG AGCCOACTCA AAATATCACC ACAAAGGGAG TATCTCTTAG 540 

GAGAAAGAGA CAGGTGTATG GCACCGACAG CAGGTTCAGC ATOTGGACA AAAGGTTCTT 600 

AACCAATTTC CCTTTCAGCA CAGCTGXGAA GCTTTCCAGG GGCTGTAGTG GCATTCTCAT 660 

TTCCOCtCAG CATGTTCTAA CTGCTGCCCA CTGTGTTCAT GATGGAAAGG ACTATGTCAA 720 

AGGGAGTAAA AAGCTAAGGG TAGGGTTGTT GAAGATGAGG AATAAAAGTG GAGGCAAGAA 780 

ACGTCGAGGT TCTAAGAOGA GCAGGAGAGA AGCTAGTGGT GGTGACCAAA GAGAGGGTAC 840 

CAGAGAGCAT CTGCAGGAGA GAGCGAAGGG TGGGAGAM3A AGAAAAAAAT CTGGCCGGGG 900 

TCAGAGGATT GCGGAAGGGA GGCCTTCCTT TCAGTGGACC CGGGTCAAGA ATACCCACAT 960 

TCCGAAGGGC TGGGCACGAG GAGGCA1GGG GGAGGCTACC TTGGACTATG ACTATGCTCT 1020 

TCTGGAGCTG AAQCGTGCXC ACAAAAAGAA ATACATGGAA CTTGGAATCA GCCCAACGAT 1080 

CAAGAAAATG CCTGGTGGAA TGATCCACTT CTCAGGATTT GATAACGATA GGGCTGATCA 1140 

GTTGGTCTAT GGGTTTTGCA GTGTGTCYGA CGAATCCAAT GAICTCCTTT ACCAATAYTG 1200 

□GATQCTtSAG TCGGGCTCCA CCGGTTCGGG GGTCTATCTG OGTCTGAAAG ATCCAGACAA 1260 

AAAGAATTGG AAGCGCAAAA TCATTGOGGT CTACTCAOGCr CACCAGTGGG TGGATCGTCCA 1320 

CGGGGTTCAG AAGGACTACA ACGTTGCTQT TCGCATCACT OCCCTAAAAT ACGCCCASAT 1380 

TTGCCTCTGG ATTCACGGGA ACGATGCCAA TTGTGCTTAC GGCTAACAGA GACCTGAAAC 1440 

AGGGCGGTGT ATCATCTAAA TCACAGAOAA AACCAGCTCT GCTTACCGTA GTGAGATCAC 1500 

TTCATAGGTT ATGCCTGGAC TTGAACTCTG TCAATAGCAT TTCHACATTT TTCAAAATCA 1560 

GGAGATTTTC GTCCATTTAA AAAATGTATA GG1GCAGATA TTGAAACTAG GTGGGCRCTT 1620 

CAATGCCAAG TATATACTCT TCTTTACATG GTQATGAGTT TCATTTOTAG AAAAATTTTG 1680 

TTGCCTTCTT AAAAATTAGA CACACTTTAA ACCTTCAAAC AaGTATTATA AAXAACATGT 1740 

GACTCCTTAA TGGACTTATT CTCAGGGTCC TACTCTAAGA AGAATCTAAT AGGATGCTGG 1800 

TTGTGTATTA AATGTGAAAT YGCATAQATA AAGGTAGATG GTAAAGCAAT TAGTATCAGA 1860 

ATAGAGACAG AAAGTTACAA CA£AGTTTGT ACTACTCTGA GATGGAYCCA TTGAGCTCAT 1920 

GCCCTCAATG TTTATATTGr GTTATCTGTT GGGTCTGGGA CASCTTAGTTT AGTTTTTTTG 1980 

AAGAATTACA AATCAGAAGA AAAAGCAAGC ATTATAAACA AAACTAATAA CTGTTTTACT 2040 

GCTTTAAGAA ATAACAATTA CAATGTGTAT TATTTAAAAA TGGGAGAAAT AGTTTQTTCT 2100 

ATGAAATAAA CCTAGTTTAG AAATAGGGAA GCTGAGACAT TTTAAGATCT CAAGTTTTTA 2160 

TTTAACTAAT ACTCAAAATA TGGACTTTTC ATGTATGCAT AGGGAAGACA CTTCACAAAT 2220 

TATGAATGAT CATGTGTTGA AAGCGACATT ATTTTA1GCT ATACATTCTA TGTATGAGGT 2280 

GCTACATTTT TAGGACAAAG AATTCTGTAA TCTTTTTCAA GAAAGAGXCX TTTTCTCCTT 2340 

GACAAAATCC AGCTTTTGTA TGAGGACTAT AGGGTGAATT CTCTGATTAG TAATTTTAGA 2400 

TATGTCCTTT CCTAAAAATG AATAAAATTT ATGAATATGA CTTAAAAAAA AAAKWOGACG 2460 

CGGGGQCGAA TTTAGTAGTA GTAGTCGACC CGGGAATTCC GGACCGGTAC C1GCAGGCGT 2520 
ACCAGC 

Seq ID NO: B10 Protein sequence
Protein Accession #: CAC3SG71.1

1 11 21 31 41 51

| | | | | |

MEKKLLWI*IF FTPGWTItlEG S324EWDFMWH LEKVPRTVSK RTPHLTSPAP EADAKMMWJT 60
VCGIEOQKBIi PTPSLSBLBD YL3YETVFEN GTRTOWEVKV QDLVI*BPTQN XTTKGVSVfiR 120
KRQVYGTDSR PSIEiDKRFIiT EJfPFSTAVKli STGCSGIL3S V QKVXiTAAH G VHDGKDTVKG 180
SKKKRVQLLK MRNKSGGKKR XGSKR&HKBA SGGDQRSGTH EHLREBAXGG RHEKKSGRGQ 240
RIAKGRPSPQ WTRVKNTMIP KGnARGGNS} ATLDYDYALXt EIiKRflHKKKY MELQIQPTIK 300
HMPGGMIHF8 GFDNDRADQEi VYHFCSVSDE SNDLLYQYCD AESGSTGSGV YLRLKDPDKK 360
NWKRKIIAVY GGHQWVDVHG VQKDYNVAVR ITPLKYAQIC LWIUGNDANC A7G



Seq ID NO: B11 DNA sequence

Nucleic Acid Accession #: NM_002035.1

Coding sequence: 108..1106



1 
I 

AGGCGOCCGC 
CCGGCGGCCC 
CTGCCGCCTT 
CCAAGCCCCT 
GGAAGTGCAT 
ATOAGGATAA 
AGGTGGTGCT 
TAAAACAAGC 
CAGTGTCAGG 
ATTACCTGGG 
TGGGCftGGAT 
CCTACTCTGC 
AOCCATATAA 
C0GAAOAAAA 
GCAAACCAGA 



11 
I 

CCGCCGCGCG 
CCGCC5CGGCC 
CCTCGTGGCC 
CGCOCTGCCC 
TGCTATOGAG 
GCTGCTOCA& 
TTGCATATCA 
ACAGGAGAAA 
AAAATTTGAA 
CAGCGTGTAC 
CGTGTTTGTG 
ATCCAAGTTT 
TGTCTACATC 
CAGAACAAAG 




TTCGTGCTGC 
GGGGCGCATG 
TGCTATAAAC 
GCAAAGAAAO 



CTGGGTCCAG 
GATCTTGAAG 
CCCAGCC5GGG 
TCCTCCCAQG 
GCCATAAGGG 



31 
I 

CT0GOCGCAG 
CCTGQCCCCG 
TGCTGTAOVT 
TGGTGGTTAC 
AAGGAGCTTI 
AAATTGAAAT 
CTCAAGACTA 
TGGACATGCT 
TTAGTACCTT 
GGGTGATCAC 
CAGGACftGrr 
GATTGGCAGA 



41 
I 

OCCAGCCCXG 



GGTGTCTCCG 
AGQAGCTTCC 
TATAACTCTG 
GCACTCTATT 
TAACCAAGTA 
GGTAAATTGT 
TGAAAOGTTA 
C3W0CATGAAG 
GGGATTATTC 



CCITTGGAGA 
AAACAAATTG 



CTCGACTTAT 
TTAAAGAXGC 



CACAGACACA 
IXCAGAGACC 
CATACAAGGA 



51 
\ 

OacGCSCTTGC 
CTGCTGCTGG 
CTCATCAGCC 
AGTGGCATCG 
GTTGCACGAA 
AATGACAAAC 
GAGAATGICA 
GCAGGAATGG 
ATGAGCATCA 
GAGCGGGGGG 
GGTTTCACAG 
ATGGAGGTGA 
OCTGGCTTTG 
AC3VTCTGTGT 
AATTTCAACA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 






GTTCCCTTGG CTCAGATGGG TACATGCTCT CGGCCCTGAC CTGTGGGAT3 GCTCCAGTAA 960 

CTTCTATTAC TGAGGGQCTC CAGCAGGTGG TCACCATGGG CCTTTTCCGC ACTATTGCTT 1020 

TGTTTTACCT TQGAAGTTTT GACAGCATAG TTCGTCGCTG CATGATGCAG AGRGAAAAAT 1080 

CTGAAAATGC AGACAAAACT GCCTAATCTT CTTACCCCTT GGAAGAAGAC TGTTTCCAAA 1140 

TAATTTGAAC AGCTTGCTGC TAAATGGGAC OCAATTTTTG GCCTATAGAC ACTTATGTAT 1200 

TGTTTTCGAA TACGTCAGAT T6GACCAGTG CTCTTCAGGA ATGTGGCTBC AAGCAAGGGG 1260 

CTAGAAGTTC ACCTCCTGAC AGTATTATTA ATACTATGCA AATATGGAAT AGGAGACCAT 1320 

TTGATTTTCT AGGCTTTGTG GTAGAGAGGT GAAGGTATGA GAATTAATAG CGTGTGAACA 1380 

AAGTAAAGAA CAGGATTCCA GAATGATCAT TAAATTTGTT TCTATTTATT CTTTTTTGCC 1440 

CCCCTAOAGA TTAAGTCCAG AAATGXACTT TCTGGCACAT AAAGAAATCT TGAGGACTTT 1500 

GTTTAAACCT TCCATAAAAA AACAATTTTC GGTTTCTCGG GTTCTCTCTC TCTGTCTCTC 1560 

TGTCTCTCTG TCTCTCTGTC TCTCTGTCTC TCTCTCTCTC TCTCTTTCTT TCTTTGTGTA 1620 

rTTTATTCAA GATGAGTTGG ACOCATTGCC AGTGAGTCTG AATGTCACTG ACAGCOCTGT 1680 

GTTGTOCTCA GGACTCACTC XGCTGCTGGT QGAAACTCAT GGCtTCTCTC TCTCTTTGAT 1740 

CCCATAAAGC TACOAGGGGG ACGGGAOAGG GCAGTGCAAT GGGAAGTAAA GAGATATTTT 1800 

CCAGrAQGAA AAGCAATGCT TTCTTGTCTT TAGACTCAAA TGCTTAGGGA ACGTTTCATT 1860 

TCTCATTCAT GOGGAAAGGC AGCCTCCTTA AAT3TTTTCT GAAGAGCGGT AAAATCTAGA 1920 

AGCTTAAGAA TTTACAGTTC CTTCAATAAC CATGATGACC TGAAGTTCAC CTATCCCATT 1980 

TTAGCATCTA CTTGTTTTTC CCATCTCTTC CTTTCCAATT TTGCTTATAC TGCTGXAATA 2040 

TTTTTGTAAA AAAAAAAAAA AAGGAAAAAA AAGACCAGCT AAAATTTTCG ACTTQACTTT 2100 

TTAACTTAAC TCATGAATTA ATTAAAGCAA ATGAAAAAAT TAAAAAOTGT GACTTTTTCT 2160 

CGGAGCATAT ATGTAGCTTT TAQGAAAGGC TGATGATGGT ATAAAGTTTG CTCATTAAGA 2220 
AAAAAAGACA AGGCTGATTT TGAAGAGAGT TGCTTTTGAA ATAAAATGAT CA 

Seq ID NO: B12 Protein sequence
Protein Accession #: NP_002026.1

1 11 21 31 41 51

| | | | | |

MLLLAAAFXjV AFVIiLIiYMVS P1»I3PKPLAL PGAHWVTGG S9GIOKC1AI ECYKQGAFIT 60
IiVARMEDKLIi QAKKEIEMHS IMDKQWLCI SVDVSQDYNQ VENVIKQAQE KLGPVDMLVN 120
CAGWAVSGKF EDLBVSTFER LMSINXLGSV YPfiRAVITTW XERRVGRIVF VSSQAGQLGL 180
FGFTAYfiAfiK FA I RQEAERIj QMEVXPYNVY ITVAYPPDTD TPGFAEENRT KPLBTSLISE 240
TTSVCKPEQV AKQIVKDAIQ GNFNSSLGSD GXMIiBAIiTCG MAFVTSITEG LQQWTMGIjF 300
RTIALiFYlrGS PDSXVHRCMM QREKSENADK TA

Seq ID NO: B13 DNA sequence
Nucleic Acid Accession #: CAT cluster

1 11 21 31 41 51

| | | | | |

CTTGGGATGC ATTATATATT GCATTATATT TGCCGGTAAA ACTCGGTTAA TTTTTAAAAT 60
CGGCAAAATA TTGGTGCCTT TCCCGAAATT TGGTCCOGGG CTCCCTTATA GGATAATTGG 120
TTTGGATTOG ITAAGTCCAA TTATTAAATG CTGCGGTTTC AAATTCCCAG CTGGAAGGAC 180
CAOCCATTTA AAAACTTCAS AAGGCRGGAT CCTGCTCAAT TTATAAGGCT TTGGAAAATT 240
CCAGGCATTQ GTTTGACATA TTTCCAGAGC TCAAATCTGC AGTGTTCCAC ATGCACATAC 300
AAGATCCAGA GTCTCATGTT AAAATCACTT ACATACCCAQ AAAGACCACC ACTXTGCAGG 360
TATAATATTG TACTTAAAGA CAGAGAGGAA GTGTTTCTTA ATCCaAACAC ATGTACACCA 420
AAGAACACAT AAGATGCCTT CTTCCATCAA ATGCACTTGC TTGTGAATTA ATGGACTTGT 480
AAATGAAACA ATGCAATCAG TCTTTTATAA TGCACTGTTC AATTTGAGAT TCAAOTATTT 540
CTATTTCTTG GAAAAAATTT TAAGAATCAA AAATAAAGAA AATAAAAAGT GCATACAGTT 600
AAACATTCCA AAAAAAAAAA AA



Seq ID NO: B14 DNA sequence

Nucleic Acid Accession #: XM_086767.1
Coding sequence: 276..611

1 11 21 31 41 51

| | | | | |

CTTGTTCATG GCATCTTTAG AAACAAACTG CAATTTTATT TCATTTCCTT GTCGTTCATA 60 

CAAAGATTAC AAGACTAGCT TATGTGTGGA CTGTGACTGr TTTAAGGAAA AATCATGTCC 120 

TCGGCTGGGT TATCAAGCCA AGCTATTTAA AGGTGTTTTA AAAGAAAGGA TGGAAGGAAG 180 

ACCTCTTAGG ACCACTGTQT TTTTGGATAC AAGTGGTACA TATCCATTCT GTAGCCXATT 240 

ATTTTGTTCX CAGTATAATT GTTCCAGATA AAACTATGAT GGATGGCTCG TTTTCATTTA 300 

AATTATTAAA TCAGCTTGGA ATGATTGAAG AGCCAAGGCT TTATGAAAAG AACAAACCAT 360 

TTTATAAACT TCAAGAAGTC AAGATTCTTG CTCAATTTlA TAATGACTTT GTAAATATTT 420 

CAAGCATTGG TTTOACATAT TTCCAGAGCT CAAATCTGCA GTGTTCCACA TGCACATACA 480 

AGATCCAGAG ACTCATGTTA AAATCACTTA CATACCCAGA AAGACCACCA CTTTGCAGGT 540 

ATAATATTGT ACTTAAAGAC AGAGAGGAAG TGTTTCTTAA TCCAAACACA TGTACACCAA 600 

AGAACACATA AGtATGCCXTG TTCCATCAAA TGCACTTGCT TGTGAATTAA TGGACTTGTA 660 

AATGAAACAA TGCAATCAGT CTTtUTATAAT GCACTGTTCA ATTTGAGATT CAAGTATTTC 720 

TATTTCTTGG AAAAAATTTT AAGAATCAAA AATAAAGAAA ATAAAAAATQ CATACAGTTA 780 
AACATTCC 






Seq ID NO: B15 Protein sequence 
Protein Accession #: XP_086767.1



1 11 21 31 41 51 

| | | | | |

MMDGSFeFKL LNQIjGMIEEP RIiYBKNKPFY KLQEVKILAD FYWDFVNIS8 IGLTYFQSBN 

LQC8TCTYKI QRLKUCSLtTY PERPPIiCRYN IVLKDRBEVF LNPNTCTPKN T 

Seq ID NO: B16 DNA sequence 




Nucleic Acid Accession #: CAT cluster

1 11 21 31 41 51

| | | | | |

TTTTTTTTTT TTTTTTTTTT TTAAATGCAA AGGAGCAACA AAATTTATTG ACTGAATTAA 60 

ACACAACAGT AAAATGGCAG TGTTGTAATT TCATTTTCAG ATGTTTGAAT GGAACAAGAA 120 

AAGTGCTATT AGCCCAAGCT TGTTACATTC ATTAAAAGAG TGACTATCAA AAACAGCAAC 180 

ATGCACAATG GTACATATGC ACAAAATGGA ATTATATCAA CAAATATACA AAATACCCAA 240 

AATAAAATAT TTACAGGTTT AAAAATATAA ACATTGATTC CTCTATCCCA TTAAACCATT 300 

GOAGTGGAGA AAGGAGGAAA GACCCTATTG CTATTTAGAA TCCTTTTTAA AACAAGTTTT 360 

TAAAACATAG AATTAGTTCT AGGAGACAAT TTTTGATGTT TTTCAGGQGT TTAACATTCT 420 

ATTATAAAAA TAACATCTAT AAACOTACTA ACAATTTTCC TCCTGTGCAC AAAAATAATA 480 

CTGOCAAAAC CTGTCCTCAA AGACATGCCT OACTTTCAGQ AAAGCTAATT ATGGAAATGG 540 

ABTTTCTOBT TTGGGTTATC TTTGTTACTA TTTTCAAATA ACCAGCAACT CCCTATATTA 600 

CACTGAGATA CTTTATATAA ATAACGTGGG CGAAACCTGA AGTTCACAAT GAGCCTGCTA 660 

GQTAGCTGGT GTCAAGTACA AATGATAGGA ATTQACTTTG CCAACAACAT CAAAAGCATT 720 

TTCCCTGATA TTCCTGATAG ACCTACCACT ATCAGATCCT CCATATTCAA TAAGATTTAT 780 

CCTGGAAGCA ATGAAAATGT TAAATATTAC TTTGCTAGAG TTTCTCCTCG TTTATTTAGA 840 

AATAAATGTG TAGTGGGGAC CAGTGGTTGT AATGTAGATA CTTGAGAAGT TTCATTGATT 900 

CCTTCAGACC CATGCOG 




Seq ID NO: B17 DNA sequence

Nucleic Acid Accession #: NM_022898.1

Coding sequence: 268..2739

1 11 21 31 41 51

| | | | | |

TTTTCTTGCT TTTCTTCCCT TTTTTTTCTT TTTGCAAACA AAACAAAAAA CAHCATAGAA 60 

GAAAGAGCAA AATAAAGAAG AAGAAGAGGA GGAAGAGAGG GAAAGAGAGG AAGGGAAAAA 120 

AAACAGCAAC CCQOGCAGAG GAGGAGGTGC GGCGGCGGCG GCGGCGGCGG CAGCGGCGGC 180 

AGCGGCGCGG CGGCGGCTCG GACCCCCTCC CCCGGCTCCC CCCATCAGTG CAGCTCTCCG 240 

GGCGATGCCA GAATAGATGC CGGGGCAATG TCCCGCCGCA AACAGGGCAA CCCGCAGCAC 300 

TTGTCCCAGA GGGAGCTCAT CACCCCAGAG GCTGACCATG TGGAGGCCGC CATCCTCGAA 360 

GAAGACGAGG GTCTGOAGAT AGAGGAGCCA AGTGGCCTGG GGCTGATGGT aOGTGGCCCC 420 

GACCCTGACC TGCTCACCTG TGGCCAGTGT CAAATGAACT TCCCCTTGGG GGACATCCTG 480 

GTTTTTATAG AGCACAAAAG GAAGCAGTGT GGCGGCAGCT TGGGTGCCTG CTATGACAAG 540 

GCCCTGGACA AGGACAGCCC GCCACCCTCC TCACGCTOCG AOCTCAGGAA AGTGTCOGAG 600 

CCGGTGGAGA TCGGGATCCA AGTCACCCCC GACGAAGATG ACCACCTGCT CTCACCCACG 660 

AAAGGCATCT GTCCCAAC3CA GGAGAACATT GCAGGTAAAG ATGAGCCTTC CAGCTACATT 720 

TGCACAACAT GCAAGCAGCC CTTCAACAGC GCGTGGTTCC TGCTGCAGCA CGCGCAGAAC 780 

ACGCACGGCT TCCGCATCTA CCTGGAGCCC GGGCCGGCCA GCAGCTCGCT CACGCCGCGG 840 

CTCACCATCC CGCCGCCGCT CGGGCCGGAG GCCGTGGCGC AGTCCCCGCT CATGAATTTC 900 

CTGGGCGACA GCAACCCCTT CAACCTGCTG CGCATGACGG GCCCCATCCT GCGGGACCAC 960 

CCGGGCTTCG OCGAGGGCCG CCTGCCGGGC ACGCCGCCTC TCTTCAGTCC CCCGCCGCGC 1020 

CACCACXrrGG ACCCGCACCG CCTCAGTGCC GAGGAGATGG GGCTCGTCGC CCAGCACCOC 1080 

AGTGCCTTCG ACCGAGTCAT GCGCCTQAAC CCCATGGCCA TCGACTCGCC CGCCATGGAC 1140 

TTCTCGCGGC GGCTCCGCGA GCTGGCGGGC AACAGCTCCA CGCCGCCGCC CGTGTCCCCG 1200 

GGCCGCGGCA ACCCTATGCA CCGGCTCCTG AACCCCTTCC AGCCCAGCCC CAAQTCOOCG 1260 

TTCCTGAGCA CGCCGCCGCT GCCGCCCATG CCCCCTGGCG GCACGCCGCC CCCGCAGCCG 1320 

CCAGCCAAGA GCAAGTCGTG CGAGTTCXGC GGCAAGACCT TCAAGTTCCA GAGCAATCTC 1380 

ATCGTGCACC GGCGCAGTCA CACGGGCGAG AAGCCCTACA AGTQCCAGCT GTGCGACCAC 1440 

GCGTGCTCGC AGGCCAGCAA GCTCAAGCGC CACATGAAGA CGCACATGCA CAAGGCCGGC 1500 

TCGCTGGCCG GCCGCTCCGA CGACGGGCTC TCGGCOGCCA GCTCCCCCGA GCCCGGCACC 1560 

AQCGAGCTGG CGGGCGAGGG CCTCAAGGCG GCCQACGGTG ACTTC03CCA CCACGAGAGC 1620 

GACCCGTCGC TGGGCCACGA GOCGGAGGAG GAGGACGAGG AGGAGGAGGA GGAGGAGGAO 1680 

GAQCTGCTAC TGGAGAACGA GAGGCGGCCC GAGTCGAGCT TCAGCATGGA CTCGGAGCTG 1740 

AGOCGCAACC GCGAGAACGG CGGTGGTQGG GTGCCCGGGG TCCCGGGCGC GGGGGGCGGC 1800 

□CGGCCAAGG CGCTGGCTGA CGAGAAGGCG CTGQTGCTGG GCAAGGTCAT GGAGAACGTG 1860 

GGCCTAGGCa CACTGCCGCA GTACGGCGAG CTCCTGGCCG ACAAGCAGAA GCGCGGCGCC 1920 

TTCCTGAAGC GTGCGGCGGG CGGCGGGGAC GCGGGCGACG ACQACGACGC GGGCGGCTGC 1980 

GGGGACGGGG GCGCGGGCGG CGCGGTCAAC GGGCGCGGGG GCGGCTTCGC GCCAGGCACC 2040 

GAGCCCTTCC OCGOGCTCTT CCCGCGCAAG CCCGCGCCGC TGCCCAGCCC CGGGCTCAAC 2100 

AGCGCCGCCA AGCGCATCAA GGTGGAGAAG GACCTGGAGC TGCCGCCOGC CGCGCTCATC 2160 

CCGTCCGAGA ACGTGTACTC GCAQTGGCTG GTGGGCTACG CGGCGTCGCG GCACTTCATG 2220 

AAGGACCCCT TCCTGGGCTT CACGGACGCA CGACAGTCGC CCTTCGCCAC GTCGTCCGAG 2280 

CACTC3TCCG AGAACGGCAG CCTGCGCTTC TCCACGCCGC CCGGGGACCT GCTGGACGGC 2340 

GGCCTCTCGG GCCGCAGCGG CACGGCCAGC GQAGGCAGCA CCCCGCACCT GGGCGGCCCG 2400 

GGCCCCGGGC GGCCCAGCTC CAAGGAGGGC CGCCGCAGCG ACAOSTGCGA GTACTGCGGC 2460 

AAGGTGTTCA AGAACTGCAQ CAACTTGACG GTGCACCGGC GGAGCCACAC CGGCGAGCGG 2520 

CCTTACAAGT GCGAGCTGTG CAACTACGCG TGCGCGCAGA OCAGCAAGCT CACGCGCCAC 2580 

ATGAAGACaC ACGGGCAGAT CGGCAAGGAG GTGTACCGCT GCGACATCTG CCAGATGCCC 2640 

TTCAGCGTCT ACAGCACCCT GGAGAAAC&C ATGAAAAAGT GGCACGGCGA GCACTTGCTG 2700 

ACIAACGACG TCAAAATCGA GCAGGCCGAG AGGAGCTAAG CGCGCGGGCC CCGGCGCCCC 2760 

GCACCTCTAC AGTGGAACCG TTGCCAACCB AGAGAATGCT OACCTGACTT GCCTCCGTGT 2820 

CACCGCCAOC CCGCACCCCG CGTGTCCCCG GGGCCCAGGG GAGGCGGCAC TCCAACCTAA 2880 

CCTGTGTCTG CGAAGTCCTA TGGAAACCCG AGGGTTGATT AAGGCAGTAC AAATTGTGGA 2940 

GCCTTTTAAC TGTGCAATAA TTTCTGTATT TATTGGGTTT TGTAATTTTT TTGGCATGTG 3000 

CAGGTACTTT TTATTATTAT TTTTTCTGTT TGAATTCCTT TAAGAGATTT TGTTGGGTAT 3060 

CCATCCCTTC TTTGTTTTTT TTTTAACCCG GTAGTAGCCT GAGCAATGAC TCGCAAGCAA 3120 

TGTTAGAGGG GAAGCATATC TTTTAAATTA TAATTTGGGG GGAGGGOTGG TGCTGCTTTT 3180 

TTGAAATTTA AGCTAAGCAT GTGTAATTTC TTGTGAAGAA GCCAACACTC AAATGACTTT 3240 

TAAAGTTGTT TACTTTTTCA TTCCTTCCTT TTTTTTGTCC TGAAATAAAA AGTGGCATGC 3300 

AGTTTTTTTT TTAATTATTT TTTAATTTTT TTTTTGGTTT TTGTTTTTGG GGTGGGGGGT 3360 

GTGGATGTAC AGCGGATAAC AATCTTTCAA GTCGTAGCAC TTTGTTTCAG AACTGGAATG 3420 




GAGATGTAGC ACTCATGTCG TCCCGAGTCA AGCGGCCTTT TCTGTGTTGA TTTOGGCTTT 3480 
CATATTACAT AAGGGAAACC TTGAGTGGTG GTGCTGGGGG AGGCACCCCA CAGACTCAGC 3540 
GCCGCCAGAG ATAGGGTTTT TGGAGGGCTC CTCTGGGAAA TGGCCCGACA GCATTCTGAG 3600 
GTTGTGCATG ACCAGCAGAT ACTATCCTGT TGGTGTGCCC TGGGGTGCCA TGGCTGCTAT 3660 
TCGCTGTAGA TTAGGCTACA TAAAATGGGC TOAOGOTACC TTTTTGGGGA GATGQGGTGG 3720 
CCTGCAGTGA CACAOAAAGG AAGAAACTAG CGGTGTTCTT TTAGGCGTTT TCTGOCTTGA 3780 
OSGCTTCTCT CTTTTTTTAA ATCACCCCCA CCACATAAAT CTCAAATCCT ATGTTGCTAC 3840 
AAGGGGTCAT OCATCATTTC CCAAGCAQAC GAATGCCCTA ATTAATTGAA GTTAGTGTTC 3900 
TCTCATTTAA TGCACACTGA TGATATTGTA GGGATGGOTG GGGTGGGGAT CTTGCAAATT 3960 
TCTATTCTCT TTTACTGAAA AAGCAGGGGA TGAGTTCCAT CAGAAGGTGC CCAGCGCTAC 4020 
TTCCCAGGTT TTTATTTTTT 1TTTCCTATC TCATTAGGTT GGAAGGTACT AAATATTGAA 4080 
CTGTTAAGAT TAQACATTTG AATTCTGTTG ACCCGCACTT TAAAGCTTTT GTTTGCATTT 4140 
AAATTAAATG GCTTCTAAAC AAGAAATTGC AGCATATTCT TCTCTTTGGC CCAGAGGTGG 4200 
GTTAAACTGT AAGGGACAGC TQAGATTGAG TGTCAGTATT GCTAAGCGTG GCATTCACAA 4260 
TACTGGCACT ATAAAGAACA AAATAAAATA ATAATTTATA GGACAOTTTT TCTACTGCCA 4320 
TTCAATTTGA TGTGAGTGCC TTGAAAACTG ATCTTCCTAT TTGAGTCTCT TGAGACAAAT 4380 
GCAAAACTTT TTTTTTCAAA TGAAAAGACT TTTTAAAAAA GTAAAACAAG AAAAGTACAT 4440 
TCTTTAGAAA CTAACAAAGC CACATTTACT TTAAGTAAAA AAAAAAAAAA TTCTGGTTGA 4500 
AGATAGAGGA TATGAAATGC CATAAGACCC AATCAAATGA AGAAATAAAC CCAGCACAAC 4560 
CTTGGACATC CATTAGCTGA ATTATCCTCA GCCCCTTTTG TTTTTGGGAC AACGCTGCTT 4620 
AGATATGGAG TGGAGGTGAT TTAGTGCTGA ATTAAAACTC AAGTOACACA AGTTACAAQT 4680 
TGATATCGTT GAATGAAAAG CAAAACAAAA ACAATTCAGG AACAACGGCT AATTTTTTCT 4740 
AAAGTTAAAT TTAGTGCACT CTGTCTTAAA AATACGTTTA CAGTATTGGG TACATACAAG 4800 
GGTAAAAAAA AAATTGTGTG TATGTGTGTT GGAGCGATCT TTTTTTTTTCA AAGTTTGCTT 4860 
AATAGGTTAT ACAAAAATGC GACAOTGGCC GCGTGTATAT TGTTTTCTTT TGGTOACGGG 4920 
GTTTTAGTAT ATATTATATA TATTAAAATT TCTTGATTAC TGTAAAAQTG GACCAGTATT 4980 
TGTAATAATC GAGAATGCCT GGGCATTTTA CAAAACAAGA AAAAAAATAC CCTTTTCTTT 5040 
TCCTTGAAAA TGTTGCAGTA AAATTTAAAT GGTGGGTCTA TAAATTTGTT CTTGTTACAG 5100 
TAACTGXAAA GTCGGAGTTT TAGTAAATTT TTTTCTGCCT TGGGTGTTGA ATXTTTATTT 5160 
CAAAAAAAAT GTATAGAAAC TTGTATTTGG GGATTCAAAG GGGATTGCTA CACCATGTAG 5220 
AAAAAGTATG TAGAAAAAAA GTGCTTAATA TTGTTATTGC TTTGCAGAAA AAAAAAAAAT 5280 
CACATTTCTG ACCTGTACTT ATTTTTCTCT TCCCGCCTCC CTCTGGAATG GATATATTGG 5340 
TTGGTTCATA TGATGTAGGC ACTTGCTGTA TTTTTACTGG AGCTOGTAAT TTTTTAACTG 5400 
TAAGCTTGTC CTTTTAAAGG GATTTAATGT ACCTTTTTGT TAGTGAATTT GGAAATAAAA 5460 
AGAAAAAAAA AACAAAAACA AACAGGCTGC CATAATATAT TTTTTTAATT TGGCAGGATA 5520 
AAATATTGCA AAAAAAACAC ATTTGTATGT TAAGTCCTAT TOTACAGGAG AAAAAOGGXT 5580 
OTTTGACAAC CTTTGAGAAA AAGAAACAAA AGGAAGTAGT TAAATGCTTT GGTTCACAAA 5640 
TCATTTAGIT GTATATATTT TTTGTCGGAA TTGGCCTACA CAOAGAACCG TTCQTGTTGG 5700 
GCTTCTCTCT GAACGCCCCQ AACCTTGCAT CAAGGCTCCT TGGTGTGGCC ACAGCAGACC 5760 
AGATQGGAAA TTATTTDTGT TGAGTGGAAA AAAATCAGTT TTTGTAAAGA TGTCAGTAAC 5820 
ATTOCACATC GTCCTCCCTT TCTCTAAGAG GCCATCTCTA AGATGTCAGA TGTAGAGGAG 5880 
AGAOAGCGAG AGAACATCTT CCTTCTCTAjC CATCACTCCT GTGBCGGTCA CCACCACCAC 5940 
CTCTCCCGCC CTTACCAGCA GAAAGCAATG CAAACTGAGC TGCTTTAGTC CTTGAGAAAT 6000 
TGTGAAACAA ACACAAATAT CATAAAAGGA GCTGGTGATT CAGCTGGGTC CAGGTGAAGT 6060 
GACCTGCTGT TGAGACCGGT ACAAATTGGA TTTCAGGAAG OAGACTCCAT CACftGCCAGG 6120 
ACCTTTCGTG CCATGGAQAG TGTTGGCCTC TTGTCTTTCT TCCCTQCTTT GCTGCTTTGC 6180 
TCTCTGAAAC CTACATTCOG TCAGITTCCG AATGOGAGGG CCTGGGATGA ATTTGGTGCC 6240 
TTTCCATATC TCGTTCTCTC TCCTTCCCCT GCGTTTCCTC TCCATCCTTC ATCCTCCATT 6300 
GGTCCTTTTT TTTTCTTTCA TTTTTTATTT AATTTCTTTT CTTCCTGTCT GTTCCTCCCC 6360 
TAATCCTCTA TTTTATTTTT ATTTTTTGTA AAGCCAAGTA GCTTTAAGAT AAAGTGGTGG 6420 
TCTTTTGGAT GAGGGAATAA TGCATTTTTA AATAAAATAC CAATATCADG AAGCCATTTT 6480 
TTATTTCAGG AAATGTAAGA AACCATTATT TCAGGTTATG AAAGTATAAC CAAGCATOCT 6540 
TTTGGGCAAT TCCTTACCAA ATGCAGAAGC TTTTCTGTTC GATGCACTCT TTCCTCCTTG 6600 
CCACTTACCT TTGCAAAGTT AAAAAAAAGG OGGGAGQGAA TGGGAGAGAA AGCTGAGATT 6660 
TCAGTTTCCT ACTGCAGTTT CCTACCTGCA GATCCAGGGG CTGCTGTTGC CTTTGOATGC 6720 
CCCACTGAGG TCCTAGAGTG CCTCCAGGGT GGTCTTCCTG TAGTCATAAC AGCTAGCCAG 6780 
TGCTCACCAG CTTACCAQAT TGCCAGGACT AAGCCATCCC AAAGCACAAG CATTGTQTGT 6840 
CTCTGTGACT GCAGAGAAGA QAflAATTTTG CTTCTGTTTT GTGTTTAAAA AACCAACACG 6900 
GAA0CAGA.TG ATCCCOAGAG AGAGQCCTCT AGCATGGGTG ACCCAGCCGA CCTCAGGCOG 6960 
GTTTCCGCAC TGCCACAACT TTGTTCAAAG TTGCCCCCAA TTGGAACCTQ CCACTTGGCA 7020 
TTAGAQGGTC TTTCATGGGG AGAQAAGGAG ACTGAATTAC TCTAAGCAAA ATQTGAAAAG 7080 
XAAGGAAATC AGCCTTTCAT COCGGTCCTA AGTAACCGTC AGCCGAAGGT CTOGTGGAAC 7140 
ACAGGCAAAC CCGTGATTTT GGTGCTCCTT G1AACTCAGC CCTGCAAAGC AAAGTCCCAT 7200 
TGATTTAAGT TGTTTOCATT TGTACTGGCA AGGCAAAATA TTTTTATTAC CTTTTCTATT 7260 
ACTTATTGTA TGAGCTTTTG TTGTTTACTT GGAGGTTTTO TCTTTTACTA CAAGTTTGGA 7320 
ACTATTTATT ATTGCTTGGT ATJTGTGCTC TGTTTAAGAA ACAQGCACTT TTTTTTATTA 7380 
TGGATAAAAT GTTGAGATGA CAGGAGGTCA TTTCAATATG GCTTAGTAAA ATATTTATTG 7440 
TTCCTTTATT CTCTGTACAA GATTTTGGGC CTCTTTTTTT CCTTAATGTC ACAATGTTGA 7500 
GTTCAGCATG TGTCTGCCAT TTCATTTGTA CGCTTGTTCA AAACCAAGTT TGTTCTGGTT 7560 
TCAAGTTATA AAAATAAATT GGACATTTAA CTTOATCTCC AAA 

Seq ID NO: B18 Protein sequence
Protein Accession #: NP_075049.1

1 11 21 31 41 51

| | | | | |

KSimiQQGKPQ HI>SQaEI»lTP EADHVKAAIL ESDEGLEIEB ESGLGLMVGG PDPDLLTCGQ 60 

CQMNFPLGDI DVFIEHKRKQ CGGSLGACYD KAIJ)KD8PPP S9RSELrBKVS BPVB IGIQVT 120 

PDSPDHUjSP TKGXCPKQEN rAQKDKPSSY 1CTTCKQPEH GAHFIiLQHAQ KTHGFRIYIiE 180 

PGPASSSI/TP RI«T1PPPLGP EAVAQSPLMW FLGDSHPENIi IrHMTGPILRD KPQFGBGRLP 240 

GTPPIiFSPPP EHECLDPHRIiS AEEMGLVAQH PSAFDSVKftL NPMAXD3PAM DPSRRLHSliA 300 

GMSSTPPPVS PGRGNPMHRL LNPPQPSPKS PPL8TPPLPP MPPGGTPPPQ PPAKfiKSCEP 360 

CGKTFKFQSN IiIVHUHSHTG EKPYKCQLrCD HACSQABKLTC TtHMKTHMHKA GSLAflftSDDG 420 

LSAAS9PEPG TSELAGBQUC AADGDFRHHE SDP3LGEEPE EEDEEEESEE BELUJBMHSR 480 



PESSPSMDSE LSRBREHGGG GVPGVPGAGG GAAKAfcADEK ALVLGKVMEN VGLGALPQYG 540

KT.TADKQKRG AFlaKRAAOOQ DMDDDDRGG CGDAGAGGAV IK3RGGGFAPG TEPFPGLFPE 600

KPAPIiPSPGL NSAAXRIKVE KDLEIiPPAAIj IPSEKVY9QW IiVGYAASEHF MKBPFLGPTD 660

ARQSPFATSS EES SENG SILR FSTPPGDLLD GGLSGRSGTA SGGSTPHLGG PGPGRPBSKE 720

GRK9DTCBYC GKVFKNC6WL TVHRR8HTGE RPYKCELCNY ACAQSSKLTR HMKTHGQIGK 780

EVYRCDIGQM PFSVYSTJUEK HMKKWHGEHLr LTNDVXIEQA EES

Seq ID NO: B19 DNA sequence

Nucleic Acid Accession #: NM_000399.2

Coding sequence: 333..1769



1 11 21 31 41 51

| | | | | |

TAACTGAGCG AGGAGCAATT GATTAATAGC TCGGCGAGGG GACTCACTGA CTGTTATAAT 60
AACACTACAC CAGCAACTCC TGGCTTCCCA GCAGCCGGAA CACAGACAGG AGAGAGTCAG 120
TGGCAAATAG ACATTTTTCT TATTTCTTAA AAAACAGGAA CTTGTTTGCT ACTTTTATTT 180
CTGTTGATTT TTTTTTCTTG GTGTGTGTGG TGGTTGTTTT TAAGTGTGGA GGGCAAAAGG 240
AGATACCATC CCAGGCTCAG TCCAACCCCT CTCCAAAACG GCTTTTCTGA CACTCCAGGT 300
AGCBAGGQAG TTGGGTCTCG AGGTTQTGOG AEGAGCAAAT GATGACCGCC AAGGCCGTAG 360
ACAAAATCCC AGTAACTCTC AGTGGTTTTG TBCACCAGCT GTCTGACAAC ATCTACCCGG 420
TGGAGGAECT CGCCGCCACG TC3GGTGACCA TCTTTCCCAA TGCCGAACTG GGAGGCCOCT 480
TTGACCAGAT GAACGGAGTG OCCGGAJSATG GCATGATCAA CATTGACATG ACTGGAGAGA 540
AQAQBTCBTT GGATCTCOCA TATCCCAGCA GCTTTGCTCC CGTCTCTGCA CCTAGAAACC 600
AGACCTTCAC TTACATGGGC AAGTTCTCCA TTGACCCTCA GTACCCTGGT GOCAGCTGCT 660
ACCCAQAAGG CATAATCAAT ATTGTGAGTG CAGGCATCTT GCAAGGGGTC ACTTOCCCAG 720
CTTCAACCAC AGGCTCASCC AGCGTCACCT CTGCCTOCCC CAACCCACTG GCCACAGGAC 780
CCCTGGGTGT GTGCACCATG TCCCAGACCC AGCCTGACCT GGACCAOCTG TACTCTCCGC 840
CACCQCCTCC TCCTOCTTAT TCTGGCTGTG CAGGAGACCT CTACCAGGAC CCTTCTGCGT 900
TOCTGTCAGC AGCCACCACC TCCIACCTCTT CCTCTCTGGC CTACCCAjCCA CCTCCTTOCT 960
ATCCATCCCC CAAGCCAGCC ACGGACOCAG GTCTCTTCCC AATGATCCCA GACTATCCTG 1020
GATTCTTTCC ATCTCAGTGC CAGAGAGACC TACATGGTAC AGCTGGCCCA GACCGTAAGC 1080
CCTTTCCCXG CCCACTGGAC AOCCTGCGGG TGOCCCCTCC ACTCACTOCA CTCTCTACAA 1140
TCCGTAACTT TACCCTGGGG GGCCCCAGTG CTGGGGTGAC COGACCAGGG GCCAGTGGAG 1200
GCAGCGAGGG ACCCCGGCTG CCTGGTAGCA GCTCAGCAGC AGCAGCAGCC GCCGCCGCCG 1260
CCGCCTATAA CCCACACCAC CTGCCACTGC GGCCCATTCT GAGGCCXCGC AAGTACCCCA 1320
ACAQACCCAG CAAGACGCOG GTGCACGAGA GGCCCTACCC GTGCCCAGCA GAAGGCTGCG 1380
ACCGGCGGTT CTCCCGCTCT GACGAGCTGA CACGGCACAT CCGAATCCAC ACTGGGCATA 1440
AGCCCTTCCA GTGTCGGATC TGCATGCGCA AjCTTCAGCCG CAGTGACCAC CTCACCACCC 1500
ATATCCGCAC CCACACCGGT GAGAAGCCCT TCGCCTGTGA CTACTG1GGC OGAAAGTTTG 1560
CCCGGAGTGA IGAGAGGAAG CGCCACACCA AGAICCAOCT GAGACAGAAA OAGCGGAAAA 1620
GCAGTGCCCC CTCTGCATCG GTGCCAGCCC CCTCTACAGC CTOCTGCTCT GGGGGGGTGC 1680
AGGCTGGGGG TACCCTGTGC AGCAGTAACA GCAGGAGTCT TGGCGGAGGG OGQCTQ3CCC 1740
CTTGCTCCTC TCGGACCCGG ACACCTTGAG ATGAGACTCA GGCFGATACA CCAGCTCCCA 1800
AAGGTCCCGG AGGCCCTTTG TCCACTHGAQ CIGCACAACA AACACTACCA CCCTTTCCTG 1860
TCCCTCTCTC CCTTTGTTGG GCAAAGGGCT TTGGTGGAGC TAGCACTGCC CCCTFTCCAC 1920
CTAGAAGCAG GTTCTTCCTA AAACTTAGCC CATTCTAGTC TCTCTTAGGT GAGTTGACTA 1980
TCAACCCAAG GCAAAGGGGA QGCTCAGAAG GAGGTGGTGT GGGQATCCCC TGGCCAAGAG 2040
GGCTGAGGTC TGACCCTGCT TTAAAGGGTT GTTTGACTAG GTTTTGCTAC CCCACTTCCC 2100
CTTATTTTGA CCCATCACAG GTTTTTGACC CTGGATGTCA GAGTTGATCT AAGACGTTTT 2160
CTACAATAGG TTGGGAGATG CTGATCCCTT CAAGTGGGGA CAGCAAAAAG ACAAGCAAAA 2220
CTGATGTGCA CTTTATGGCT TGGGACTGAT TTGGGGGACA TTGTACAGTG AGTGAAGTAT 2280
AGCCTTTATG CCACACTCrG TGGCCCTAAA AXGGTGAATC AGAGCATATC TAGTTGTCTC 2340
AACCCTTGAA GCAATATGTA TTATATACTC AGAGAACAGA AGTGCAATGT GATGGGAGGA 2400
ACGTAGCAAT ATCTGCTCCT TTTCGAfiTTG TTTGAGAAAT GTAGGCTATT TTTTCAGTGT 2460
ATATCCACIC AGATTTTGTG TATTTTTGAT GTACCCACAC TGTTCTCTAA ATTCTGAATC 2520
TTTGGGAAAA AATGTAAAGC ATTTATGATC TCAGAGGTXA ACTTATTTAA GGGGGATGTA 2580
CATATTCTCT GAAACTAGGA TGCATGCAAT TGTGTTGGAA GTGTCCTTGG TCGCCTTGTG 2640
TGATGTAGAC AAATGTTACA AGGCTGCATG TAAATGGGTT GCCTTATTAT GGAGAAAAAA 2700
ATCCACTCCCT GAGXTTAGTA TGGCXGTATA TTTATGOCTA TTAATATTTG GAATTTTTTT 2760
TAGAAAGTAT ATTTTTGTAT GCTTTGTTTT GTGACTTAAA AGTGTTACCT TTGTAGTCAA 2820
ATTTCAGATA AGAATGTACA TAATGTTACC GGAGCTGATT TGTTTGGTCA TTAGCTCTXA 2880
ATAOTTGTGA AAAAAIAAAT CTATTCTAAC GCAAAACCAC tEAACIGAAGT TCAGATATAA 2940
TGGATGGTTT GTGACTATAG TGTAAATAAA TACTTTTCAA CAAT

Seq ID NO: B20 Protein sequence
Protein Accession #: NP_000390.2



1 11 21 31 41 51

| | | | | |

MMTAKAVDXI PVTLSGFVHQ LSDNIYPVKD IAAT8VT1FP KAELGGPFDQ MNGVAGDGH1 60
NHJMTGEKRB LOLPYP88FA PVSAPKNQTF TYMGKFSIDP QfYPGASCYPE GI1NIVSAGI 120
liQGVTSPAST TA3S3VT8AB PNPLATGPLG VCTMSOTOPD LDHLYSPPPP PPPYSGCAOD 180
LYQDPSAFXiS AATTSTS8SL AYPPPP8YP6 PKPATDPGU? PMIPDTPGFF PSQCOftDLHG 240
TAGPDEKPFF cprjyrijRVPP PLTPLSTIEU FTUGGP6AGV TGFGASGGSB GPRLPGSSSA 300
AAAAAAAAAY NPHHLPLEPI LRPEKYP17RP SKTFVHERPY PCPAEGCDER FSEBDSLTEH 360
1RIHTGHKPF QCRXCMKNF6 DYGGRKFARJ3
PGASVPAPST ASCSGGVQPG GTJjCSSN8S3 IGGGPLAPCS SHTR'i'P



Seq ID NO: B21 DNA sequence

Nucleic Acid Accession #: NM_004962.2

Coding sequence: 457..1893

1 11 21 31 41 51

| | | | | |



CACACACGGG COCACGCACA CGGCAGCCGG GCCAGGGACG ACCCTGTCAG CTGCAGCCCC 60 

AGAGGTCCGG GGCGCGCAOC CGGGTCCCCT CGAGGGCGCA GCOGGCCGCC CCGCCCCGCC 120 

CCTCGAAGCA GCOGGGCCGG GCGCGCAGTG GGCTACAAAC TTTCGCAGCG CGAGTCCGCC 180 

AAOGCAGCGC GCCGACTCGG GCTCBGCTCG GCTCTGCGCT GCTCCGGACG GCTGTOACCG 240 

CTGGCOGGGG GCTCGGGCCG CCGGTACCCA CGGACCGCGC GCOCGGGTGC CXGCTCCGCT 300 

AAQCCCCTCG CCCCGCGCGG ACCTCGGTAT CCAGCGCCCT GCTGCCCGGG CTCTCCCCGC 360 

GCGCCCTACT GCCGCGAGGT CAGTCCGCAG CCTCCGGTOC GCCAGCGCTC GCCTTCCTCC 420 

TCCTGGACTT CGGCCCTTTG CCGCCCTCAC CACGCCATGG CTCATGTCCC CGCTCGGACC 480 

AGCCCGGGAC CCGGGCCCCA GCTGCTGCTG CTGCTGCTGC CGTTGTTTCT GCTGTTGCTC 540 

CGGGAXGTGG CCGGCAQCCA CAGGGCCCCC GCCTGGTCCG CACTGCCCGC GGCCGCCGAC 600 

GGCCTGCAGG GGGACAGGGA TCTCCAGCGG CACOCTGGGG ACGCGGCCGC CACGTTGGGC 660 

CCCAGGGCCC AGGACATGGT CGCTGTCCAC ATGCACAGGC TCTATGAGAA GTACAGCCGG 720 

CAGGGCGCGC GGCCGGGAGG GGGCAACACG GTCCGCAGCT TCAGGGCCAG GCTGGAAGTG 780 

GTCGACCAGA AGGCCGTGTA TTTCTTCAAC CTGACTTCCA TGCAAGACTC GGAAATGATC 840 

CTTACGGCCA CTTTCCACTT CTACTCAGAG CCGCCTCGGT GGCCTCGAGC GCTCGAGGTG 900 

CTATGCAAGC CGCGGGCCAA GAACGCTTCA QQCCQCCCGC TGCCCCTGGG CCCGCCCACA 960 

CGCCAGCACC TGCTCTTCCG CAGCCTCTCG CAGAACACGG CCACACAGGG GCTACTCCGC 1020 

GGGGCCATGG CCCTGGCGCC CCCACCGCGC GGCCTGTGGC AGGCCAAGGA CATCTCCCCC 1080 

ATCGTCAAGG CGGCCCGCCO GGATGGCGAG CTGCTCCTCT CCGCCCAGCT GGATTCTOAG 1140 

GAGAGGGACC CGGGGGTGCC CCGGCCCAGC CCCTATGCGC CCTACATCCT AGTCTATGCC 1200 

AACGATCTGG CCATCTCGGA GCCCAACAGC GTGGCAGTQA CGCTGCAGAG ATACGACCCC 1260 

TTCCCTGCCG OAGACCCCGA GCCCCGCGCA GCCCCCAACA ACTCAGCGGA CCCCCGCGTG 1320 

CGCCGAGCCG CGCAGGCCAC TGGGCCCCTC CAGOACAACG AGCTGCCGGG GCTGGATGAG 1380 

AGGCCGCCGC GCGCCCACGC ACAGCACTTC CACAAGCACC AGCTGTG3CC CAGCCCCTTC 1440 

OGGGCGCTGA AACCCCGGCC AGGGCGCAAA GACCGCAQGA AGAAGGGCCA GGAGGTGTTC 1500 

ATGGCCGCCT CGCAGGTGCT GGACTTTGAC GAGAAGACOA TGCAGAAAGC CCGGAGGAAG 1560 

CAGTGGGATG AGCCGAGGGT GTGCTCCCGG AGGTACCTGA AGGTGGACTT CGCAGACATC 1620 

GGCTGGAATG AATGGATAAT CTCACCGAAA TCTTTTGATG CCTACTACTG CGCGGGABCA 1680 

TGTGAGTTCC CCATGCCTAA GATCGTTCGT CCATCCAACC ATGCCACCAT CCAGAGCATT 1740 

GTCAGGGCTG TGGGCATCAT CCCTGGCATC CCAOAGCCCT GCTGTGTTCC CGATAAGATG 1800 

AACTCCCTTG GGGTCCTCTT CCTGGATGAG AATCGGAATG TGGTTCTGAA GGTGTACCCC 1860 

AACATGTCCG TGGACACCTG TGCCTGCCGG TGAOACCACT CCAGGGTGGA AAGAAQOCAC 1920 

OCCCAGCAGA GCTGCCTTCT CGGAQCCTTC TGCAACCAGG ACTTGTGGTG CAGCTGCAGA 1980 

CACAGAGCAC AGCTCATGGG CAACATCACT GGGGCCCAGA GAGAGCTGTC CGCCAGTGCA 2040 

TCATTAGGGQ GTCTTTCATT GCTAGTGACT AGCCCCTTAA ATGCCAGCCT GAGTACCTGA 2100 

AGGAATCTGG GAATTAGCCC TGGCCTGAAA GTGGCCCATC ATTCATACCC ACTGTTCTGA 2160 

AGGCTTGAAA ACAAAACATA TCCACAACAT TGGCTTGATG TGATCATCAT CTCATAACTG 2220 

AGCAAGAAGA CTATGCAAAT CTTAGGGCGC TCGCTCCCTG CACACGGAAA GAACTCTGTT 2280 

TAAATGCTCA GTTCAGAACA CTTTGGGQCA CATAGTGATT TTGGAAAACA GGATAATCGT 2340 

GGTGTAAATG AGTGTTTCCT TTCAARGTCC ACTQCAGAGC TTTTATCCAT ATGOTATGCA 2400 

CATGTAGCCA ATATTGGTTT CTTTTTCTTA ATATATATAT TTTATTTTAA AACAACAAAA 2460 

AGGGAGGGCG TTOACAOCAT TCCCCACAGA GATAGTCATG CTGAGTGTGG GTTGTTTAAA 2520 

CATGCATATT GAAATAACAC ATATAGTAAC GTGGGAATAC TAAAAAATAA CCAAGATTTT 2580 

ATATTTTTGT AAATTATACT TTCTATACTG TAQATTGTGT ATGTTATGTG TTTTTATGGA 2640 
AAGCTAATAA ATTAAAGGTA CAGTGGTATC TTGA 

Seq ID NO: B22 Protein sequence
Protein Accession #: NP_004953.1

1 11 21 31 41 51

| | | | | |

MAHVPAETSP GFGPQLiLLIi LPLFLiitiliRD VAGSHRAPAW SALPAAADGIi QGDRDLQRHP 60 
GDAAATLGPS AQDKVAVHMH KL.YKKYSRQG ARPGGGNTVR GFSARLEWD QKAVYFFNIiT 120 
SMQDSEMILT ATFHFYSEPP RWPRAI^VLC KPRAKNASGR PIiPXiGPPTHQ KT.TiFRSIiSQN 180 
TATQGLfcRGA MAIAPPPRGti HQAKDISPXV KAARRDGELL LSAQLDSBER DPGVPHPSPY 240 
AFYILVYAND LAISEFNSVA VTIdQRYDPFP AGDPEPRAAP NNSADPHVRR AAQATGPLQD 300 
NELPGLDERP PRAHAQHPHK EQLHPSPFRA Itf3?«PGRKDR RKKGQEVFMA ASQVU3FDEK 360 
TKQKARRKQK DEPSVCSRRY LKVDFADIGW NEWIISPKSF DAYYCAGACB FPMPKTVRPS 420 
KHATIQSIVR AVGIIPGIPE PGCVPHKMNS LOVU?ld3BNR NWLKVYPNM SVDTCJi.Cn 

Seq ID NO: B23 DNA sequence

Nucleic Acid Accession #: AK026322.1

1 11 21 31 41 51

| | | | | |

ATTCTTTAAA TACTTAAACA TGGTTATAAC AGCTGTGTTT TAAGCTCCTT GTCTGTTAAT 60 

TOCATCATCT GTCATTTTTG CATCTOTTTT TATTGACTAA CTTTCTTCCT CATCATGGTC 120 

ACATATCCTT GCTTCTTCCC ATATTTAGTA AAGTTTAGTT GGATGCTGGA TTGTTTTAAT 180 

TTAGCATTTT GGAGTGACTG CTTTTGCACT TGCCTTTAAA AGTATTGGAC TTCGTTTTGA 240 

TAGTTGAATT ACTTGCAAAT CAGCTTGATC ATTCTGAGAC TTGTTTTTAA CTTTTGTCQA 300 

AAftGGCTATG CTACTTCAAG TATAATAAAA CCTAGTTTTA GTTTTATCCT ATAACTAAGG 360 

CAT0TGTTCA ATTTGAATGT CTCCAACCCT GTGTGAACTC TGAAAATTGT TCAGCTCCCA 420 

ATTTOGCAGT AATTTTTTTT GTTCAGCCTT GCAGTCTCAT CCTACTCAAG TGTGGCTCTG 480 

TATCCAACAA CAGTCTTGGA GATCTCATGA AQATTTCTGA AACTTTTGCT CTGCAGGATA 540 

CCTTCCTTTG TGGTTATGTG CCCTGAAAAC TCCAGCCTCC GTGTCTTCAA TTCAGTGAAA 600 

CTACTACTCT GCTTGGATTC CCCTCTTATA CAATGGTCTA CCAAGTGACT GCAAACAGAA 660 

ATCTAAGGGA ATTTCAGGAG TCACCTCATT TATTTTCCTT TTTTTCAGGG ATTATAGTAG 720 

TACTATACCA CCTGTCAGCC AATGTGTGAG AATGGTGGTT TCATACATTT TCTCCAATTC 780 

TTTCTTGXTA CAGGAGCCTC CATTACTTTT CAAACAGTAA TGCAGTTGCA GTTGCCTCTC 840 

CTTTTCTTCA TTATGTGTTT ATCTCTGGCA GTTTGAGCCA AGAGAGGGCA CGGAGAAGTA 900 

CAATGACTAG AGAGCACTTC TGTTGAGGCT CATTCATTGA CCCCTACCCC AGTGCTTTAT 960 

GAATGTGTGC TGCAGATGTC ATACAGCATC ACAGCTTCTT CTCTAATTTA TGAGCCATAA 1020 

TTTTTTTTTT TGTATTTTTA TTGTATTACC TGCTTGGTTG CAAGAGGATG ATGAGGAGGA 1080 

CTACTATAGG ATAAATTTGT TTTTATAGAG CAATTTCTCA TGGGTCACGA GGGATAGTAC 1140 






TCCATTTTCC TCACTGATAT COGCTTGATC ATCTGAGTCA GTGGGCTCAT TGTGGAAACA 1200 
TATGGAATCC TTTATGCTTT TTCTCAGGCT GCTTCTGTTA CATGAAATAA AGCCAGAGTT 1260 
GATTGTGAAA AAAAAAAAAA AAAA 



Seq ID NO: B24 DNA sequence

Nucleic Acid Accession #: CAT cluster



1 11 21 31 41 51

| | | | | |

TOTATACATT CCTTTCAAAT AAAGACCTTG AGAAAACAGC AGAGCCAAGT GAAGATCACC 60 

TAAAGAACTT TGTGGCTAAT TTATACT TCA TGTAGTAGCA GTGGGTACTa GGCAGGGTCT 120 

CTTTCCCACT CTGATGATTT GTGCTCTTAT TTTTCCTAGA TTTACCTCAT CTAGGGCATA 180

TTCXTTTCCC TCTTCCTCTT TACCTTTOC T GGTCCTGATC CCTCTGTACT CAGTTCCCCT 240 

AAATTATTGG ACTACAAACT AATATACTAG AAAAQCATAC ACTTATTTTA TTTGAATGCA 300 

GAAATGCTAT CTATCAGTAT ATATACATAA GAATGTATAT TACAGTATAT CTATATATAC 360 

TTAACACTGT AACTTTCAGT ATTCCCCAGT TAGCGTACCT AACTCTCCTG TGGGTTATGT 420 

TAATTCTTAT TAGACTACTA GAQAAAAACC AACTGGCAGT TTGCTAAGCA TATCTACTGG 480 

TGTTGTTTCT GCG CCCTCTT TTGGCTAATT GATGTAATTA TACTGGCTCT AAAGATT TAC 540 

TGCCCCATAA GTAAATAGTA TAGCCACATI CTGAACATAT CAAAAGTACA AACTTAGGAG 600 

GAGTGTATOT ACAAAAATGT AAAATTITAT GAAAATGAAC ATGTTTTTAT GATGTTATTT 660

C TAGTTCAT A AGAATOTGAT GACTGCTTTG CTTCATTTAT GTACGTTCCC ATTATATTCT 720 

TGCTGTCAAT CAATCACAAA TTTATATCAG ATTAGGATAA ACTAAGCCAT TTTATGTATT 780

TTATTTTAAA CCTTATXTTG GCAGAGTAAT TCCTTAGAAT TGGAAAAGCT GTTACTTlGA 840 

AATTACCAAT TTATTACAAA ACATAGAAAT GTATTGTAGC TACAAAGaCA ACCAAGCATT 900 

TTCTGTGTTT TAATGAATAT CTAAAAAACT ACATTTAGTT TATTTTACTC AGTTTTGAAA 960

TGATTTTTTT ACTGGCTCTA TTGCCTTAAA ATAACTAAGA GATTAATGAT TCTTTGTATA 1020 

ATTTTOCTTT TCXTTGTTCT TTTTTTTACA TTTCGCAGAG TTATATCTAT AGTTTTAGTA 1080 

ACAATTTCTT ATGTATTCTG GATAACTGAA AACAACTAAA OGTGTTGGGC ATTAGAAAAT 1140 

AATTQTGAGC AGTAAGATTA CTGATGTAAT ATGTATGTTG GACTQAAGTA TTTCTTTATA 1200 

AACATTCTAT TTGATTTTAA GCAAAATGTA TGTTAAAGCA TGTTTTTACA TCAGTAAAGT 1260

CATTTGTCGA CCTTCTGGAA ATGAAAGGTT TTTACCTAGA TACTGTAAGT TACACCTCCT 1320 

TAACAATCAT ATTTGTCATT GTTGTTTTCT GCAAACAAAA ATGTTTATGO GCTTCATGTA 1380 

GGCTTAAGAT TGTAGGCAAA AATGGACTGA GTTGAGGACC CTTCAAGCAG TAGGCATTCA 1440 

GTTACAGAGC AGTTGGTACT TTGTAACCCA GACTTACAGT TTAAAAATAT CAAGTTAGCT 1500 

GATGTTTCAT TATAATAAAA ATACTATTTT GCTXAAGAGT TGTATTACAA ATATTTGTGC 1560 

TTAACATTAG AAATAGCTGT TTTAAATTGT AGTTAACATA TTAACTTTTT CAGAAAAAAA 1620 

GCATGGTrTA TTTTTAATAA TGAAATAGAG AACATAATAC GTAATGTTCA GTATAACA3C 1680 

TGAGTTAAAA CATCTGCCAG GATTAACATC AGTGCGTTTT TGCCAATGCA TAGAGGCATT 1740 

TTTCTCTAAG TATGATGGCT AATGATAACT ATTCTTTGTT ACACATTCAA GTCACTCCCA 1800 

TACAAGTAAC TAGTGGGTGA TATGTTTCAC TCCAAAGGGT GTATTAATTC TGAATGCTAA 1860
TCATQAAGAC TTAAGTTAGG ACAACACTTC AAACCAGGAA GTGTGAACTG ATTTA 



Seq ID NO: B25 DNA sequence

Nucleic Acid Accession #: FGENESH predicted

Coding sequence: 1..1144

1 11 21 31 41 51

| | | | | |

ATGQAGCGCG GGATGCACCT CGGTGCAGCG GCOGCCGGCG AGGACGACCT CTTCCTGCAC 60 

AAGAGCCTGA GDGCCTCCAC CTCCAAGCGC TTGGAAQCOG CTTTCCGCTC CACGCCCCCG 120 

GGGATGGACC TGTCCCTGQC GCCGCCGCCT CGGGAACGOC CGGCGTCCTC CTCCTOGTCG 180 

CCCCTGGGCT GCTTCGAGCC GGCTGACCCC GAGGGGGCAG GGCTGCTGXT GCCGCCGCCT 240 

GGAGGAGGCG GCGGCGGCAG CGCGGGAAGT GGGGGCGGCG GCGGCGGCGG GGTGGGTGTC 300 

CCOGGOCTGC TAGTAGGTTC AQCCGGOGTT GGGGGCGACC CTAGCCTAAQ CAGCCTGCCG 360

GCCGGGGCG3 CCCTTTGCCT CAAGTAGGGC GAAAGCGCGA GGCGGGGCTC GGTGGCCGAG 420 

AGCAGCGGCG GCGAGCAGAG CCCCGACGAC OACAGCGACG GTCGCTGCGA GCTCGTGCTG 480

CGGGCCGGAG 1AGCCGACCC GCGGGCCTCC CCGGGAGCG3 GAQGTGGTGG COCGAAQGCA 540 

GCCGAGGGCT GCTOCAATGC CCACCTCCAC GGCGGCGCCA GCGTCCCCCC GGGGGGCCTG 600 

GGCGGCGGCO GCQGCGGGGG TAGCAGCAGC GGTAGCAGTG GCGGCGGTGG CGOTAGCGGT 660 

AGCGGCAGCG GCGGCAGCAG CAGCAGCAGC AGCAGCAGCA GCAAGAAATC CAAAGAGCAA 720 

AAGGCGCTGC GGCTTAACAT CAATGCCCGA GAGCQCCGGC GGATGCACOA CCTGAACGAC 780 

GCGCTGGACG AGCTGCOCGC GGTGATCCCC TACGCGCACA GCCCCTCGGT GCGAAAGCTC 840

TCCAASATCG CCACGCTGCT GCTCGCCAAG AACTACATCC TCATGCAGQC GCAGGCCCTG 900 

GAGGAGATGC GGCGCCTAGI CGCCTACCTC AACCAGGGCC AGGCCATCTC GGCTGCCTCC 960 

CTGCCCAGCT CGGCGGCTGC AGCGGCAGCA GCZGCTGCCC TGCACCCGGC GCTCGGCGCC 1020 

TACGAGCAGG CAGCCGGCTA CCCGTTCAGC GCCGGACTGC CCCCGGCTGC CTCCTGCCCG 1080

GAGAAGTGCG CCCTGTTTAA CAGCGTCTCC TCCAGCCTCT GCAAACAGTG CACGGAGAAQ 1140 
CCTT 



Seq ID NO: B26 Protein sequence 
Protein Accession #: FGENESH predicted



1 11 21 31 41 51

| | | | | |

MERGMHLGAA AAGEDDLFUE 3CSI»SASTSKR LBAAPHSTPP GMDI*SLAPPP HERPA8SSSS 60 

PLQCFEPADP EGAGI»I*LPPP GGQGGGSAGS GGGGGGGVGV PGLLVGSAGV GGDPSLSSLP 120 

AGAALdiXTO ESASaGSVAK SSGOEQSPDD D8DGRCELVL RAGVADPHAS PGAGGGGAKA 180

AEGC6NAHLH GGASVPPGGL GGGGGGGSSS GSSGGGGGSG SGSGGSSSS8 SS3SKKSKEQ 240 

KALRUJINAR ESRRMHDIJJD ALDSLBAVIP YAHSPSVRKI, SKIATLLLAK NYILMQAQAL 300

B&CRRLVAYIj NQGQAISAA3 LPSSAAAAAA AAALHPALGA YEQAAGYPFS AGLPPAASCP 360 
EKCALFNSVS SSLCKQCTEK P 



Seq ID NO: B27 DNA sequence

Nucleic Acid Accession #: CAT cluster




1 11 21 31 41 51 

| | | | | |

TTTTTTTTTT TTTTTTTTAA ATGTTTGATC AAAATATTTT AATAAAGATT CTTTCTGACA 60 

TAGATACACA TACAAATGGT CGTACATAGC TGTCATAGTC TGATTGACCT ATTTAATATA 120

TA.TAT CRT tc tttacacatc caaaacccgc caacagatcc atcacagctc CCAACTCACC 180 

ATCCAACCTG ACAAACTGAA TTTGTATTAT CTGCAAGGAG TGGAAAATAG CAGGACTCCA 240 

TTTTTAAAAA AGATTTTCTT GATTTTCATA GGGCGGAAAG GCAGTCAAAC AGCCATGCAA 300 

AACTAAAACT GAAAGCTCAC TTTGOQTAAA XAGCTTCTTG TTCTTCCTTA GTTTTCTTTC 360 

TTTTTAAATT TTATTTTTTT AGAAAAATAA CAAAGGATTT CACACCATAG GCAAATCAAA 420

CCAGTCTTTT AACTTAAATA ATTCTCCACA GTTAAAATAA CATATATGTA CATATATATT 480 

AAAAGCAATT AAATTAGACC TTTTAAAAAT GCACAQCACA GCCTGGAAAA ATATTTGCTT 540

AGCATGTTCT TACGTATAGC AACTATTQCT GTGATGTTTT TCCCTTTTGG AATGTAAAGG 600 

AGTCCCCTTT CAAAAAAGAG ATCAATTCAT TCATCAATTA AGAATACACC TTTCCTGTAA 660 

TTTTTGGACT GAAGCAATTT ATTAAAQCTC AATTTAAATA CAGGQATGAT GCAACTGAAA 720

ATATCCAGGT GACCTTTCAT AAAGCTAAGC AGCTCAGATA CAT CAAT ATT TCTCTTCATA 780

CTTGTTQGCA AATAAACCTT TAAACACTTG GCACACAGCA TAAGTAATCT ATAAAACAAT 840

TTTAGAGGCA TTAAAAAAAT CTGCACATAA GACCCATGAC TTTAACACAT GATAAATACT 900 

GTTGATGTGG AAGGGTCATT GAAGAATAAC AAATAAATAC CATQAATTGT TAATACATCA 960 

TTGCAGAGTA GAARGTAACA AGGTGCACAT AAATATTTTT AAATGCAATT CTTTCAGCCA 1020

CAGTCAGTTT TTTTATATCA CTCTOGCCAA AACTTTGAGC ATTTTCACAG GATTAAAGTT 1080 

CAGAGACAAT AAAAAATACA AGTCTTTCAT AGTAACATGT TCTCTCTCTC TCGCTCTGGG 1140 

TTTTATTTCC CCCTCACTTA AQTCAACATT TCAAGTTTTT CTCCTGGCTC AGAATCAAAA 1200

TTTATTTTCA AGTGCCCTTT CTGATTTGTC TGAATGAATA TTCCATCTCT CATGCTACCG 1260 
ATCCGCT



Seq ID NO: B28 DNA sequence
Nucleic Acid Accession #: NM_002581.2
Coding sequence: 368..5251

1 11 21 31 41 51

| | | | | |

GGGGGAGGGA ATTCAGCQGA TCAGTCTTAA GAGGAGCTTT TTTTTGGAGC GAGAAATCAT 60 

ATAAAATAAA ATGAAATAAA ACAAGGAGGA AGGCRAC5CAG CTGTTAGGGG GAAAATAAGG 120 

CAGATAAAGG AGCGGGGAGA GAAATTAATT GCCAACCAGG AGGAGTTGGG CTGTATTTTT 180

CAAAGGTGGG GAGAGTGGAG CACACACCTT GAGGAGGAAA GCGAGAAAGA AAAGAAAAAA 240 

GCAAGTGAAG GGGGGCTCGC CCAAGAAGGG TGAAGAAGOG AAGAAAGTCG AGGCGOCGAG 300 

GCTCCCAAAG CTGGCAGCTC OQGGTGGCGG TGCAGGGGCG AAGGGGGGGG CGGGGGGAAC 360 

GTCGGACATG OGGCTCTGGA GTTGGGTGCT GCACC1GGGG CTGCTGAGCG CCGCGCTGGG 420 

CTGCGGGCTG GCCGAGCGTC CCCGCCGGGC COGGAGAGAC CCGGGGGCGG GCCGACCCCC 480

GCGCCCCGCC GCCGGCCCGG CCAOCTGCGC CACCCGCGGC CCGCGGCCGC CGCGCCTOGC 540 

CGCCGCCGCC GCCGCOGCCG GGCGTGCCTG GGAAGCCGTG CGCGTCCCCC GGOGGCGGCA 600 

GCAGCGGGAG GCGAGGGGGG CCACCGAGGA GCCGAGCCCG CCGAGCCGGG CGCTCTATTX 660 

CAGCGGGCGA GGCGAGCAGC TGCGAGTCCT CGGGGCOGAC CTCGAGCTGC CCOGGGAOGC 720 

GTTCACGCTG CAAGTGTGGC TGCGAGCGGA GGGGGGCCAG AGGTCTCOGG CAGTGATCAC 780

AGGGCTGTAT GACAAATGTT CTTATATCTC AOGTGAOGGA GGATGGGTCQ TGGGCATTCA 840 

CACCATCAGT GACCAAGACA ACAAAGACCC AOGCTACXTT TTCTCCTTGA AGACAGADCG 900 

AGOCCGGGAA GTGACCACCA TCAATGCCCA CCGCAGCTAC CTCCCAGGCC AGTGGGTATA 960 

CCTAGCTGCC ACCTATGATG GGCAGTTCAT GAAGCTCTAT GTGAATGGTG CCCAGGTGGC 1020 

CACCTCTGGG GAACAAGTGG GTGGCATATT CAGCCCACTG ACGCAGAAGT GCAAAGTGCT 1080

CATGTTAGGG GGCAGTGCCC TGAATCACAA CIACCGGGGC TACATCGAGC ACTTCAGTCT 1140 

GTGGAAGGTG GCCAGGACTC AGCGGGAGAT ACTGTCTGAC ATGGAAACCC ATGGOGCCCA 1200 

CACTGCTCTA CCICAGCTOC TOCTCCAGGA GAACTGGGAC AATGTGAAGC ATGCCTGGTC 1260 

CCCCATGAAG GATGGCAGCA GCCCCAAAGT GGAATTCAGC AATGCCCACG GCTTTCTGCT 1320 

GGACACGAGT CTGGAGCCTC CTCTGTGCGG ACAGACATTG TGTGACAACA CAGAGGTCAT 1380

TGCCAGCTAC AATCAGCTCT CAAGTTTCCG CCAGCCCAAG GTGGTGCGCT ACCGCDTGGT 1440 

CAAOCTCTAX GAAGATGATC ATAAGAACCC GAGGGTGACG CGCGAGCAGG TGGACTTGCA 1500 

GCACCATCAG CTGGCTGAGG CCTTCAAGCA ATACAACATC TCCTGGGAGC TGGAOGTGCT 1560 

GGAGGTGAGC AACTCCTCCC TTCGCCGCCG CCTCATOCTG GCCAACTGTG ACATCAGCAA 1620 

QATTGGGOAT GAGAACTGTG ACCCCGAGTG CAACCAjCACG CTGACGGGOC ACGACGGCGG 1680

GGATTGCCGC CACCTGCGCC ACCCTGGCTT CGTGAAGAAG CAGCACAACG GGGTGTGTGA 1740 

CATGGACTGC AACTATGAAC GGTTCAACTT TGATGGTGGA GAGTGCTGTG ACCCTGAAAT 1800 

CACCAATGTC ACTCAGACTT GCTTTGACCC CGACTCTCCA CACAGAGCCT ACTTGGATGT 1860 

TAATGAGCTG AAGAACATTC TTAAATTGGA TGGATCAACA CATCTCAATA TTTTCTTTGC 1920 

AAAATCCTGA GAGGAGGAGT TGGCAGGAGT AGCAACTTGG CCATGGGACA AGGAGGCOCT 1980

GATGCACTTA GGTGGCATTG TCTTGAACCC ATCTTTCTAT GGCATGCCTG GGCACACCCA 2040 

CACCATGATC CATGAGATTG GTCACAGCCT GGGCCTCTAT CACGTCTTCC GAGGCATCTC 2100 

AGAAATCCAG TCCTGCAGTG ACCCCTGCAT GGAGACAGAG CCCTOCTTGS AGACTGGAGA 2160 

CCTCTGCAAT GATACCAACC CAGCOCCTAA ACACAAGTCC TGTGGTGACC CAGGGOCAGG 2220 

AAATGACACC TGTGGCTTTC ATAGCTTCTT CAACACTCCX TACAACAACT TCATGAGCTA 2280

TGCAGATGAC GACTGTACGG ACTCCTTCAC GCCCAATCAA GTCGCCAGAA TGCACTGTTA 2340 

CCTGGACC1G GTCTACCAGG GCTGGCAGCC CTCCAGGAAA CCAGCGCCTG TTGCCCTCGC 2400 

CCCCCAAGTT CTGGGCCACA CAACGGACTC TGTGACACTG GAGTGGTTCC CACCTATAGA 2460 

TGGCCATTTC TTTGAAAGAG AATTGGGATC AGCATGTCAT CTTTGCCTGG AAQQGAGAAT 2520

CCTGGTGCAG TATGCTTCCA ACGCTTGCTC CCCAATGCCC TGCAGCCCAT CAQ3ACACTG 2580

GAQCCCTCGT GAAGCAGAAG GTCATCCTGA TGTTGAACAG CCCTGTAAGT CCAGTGTCCG 2640 

CACCTGGAGC CCAAATTCAG CTGTCAACCC ACACACGGTT CCTCCAGCCT GCCGTGAGCC 2700 

TCAAGGCTGC TACCTX3GAGC TGGAGTTCCT CTACCCCTTG GTCCCTGAGT CTCTGACCAT 2760 

TTGGGTGACC TTTGTCTGCA CTGACTGGGA CTCTAGTGGA GCTGTCAATG ACATCAAACT 2820

GTTGGCTGTC AGTGGGAAGA ACATCTCCCT GGGTCCTCAG AATGTCTTCT GTGATGTCCC 2880

ACTGAOCATC AGACTCTGGG ACGTGGGCGA GGAGGrGTAT GGCATCCAAA TCTACAOGCT 2940 

GGATGAGCAC CTGGAGATCG ATGCTGCC3VT GTTGACCTCC ACTGCAGACA CCCCACTCTG 3000 

tctacagtgt aagcccctga agtataaggt ggtcogggac CCTOCTCTOC AGATGGATGX 3060 

GGCCTCCATC CTACATCTCA ATAGGAAATT CGTAGACATG GATCTAAATC TTGGCAGTGT 3120 
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GTACCAGTAT TGGGT.CATAA CTATTTCAGG AACTGAAQAG AGTOAGCCAT CACCTGCTGT 3180 

CACATACATC CAXGOACGTG GGTACTGTGG CGATGGCATT ATACAAAAAG ACCAAGGTGA 3240

ACAftTGOGAC GACATGAATA AGATCAATGG TGATGQCTGC TCOCTTTTCT GCCGACAAGA 3300 

AGTCTCCTTC AATTGTATTG ATGAACCCAQ CCGGTGCTAT TTCCATGATG GTGATGGGGT 3360 

ATGTGAGGAG TTTGAACAAA AAACCAGCAT TAAGGACTQT GGTGTCTACA CGCCCCAGGG 3420 

ATTCCTGGAT CAGTGGGCAT CCAATGCTTC AGTATCTCAT CAAGACCAG C AATGCCCAGG 3480 

CTGGGTCATC ATOGGACAGC CAGCAGCATC CCAGGTGTGT CGAACCAAGG TGATAGATCT 3540 

CAGTGAAGGC ATTTCCCAGC ATGCCTGGTA CCCTTGCACC ATCAGCTACC CATAT T CC CA 3600 

GCTGGCTCAG ACCACTTTTT GGCTCCGGGC GTATTTTTCT CAACC&ATGG TTGCCGCAGC 3660 

TGPTCATTGTC CACCTGGTGA CGGATGGGAC ATATTATGGG GACCAAAAGC AGGAGACOAT 3720 

CAGCGTGCAG CTGCTTGATA CCAAAGATCA QAGCCACGAT CTAGGCCTCC ATGTCCTGAG 3780 

CTGCAGGAAC AATCCCCTGA TTATCCCTGT GGTCCATGAC CTCAGCCAGC CCTTCTACCA. 3840 

CAGCCAGGCG GTACGTGTGA GCTXCAGTTC OCCCCTGGTC GCCATCTCGG GGGTGGCCCT 3900 

CCGTTCCTTC GACAACTTTG ACCCCGTCAC CCTGAGCAGC TGCCAGAGAG GGGAGACCTA 3960 

CAGCCCTGCC GAGCAGAGCT GCGTGCACTT CGCATGTGAG AAAACTGACT GTCCAGAGCT 4020

GGCTGTGGAG AATGCTTCTC TCRATTGCTC CAGCAGCGAC CGCTAOCACG GTGCCCAGTQ 4080 

rACTGTGAGC TGCOGGACAG GCTACGTGCT CCAGATACGG CGGGATGATG AGCTGATCAA 4140 

GAGGCAGACG GGACCCAGCG TCACAGTGAC CTGTACAQAG GGCAAGTGGA ATAAGCAGGT 4200 

GGOCTGTBAG CCAGTCGACT GCAGCATCOC AGATCACCAT CAABTCTA.TG CTGGCTCCTT 4260 

CXCCTGCCCT GAGGGCACCA CCTTTGGCAG TCAATGTTCC TTCCAGTGCC GTCACCCTGC 4320 

ACAATTGAAA GQCAACAACA GCCTCCTGAC CTGCATGGAG GATGGGCTGT GGTCCTTGCC 4380 

AGAGGCCCTG TGTGAGCTCA TGTGCCTCGC TCCACCCCCT GTGCCCAATG CAGACCTCCA 4440 

GACCGCCOOG TGCCGAGAGA ATAAGCACAA GGTGGGCTCC TTCTQCAAAT ACAAATGCAA 4500 

GCCTGGATAC CATGTGCCTG GATCCTCTCG GAAGTC&AAG AAACGGGCCT TCAAGACTCA 4560 

GTGTACCCAG GATGGCAGCT GGCAGGAGGG AGCTTGTGTT CCTGTGACCT GTGACCCACC 4620 

TCCACCAAAA TTCCATGOGC TCTACCAGTG TACTAATGGC TTCCAGTTCA ACAGTGAGTG 4680 

TAGGATCAAG TGTGAAGACA GTQATGCCTC CCAGGGACTT GGGAGCAATG TCATTCATTG 4740 

CCGGAAAGAT GGCACCTGGA ACGGCTCCTT CCATGTCTGC CAGGAGATGC AAGGCCAGTG 4800 

CTCGGTTCCA AACGAGCTCA AGAGCAAOCT CAAACTGCAG TGCCCTGATG GCTATGCCAT 4860 

AGGCJTCGGAG TGTGCCACCT CGTGCCTGGA CCACAACAGC GAGTCCATCA TCCTGCCAAT 4920 

GAACGTGAOC GTGCGTQACA TCCCCCACTG GCTGAACCCC ACACGGGTAG AGAGAGTTGT 4980 

CTGCACTGCT GGTCTCAAGT GGTATCCTCA CCCTGCTCTG ATTCACTGTG TCAAAGGCTG 5040 

TGAGCCCTTC ATGGGAGACA ATTATTGTGA TGCCATCAAC AACCGAGCCT TTTQCAACTA 5100

TGACGGTGOQ GATTGCTGCA CCTCCACAGT GAAGACCAAA ARGGTCACCC CATTCCCTAT 5160 

GTOCXGTGAC CTACAAGGTG ACTGTGCTTG TCGGGACCCC CAGGCCCAAG AACACAGCCG 5220 

GAAAGACCTC CGGGGATACA GCCATGGCTA AGGAAGGACA AOAAGTTGTC AAAGAATTCC 5280

CAACGCCAGG AOCCACATCC CTTTGGTATT QATTTCACAG TCAGCTGCTC AACOGAATGG 5340 

CCTCTCCACA CCAGGGATCC TTAGCACCCA AOCGGTCTGC CTTTAATTTT AOCCAGGAAG 5400 

GACTCACATT GGGGCGAATG AACCAAGTTT OGOCATGCTG GATGATGAAA TGGATTCCCA 5460 

TOCCAAAGTC TGAGATGGAT T G CAT AT ACA GTGTGCAGTC CCAGAGCCTC CTAAAATTCT 5520 
AGCCATTTGT CACACAACCA CAGCAAAAAA AAA 

Seq ID NO: B29 Protein sequence
Protein Accession #: NP_002572.1



1 11 21 31 41 51

| | | | | |

MRLWSWVIiHIi GLLSAALGCG 1AERPRRARR DPHAGRPPRP AAGPATCATR GPHPPRLAAA 60
AAAAGRAWKA VRVPRRRQQR BARGATEEPS PPSHAIiYFSG HGEQIiRVIiRA PI»ELPSDAFT 120
LQVHLSAEGG QHSPAVITGL YDKCSYI9RD RGWWGIHTI SDQDNKDPKY FFSLKTDRAR 180
QVTTIKAHRS YLFGQWVYLA ATYT3GQFMKX, YVNGAQVAXS GEQVQGIFSP IiTQKCKVi*t^i 240
GG3ALNHTTYR GYIEHFSLWK VAHTQREILB DMETEGAHTA LPQI^LLQSNW DNVKHAWSPM 300
KDGSSPKVEF SKAHGFLLDT SLEPPLCGQT liCDHTBVXAS YKQI»SSFRQP KWHYRWNL 360
YEDDHKNPTV TREQVDFQHH QliAKAFXQYN ISWELDVLEV SMSfilJlRRLI LANCDISKia 420
DKNCDPBCNH TLTGHDGGOG RHLRKPAFVK KQHNGVCDMD CKYERFNFD3 GBCCDPEITH 480
VTQTCFDPD3 PBRAYfcDVHB LKUIIiKLDGS TELNIPFAKB SEEELAGVAT            540
LGGZVLNPSP YGMPGHTHTM IEEIGHSLGIj YHVFRGISBI QSCSDPCHBT EPSFETGDLC 600
HDTMPAPKHK SCGDFGPGND TCGPHSFPNT PYMNFMSYAD DDGSDSPTPN QVABMHCYIiD 660
liWQGWQPSR KPAPVALAPQ VLGHTTD SVT KEKPPPIDGH FPERELGSAC HliCLEGRILV 720
QYASNASSPM PCSPSGHWSP REAEGHPOVE QPCKBSVRTW SPNSAVNPHT VPPACPEFQG 780
CYLEIiEFl*YP LVPESLTIWV TFVSTDWESS GAVKDIKLLA VSGKNISLGP QNVFCDVPT/r 840
rRLWDVGKEV YGIQlYTliDE HLEIDAAKLT STADTPLCLQ CKPIiKXKWR DEPIiOMOVAS 900
ILHLiNHKFVD MDLNLGSVYQ YWVITIBGTE E3EPSPAVTY IHGHGYGGDG IIQKDQGEQC 960
DDKH^XNGOa CSLFCRQEVS FKCIPKPSRC YFHDGDGVCE EFEQKT3I3CD CGVYTPQGPI» 1020
DQWAGMASVS HQDQQCPGWV IIGQPAASOV CRTKVID1SE GIBQHAWYPC TISYPYSQIA 1080
QTTFWIiRAYF SQPMVAAAVI VHKVTDGTYY GDQKQETISV QLXiDTKDQSH            1140
MNPLIIPWH DLSQPFYESQ AVHVSFSSPL VAISGVAUifl FEHEDPVTLS SCQRGETYSP 1200
ABQSCVHFAC EKTDCPELAV EKAST^NCSSS DRYHGAQCTV SCRTGYVLQ1 tt&DDBLIKSQ 1260
TGPSVTVTCT EGKKNXQVAC EPVDCGIPDH HOVYAAfiPSC PEGTTFGSQC SFQCEHFAQIi 1320
KGHNSOjTCM EDGU9SFPEA LCELMCLAPP FVPNADLQTA RCRENKHKVG SFCKYKCKPG 1380
YEVFGSSHKS KKRAPKTQCT QDG9WQEGAC VPVTCDPPPP KFHGLYQCTK GFQFNSECRI 1440
1CGEDSDASQG LGSNVIHCRK DGTKNGSFHV CQEMQGQCSV PNEXWSKIiKI, QCPDGYAIGS 1500
ecatscldbn sesiilpmnv tvrdiphwln pteverwct AGLKMYPHPA LIBCVKGCEP 1560
PMGE85YCDA1 KKTRAFCNYDG GDCCTSTVKX KKVTPFPMSC DLQODCACHD            1620
LRGYSHG



Seq ID NO: B30 DNA sequence 

Nucleic Acid Accession #: NM_032808.1

Coding sequence: 61..1008

1 11 21 31 41 51

| | | | | |

CTAGTCTATC TCCGCTTCCT CAACCTCTCC TACRAOCCCA TCAGCACCAT TGAGGGCTCC 60
ATQTTGCATG AQCTGCTCCH GCTGCAGGAG ATCCAGCTGG TGGGCGGGCA GCTGGCCGTG 120
GTGGAGCCCT ATGCCTTCCG CGGCCTCAAC tacctgcgcg TGCTCAATGT CTTCTGGCAAC 180
CAGCTGACCA CACTGGAGGA ATTAGXCTTC cactgggtqg GCAACCTQGA GACACTCATC 240
CTGGACTCCA ACCCGCTGGC CTGCGACTGT cggctcctgt GOGTGTTCCG GCGCCGCTGG 300
CQQCTCAACT TCAACCGGCA GCA0CCCACG tgcgccacgc CCGAGTTTGT CCAGGGCAAG 360
GAGTTCAAGG ACTTCCCTGA TGTGCTACTG cccaactact TCACCTGCCG CCGCGCCCGC 420
ATCCGGGACC GCAAGGCCCA GCAGGTGTTT gtggaogagg GCCACACGGr GCAGTTTGTG 480
TGCCGGGCCG ATGGCGACCC GCCGOCCGCC ATCCTCTGGC TCTCACCCCG AAAGCACCTG 540
GTCTCAGCCA AGAGCAATGG GCGGCTCACA            ATGGCACGCT GGAGGTGCGC 600
I ACQ CCCAGG TACAGGACAA CGGCACGTAC CTGTQCATCG CGGCCAACGG GGGCGGCAAC 660
GACTCCATGC CCGDCCACCT GCATGTGCGC AGCTACTCGC CCGACTGGCC CCATCAGCCC 720
AACAAGACCT TCGCTTTCAT CTCCAACCAG CCGGGCGAOG GAGAGGCCAA CAGCACCCGC 780
GCCACTGTGC CTTTCCCCTT CGACATCAAG ACCCTCATCA TCGCCACCAC CATGGGCTTC 840
ATCTCTTTCC TGGGOGTCGT CCTCTTCTGC CTGGTGCTGC TGTTTCTCTG GAGCCGGGGC 900
AAGGGCAACA CAAAGCACAA CATCGAGATC GAGTATGTGC CCCGAAAGTC GGACGCAGGC 960
ATCAGCTCCG CCGACGCGCC CCGCAAGTTC AACATGAAGA TGATATGAGQ CCGGGGCGGG 1020
GGGCAGGGAC CCCCGGGCGG CCGGGCAGGG GAAGGGGCCT GGCGGCCACC TGCTCACTCT 1080
CCAQTCCTTC CCACCTCCTC CCTACCCTTC TACACACGTT CTCTTTCTCC CTCCCGCCTC 1140
CGTCCCCTGC TQCCCCCOGC CAGCCCTCAC CACGTGCCCT CCTTCTACCA GGACCTCAGA 1200
AGCCCAGACC TGGGGACCCC ACCTACACAG GBGCATTGAC AGACTGGAGT TGAAAGOCGA 1260
CGAACOGACA CGCGGCAGAG TCAATAATTC AATAAAAAAG TTACGAACXT TCTCTGTAAC 1320
TTGGGTTTCA ATAATTATGG ATTTTTATGA AAACTTGAAA TAATAAAAAG AGAAAAAAAC 1380
TATTTCCTAT AQCTAGTCGG AATGCAAACT TTTGACGTOC 1GATTGCTCC AGGGCCCTCT 1440
TCCAACTCAG TTTCTTGTTT TTCTCTTCCT CCTCCTCCTC TTCTTOCTCC TTTCTCTTCT 1500
CTTCCCCAGT GGGGAGGGAT CACTCAGGAA AACAGGAAAG GAGGTTOCAG CCCCACX^CAC 1560
CTGCCCACCC CGCCCCAGGC ACCATCAGGA GCAGGCTAGG GGGCAGGCCT GGGCCCAGCT 1620
CCGGGCTGGC TTTTTGCAGG GCQCAGGTGG AGGGGACAGG TCTGOCGATG GGGGTGGGAG 1680
CCTGTCTGCT GGGCTGCCftG GCGGCACCAC TGCAAGGGGT GGGAGCCTGG CTTGGGTGTG 1740
GCTGAGACTC TGGACAGAGG CTGGGGTCCT CCTGGGGGAC AGCACAGTCA GTGGAGAGAG 1800
CCAGGGGCTG GAGGTGGGGG CCACCCCAGC CTCTGGTCCC AGCTCTGCTG CTCACTTGCT 1860
GTGTGGCCTC AAGCAGGTCA CTGGCCTCTC TGGGCCTGAG TCTCCACATC TGTACAAATG 1920
GGAACATTAC CCCCTGCCCT GCCTACCTGA CAGGGCTGTT GTGAGGAATT GATGAGATGA 1980
TGTATGTGAA ACACTTTGTA ACCTGTAAAQ CGCTGTGCAC ACGTG

Seq ID NO: B31 Protein sequence
Protein Accession #: NP_116197.1

1 11 21 31 41 51

| | | | | |

M&HHLIfilQE XQLVGGQLAV VEFYAFRGLN YfcRVLNVSGN QLTTKEELVF HSVGNXiEXLl 60
liDSNPLACDC RI/LWVFRRRW HLNFNRQQPT CATPEFVQGK EFKDFPDVLli PNYPTCRRAR 120
1PDRKAQQVF VDEGHTVQFV C&ADGDPPPA ILWIiSPRKHIj VSAKSNGRLT VFPEGTIiEVR 180
YAQVQDNGTY I^IAANAGGN DSMPAHLHVR SYBPDWPHQP UKTFAFISNQ PGEGEftNSTR 240
ATVPPPFDZK TLIIATTMGF I3FLGWLFC LVLLFLWSRG XGNTKHNIEI EYVPEKSDAG 300
ISBADAPRKF KMKMl



Seq ID NO: B32 DNA sequence

Nucleic Acid Accession #: CAT cluster



i 
I 

CAGCACATAC 
GTAATAACCT 
GTATTCGGTA 
AACCAGGAOG 
TGGCACAGCA 
GAAAGTGGCG 
AGCCTTTCCG 
GGGTATAGAA 



11 

i 

AAGAAACATA 
AGTGAGTGAG 
CGAAGGAAQG 

CTQACAAAGG 
GGGGTGGTCC 
TCACGGGCAG 
A 



21 
I 

CAGTGTACCT 
TTGTGATGTC 
CAGGCTGGTC 
GCCCAGAGGT 
TAGAGGGAAA 
TTGGAAAACA 
GGACGCCTTG 



31 
I 

CAAAAGGGGC 
ATCTGGAACA 
CTGGCAGGAA 
GGAGGAGGGC 
TGTAATAGCA 

ATTTAAAAAA 



41 
I 

CCTTGAAATG 
TAGGAAATGG 
GTAAATGATA 
GGTCAGGGTG 
CATCTACGCT 
TCTTGGCAGG 
AATAAATAAA 



51 

i 

T CAT CAAAGG 
GGCTCTTAGC 
ATCXTTGGGA 
GGGTCTACAG 
GCAGTCTGGT 
AATTAGTGAC 
TAAAACGTCT 



60 
120 
180 
240 
300 
360 
420



Seq ID NO: B33 DNA sequence

Nucleic Acid Accession #: NM_006174.1

Coding sequence: 71..1408



1 
1 

GAAAGGCTAT 
AGACTATAAT 
TAATACTGCT 



GGGGAATCTA 

cttcctcata 
cacactgacg 

GCCTTTTCTT 
TGTCAGGTAT 
CTTTCTGATA 
TCACAOTCTT 
ATGTGTTGAG 
AGTTCAGTAT 
TATAAGCTGT 
AACTCTTCAT 
GAGTT ATT CA 
AOCTGCTCCA 



TGAGATAAAA 



1L 
I 

CGGTAACAAC 
ATGGATTTAG 
GOCACTCGGA 
CAGTATTITC 
CTTATTTTAA 
GGCAATCTGG 
TCTGTCTTGC 
CAATGTGTGT 
CATATGATAA 
GCTACTGTCT 
GTGGAACTTC 
TCATGGCCAT 
ATTCTGCCCT 
GGATTGTCXZA 
CCATCCAAAA 
TTCATCAAAA 
GAAAGACCTT 
AGTCAGCTCT 
CCTGAAGAAA 



21 
I 

tgacctgcca 

AGCTCGACGA 
ATTCTGATTT 
TGATTGGGCT 
TGGCTCTTCAT 
CCTTTTCTGA 
TG GATC AGTG 
CAGTTTTGGT 
AACATCCCAT 
GGACACTAGG 
AAGAAACATT 
CTGATTCATA 
TAGTTTQTCT 
ACAAAGAAAA 
AGAGTGGGCC 
AACACAGAAG 
CTCAAGAGAA 
CTTCATCGAG 
ATTCAGATGT 



31 
I 

CAAAGTTAGA 
GTATTATAAC 
COCAGTCTGG 
CTATACATTT 
GAAAAAGCGT 
TATCTTGGTr 
GATGTTTGGC 
TTCAACTTTA 
ATGTAATAAT 
TTTTGCCATC 
TGGTTCAGCA 
CAGAATTGGC 
TACTGTAAGT 
CAGACTTGAA 
TCAGGTGAAA 



CCACTCCAQA 
TAAGTTCATA 
TCATGAATTG 



41 
I 

AGAAAGGATT 
AAGACACTTO 
GATGACEATA 
GTAAGTCTTC 
AATCAGAAGA 
GTGCTGTTTT 
AAAGTC7VTGT 
ATT TTAAT AT 
TTAACAGCAA 
TGTTCTCCCC 
TTGCTGAGCA 
TTTACTATCT 
CATACAAGTG 
GAAAATGAGA 
CTCXCTGGCA 
AAGAAGAGAG 
ATACTTCCAG 
CCAGGGGTCC 



51 
I 

GATTCAAGAA 
CCACAGAGAA 
AAAGCAGTGT 
TTGGCTTTAT 
CTACGGTAAA 
GCTCACCTTT 
GCCATATTAT 
CAATTGCCAT 
ACCATGGCTA 
TTCCSGTGTT 
GGAGGTATTT 
CTTTATTGCT 
TCTGCAGAAG 
TGATCAACTT 
GCCATAAATG 
CATGTGTGTT 
AAAACTTTGG 
CCACTTGCTT 
GTTCTGTTAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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AAGAATAAAA AAGAGATCTC GAAGTGTTTT CTACAGACTG ACCATACTGA TATTAGTATT 

tgctgttagt tggatgccac tacacctttt ccatgtgota actgatttta atgacaatct 
tatttcaaat aggcatttca agttggtgta ttgcatttgt catttgttgg gcatgatgtc 
ctqttgtctt aatccaattc tatatgggtt tcttaataat gggattaaag ctgatttagt 
gtcccxtata cactgtcttc atatgtaata attctcactg ttt 

Seq ID NO: B34 Protein sequence
Protein Accession #: NP_006165.1



1200 
1260 
1320 
1380 



1 11 

1 1 

MDLELDEYYN KTLATEHNTA 
LJIiMALMKKR NQKTTVNFLI 
QCVSVLVSTL ILISIAIVRY 
VSLQETFGSA IiLSSRYLCVE 
GLSNKENRUE FNEMINUTItfi 



21 
I 

ATRNSDFPVH 
GNIiAFSDlLV 
HMIKHPISHN 
SWPSD SYRIA 
PSKKSGPQVK 



KRSRSVFYRL THOXOTKV& 
NPILYGFUHEt GIKADLVSLI 



WMP3uHliFHW 
HCLHM 



31 
I 

DDYKSSVDDI* 
VLFC3PPTLT 
IjTANHG Y FIj I 
FTI ST tTjT.tVQ Y 
IiSGSHKWSYS 
PGVPTCFBIK 
TDFNDNLISN 



41 
\ 

QYPIilGLYTP 
SVI1I1DQWMFG 
ATVKTLGFA1 
IU?LVCLTVS 
FTKKHRKRYS 



HHFKLVYCIC 



51 

I 

VSLLGFMGNL 60 

KVMCHIMPFL 120 

CSPLFVFH9I* iao 

HTSVGRSIBC 240 

KKTACVTjPAP 3 do 

RVKRSVTRIK 360 

HLLGMMSCd. 420 



Seq ID NO: B35 DNA sequence

Nucleic Acid Accession #: NM_014279.1

Coding sequence: 286..1689

1 11 21 31 41 51

| | | | | |

GCGCGGGGGA GCCATTAQGA GGOGAGGAGA GAGGAGGGCG CAGCTCCCGC CCAGOCCAGC 60 

CCTGCOC3K3C CCTGCCCGGA GGCftGACGCG CCGGAACCGG GACGCGATAA ATATGCAGAG 120 

CGGAGGCTTC GCBCAGCAGA GCOCGCDCGC CGCCCGCTCC GGGTSCTGAA TCCAGGCGTG 180 

GGGACAGGAG CCAGGCGCCG CCGCCGGAGC CAGCOGAGCC GGGGCCAGAG CCGGAGCGCG 240 

TCCGCGTCCA CGCAQCOGOC GGCCGGCCAG CACCCAGGGC CCTGCATGCC AGGTCGTTGG 300 

AGGTGGCAGC GAGACATGCA COCGGCCOGG AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 360 

ATGGGCACTG AACTCACTCA AGTGCTGCCC ACCAACCCTG AGGAGAGCIG GCAGGTGTAC 420 

AGCTCTGCCC AGGACAGOGA GGGCAGGTGT ATCTGCACAG TGGTOGCCCC ACAGCAGACC 480 

ATGTGTTCAC GGGATGCCCG CACAAAACAG CTGAGGCAGC TAOTGGAGAA GGTGCAGAAC 540

ATGTCTCAAT CCATAGAGGT CTTGGACAGG CGGACCOflGA GAGACTTGCA GTAOGTGGAG 600 

AAGATGGAGA ACCAAATGAA AGGACTGGAG TCCAAGTTCA AACAGGTGGA GGAGAGTCAT 660 

AAGCAACACC TGGCCAGGCA GTTTAAGGCG ATAAAAGCGA AAATGGATGA ACTTAGGCCT 720 

TTGAIAOCTG TQTTGGAAGA GTACAAGGCC GATGCCAAAT TGGTAT2GCA GTTTAAAGAG 780 

GAGOTCCAGA ATCTGACGTC AGTGCTTAAC GAGCTGCAAG AGGAAATTGG CGCCTATGAC 840 

TACGATGAAC TTCAGAGCAG AGTGTCGAAT CTTGAAGAAA GGCTO0GTGC ATGCATGCAA 900 

AAACTAGCTT GCGGGAAGTT GACGGGCATC AGTGACCCCG TGACTGTCAA GACCTCOGGC 960 

TCGAGGTTCG GATCCTGGAT GACAQACCCT CTCGCCCCTG AAGGCGATAA CCGGGTGTGG 1020 

TACATGGACG GCTATCACAA CAACOGCTTC GTACGTGAGT ACAAGTCCAT GGTTGACTTC 1080 

ATGAACACGG ACAATTTGAC CTCCCACCGT erCCCJCCACC CCTGGTCGGG CACGGGGCAG 1140 

GTGGTCTACA ACGGTTCTAT CTACTTCAAC AAGTTCCRGA GCCACATCAT CATCAGGTTT 1200 

GACCTGAAGA CAGAGACCAT CCTCAAGACC CGCAGCCTGG ACTATGCCGG TTACAACAAC 1260 

ATGTACCACT ACGCCTGGGG TGGCCACTCG GACATCGACC TCATGGTGGA CGAGAGOGGG 1320 

CTGTGGGCCG TGTACGCCAC CAACCAGAAC GCXGGCAACA TCGTGGTCAG TAGGCXGGAC 1380 

CCCGTGTCOC TGCAGACCCT GCAGACCTGG AACACGAGCT ACCCCAAGCG CftGCSSOCGGG 1440 

GAGGCCTTCA TCATCTGCGG CACGCTGTAC GTCACCAACG GCTACTCAGG GGGTAOCAAG 1500

GTCCACTATG CATACCAGAC CAA1GCCTCC ACCTATGAAT ACATGGACAT CCCATTCCAG 1560 

AACAAATACT CCCACATCTC CATGCTGGAC TACAACCCCA AGGACCGGGC CCTOTATGCC 1620 

TGGAACAACG GCCACCAGAT CCTCTACAAC GTGACCCTCT TCCACGTCAT GCGCTCCGAC 1680 

GAGTTGTAGC TCCCTCCTCC TGGAAGCCAA GGGCCCACGT CCTCACCACA AAGGGACTCC 1740 

TGTGAAACTG CTGCCAAAAA GATACCAATA ACACTAACAA TACCGATCTT GAAAAATCAT 1800 

CAGCAGTGCG GATTCTGACA TCOAGGGATG GCATTACCTC CGTGTTTCTC CCTTTCGAGC 1860 

CGGCGGGCCA CAGACGTCGG AAGAAACTCC OGTATTTGCA GCTGGAACTG CAGCCCACGG 1920 

CGCCCCGGTT TTOCTCCCCG CCCTGTCCCT CTCTGGTCAA ACAACATACT AAROAGGGGA 1980 

GGCAATGACT GTTGGCCAGT TCTCACCGGG GAAAAACCCA CTOTTAGGAT GGCATGAACA 2040 

TTICCTTAGA TOGTGGTCAG CTCOGAGGAA TGTGGOGTCC AGGCTCTTTG AGAGCCATGG 2100 

GCTGCACCOG GCCGTAGGCT AGTBTAACTC GCATCOCATT GCAGTGCCGT TTCTTGACTG 2160 

TGTTGCTGTC TCTTAGATTA ACCGTGCTGA GGCTCCACAT AGCTCCTGGA CCTGTGTCTA 2220 

GTACATAGTG AAGCGATGGT CAGAGTGTGT AGAGTGAAGT TGCTGTGCCC ACATTGTTTG 2280 

AACTCGCGTA GCCCGTAGAT ACATTGTGCA ACGTTCTTCT GTTATECCCT TGAGGTGGTA 2340 

ACTTGGTATG TTCAGTT T AT GGGATGATTG TTGTAAATGC AATGC-OGTAG TTTGGATTAA 2400 

TAAGTGGATG GTTTTTGTTT CTAAAAAGAA AAAAAAAATC AGTGTTCACC CTTATAGAOA 2460 

CATAGTCAAG TTCATGTTGA TAATAATCAA AGGAATTACT CTCTTCTTGT TAAATTAGCT 2520 

AAATCATGTA ACCGCAGATA GGAAGGGCTC ACCTGGGGAA ACTCTGGTM CCGATGGGAC 2580 

AGGAAAGTCA TACGGGCAAC AGTATGOGGA AAGTACGTTT TTTTAAGTAA AAAACAAAGG 2640 

CAAACTTTGT ACTATCCAGT XA2CTAAGGA ACAATAAAAA CATTAGGAGA AAAAAAAAAA 2700
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: B36 Protein sequence
Protein Accession #: NP_055094.1

1 11 21 31 41 51

| | | | | |

MPGRHRWQRD MHPAHKLLSL IrFIaXUtGTEL TQVLPTNPBE SWQVYSSAQD SBGRCICTW 60 

APQQTMCSBD ARTKQLBQLL EKVQKMSQSI EVLDRRTQSD LQYVSXMENQ KKGLEEKFKQ 120 

VEE6HTDQHLA EQFKAIKAKM DEliRPLIPVL EEYKADAKLV LQFKKEV^IL TSVIMKLQBE 180 

IGAYDYDELQ SRVSNUSKRIj RACKQ5CLACG KL.TG I SDPVT VKTSGSRFGS WMTDPliAPEG 240 

DNRVWYMDGY HNNRFVREYK SHVDFMNTDN FTfiHELPHPW SGTGQWYNG SIYFNKFQSH 300 

II1RFDLKTE TILKTHSLDY AGYNNMYHYA WOGHSDIDLM VDEfiGLMAVY ATNONAGNIV 360 






VSRLDPVSLQ TLQTWNTfiYP KRBAGEAFJI CGTLYVTNGY SGGTKVHYAY QTNASTYBYI 420 
DIPFQNKYSH ISMUJXNPKD RAIjYAWNNGH QH.yNVTT.FH VIRSDEL 

Seq ID NO: B37 DNA sequence
Nucleic Acid Accession #: NM_006334.1

Coding sequence: 286..693

1 11 21 31 41 51

| | | | | |

GCGCGGGGGA GCCATTAGGA GGCGAQGAGA GAGGAGGGCG CAGCTCCCGC CCAGCCCAGC 60

CCTGCCCAGC CCTGCCCQQA GGCAGACGCG CCGGAACCGG GACGCGATAA ATATGCAGAG 120 

CGGAGGCTTC GCGCAGCAGA GCCCGCGCGC CGCOCGCTCC GGGTGCTGAA TCCAGGCGTG 180 

GGGACACGAG CCAGGGGCCG CCGCCGGAGC CAGCGGAGCC GGGGCCAGAG CCGGAGCGCG 240

TCCGOTTCCA CGCAGCCGCC GGCCGGCCAG CACCCAQGGC CCTGCATGCC AGGTCGTTGG 300 

AGGTGGCAGC GAGACATGCA CCCGGCCCGG AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 360

ATGGGCACTG AACTCACTCA AQTGCTGCCC ACCAACCCTG AGGAGAGCTG GCAQGTGTAC 420 

AGCTCTGCCC AGGACAGCGA GGGCAGGTGT ATCTGCACAG TGGTCGCCCC ACAGCAGACC 480 

ATGTOTTGAC GGGATGCCCG CACAAAACAG CTGAGGCAGC TACTGGAGAA GGTGCAGAAC 540 

ATGTCTCAAT CCATAGAGGT CTTGGACAGG CGGACCCAGA GAGACTTGCA 6TA0GTGGAG 600 

AAGATGGAGA ACCAAATGAA AGGACTGGAG TCCAAGTTCA AACAGGTGGA GGAGAGTOAT 660

AAGCAACACC TGQCCAGGCA GTTTAAGGGC TAACTTAAAA GAGTTTTTTC AATGCTGCAG 720 

TGACTOAAGA AGCAGTCCAC TCCCATGTAA OCATGAAAGA GAGCCAGAGA GCTTTTTGCA 780 

CCATGCATTT TTACTATTAT TTTCCAATAC TTAGCACCAT TTCACTAAGG AACCTTGAAT 840 

ACAACCAGGA TCCTCCTTTG CATGCGACTG TAGCTGCATT TCATGAATAG TTTGAACCCT 900 

TGTCAATGCA TTTTTTGAAA AAGAAAGAAA AAAAAAACTT CGTGTATGTG ACTCAAAGCA 960
TGTAACCTTA AGATGTTQCA TrCTAAACTG ACAATAAAGA CCTTTCCCC 
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seq ID NO: B3B Protein sequence 
Protein Accession #: NP_006325.1



1 11 21 31 41 51 

I I I 1 1 I 

MPGRWRWQED MHPARKLJjSL DFLILMGTSL TQVLPTNFEE SWQVYSSAQD SEGRCICXW 60 
APQQTMCSED ASTKQLRQLI* EKVQNMSQSI BVLDRRTQRD LQYVEKMKttQ MKGLE9KFKQ 120 
VEE3HKQHIA RQFKG



Seq ID NO: B39 DNA sequence

Nucleic Acid Accession #: NM_058199.1

Coding sequence: 286..755



1 11 21 31 41 51

| | | | | |

GCGCGGGGGA GCCATTAGGA GGCGAGGAGA GAGGAGGGCG CAGCTCCCGC CCAGCCCAGC 60 

CCTGCCCAGC CCTGCCCGGA GGCAGACGCG CCGGAACCGG QADGCGATAA ATATGCAGAG 120 

CGGAGGCTTC GCGCAGCAGA GCCCGCGCGC CGCCCGCTCC GGGTGCTGAA TCCAGGCGTG 180

GGGACACGAG CCAGGCGCCG CCGCCGGAGC CAGCGGAGCC GGGGCCAGAG CCGGAGCGCG 240 

TCCGOOTCCA CGCAGCCGCC GGCCGGCCAG CACCCAGGGC CCTGCATGCC AGGTCGTTGG 300 

AGGTGGCAGC GAGACATGCA CCCGGCCCGG AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 360 

ATGGGCACTG AACTCACTCA AAATAAAAGA GAAAACAAAG CAGAGAAGAT GQQAGGGCCA 420 

GAGAGCGAGA GGAAGACCAC AGGAGAGAAG ACACTGAACG AGCTTCCCTT GTTTTGCCTG 480

GAAGCCCACG CTGGCTCCCT GGCTCTGCCC AGGATGTGCA GTCCAAATCC CAATCCAGCA 540 

GTGGGGTTAT GTCGTCCCGC TTACCCTCAG AGCCCTTCTC CTGGTGCTGC CCAGACGATC 600 

AGCCAGTCCC TCCTGGAGAG GTTCTGCATG GCCTCTAGGA GAGAABTTTT CTTGGCCCCA 660 

GGAAGGCCTG GTGGAGGGTG GTGGTTGTGC ACTGTTGCTG GACAGATGCA TTCATTCATG 720 

TGCACACACA CACACACACA TGCACACACA GGGGAGCAGA TACCTGCAGA GAAGAGCCAA 780

CCAGGTCCTG ATTAGTGGCA AGCTGCCCCA CAAAGGGCTA TGCCTGTGTC TTATTGAGAC 840

AOCTTGGCAA AGAGATGGCT GATTCTGGGT GGTCCTGGAC ATGGCCGCAC CCAAGGGCCC 900 

TCCAAGCCTT AATGGCACCC TGAAGCCTCC ATGCCCAGGC CAAAAGATGC TTTTCCTCCC 960 
TAAAAAAAAA AAAAAAAAAA 



Seq ID NO: B40 Protein sequence
Protein Accession #: NP_478106.1



1 11 21 31 41 51

| | | | | |

MPGRWRWQRD MHPA&KLLSL IiFLTLMBTKL TQNKRENKAE KMGGPSBB&K TTGEKTXtHBI» 60 
PIiFCLEAHAG 6LALPRMC3P NFNPAVGLCR PAYPQSP6PG AAQTISQSLL EHFCMAfiRRE 120 
VFIAPGRPGG GWWI^CTVAGQ MHSPMCTHTH TflAHTGEQIP AEKS0PGPD 

Seq ID NO: B41 DNA sequence

Nucleic Acid Accession #: AY038071.1
Coding sequence: 1..1686

1 11 21 31 41 51

| | | | | |

ATGAGCAATC AGTACCAGGA GGAGGGCTGC TCCGAGAGGC CCGAGTGCAA AAGTAAATCT 60 
CCAACTTTGC TCTCCTCCTA CTGCATOGAC AGCATCCTGG GCCGGAGGAG CCCGTGCAftA 120 
ATGCGGTTGC TGG3AGCCGC GCAGAGCTTG CCTGCTCCGC TOACCAGCCG CG<XX3ACCCG 180 
GAAAAGGCCG TGCAAGGCTC CCCTAAGAGC AGCAGCGCCC CGTTCGAGGC CGAGCTGCAC 240 
CTGCCGCCCA AGCTGCGGCG CCTGTACGGC CCGGGCGGGG GCCGCCTCCT TCAQGGTGCG 300 
GCAGCGGCGG CGGCGQCGGC GGCGGCGGCG GCGGCAGCGG CCGCCACGGC CACGGCGGGT 360 
CCACGCGGGG AGGCCCCTCC GCCGCCACCG CCAACCGCGC GGCCCGGGGA ACGGCCGGAC 420 
GGCGCAGGGG CCGCCGCGGC AGCCGCGGCC GCGGCCGCCG CGGCCTGGGA CACGCTCAAG 480 
ATCAGCCAGG CGCCGCAGGT GAGCATCAGC CGCAGCAAGT CGTACOGCGA GAACGGGGCG 540 

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CCCTTCGTGC CGCCGCCGCC CGCGCTGGAC GAGCTGGGCG GOCCGGGGGG 0C3TCAO3CAC 600 

CCGGAGGAGC GCCTCGGCGT GGCCGGCGGC CX33QQCAGCG CCCCGGCTGC GGGTGGTGGC 660 

ACCGGCACCG AGGACGACGA GGAGGAGCTG CTGGAGGACC3 AAGAAGATGA GGACGAGGAA 720 

GAGGAACTGC TGGAGGACGA CGAGGAGGAG CTGCTGGAGG ACGACOCCCG CGCGCTGCTC 780

AAGQAGCCCC GGGGCTGTCC TGTGGCCGCC ACTGGCGCCG TGGCCGCAGC AGCTGCCGCT 840

GCAGTGGCCA CAGAGGGCGG GGAGCTGTCA CCCAAGGAGG AGCTGCTGCT GCACCCGGAA 900 

GAC3CTGAGG GCAAGGACGG CGAGGACAGC GTGTGCCTCT CTGCGGGCAG CGACTCGGAG 960 

GAGGGGCTGC TGAAACGCAA ACAGAGGCGC TAOCGCACCA CGTTCACCAG CTACCAGCTG 1020 

GAGGAACTGG AGCGGGOCTT CCAGAAGACG CACTACCCGG ACQTCTTCaC CAGGGAGGAA 1080

CTGGCCATGA GGCTGGACTT GACCGAGGCC CGAGTCCAGG TCTGGTTCCA GAACCGTCGG 1140

GCCAAGTGGC GCAAGCGGGA GAAGGCAGGC GCGCAGACCC ACCCCCCTGG GCTGCCCTTC 1200 

CCGGGGCCQC TCTCCGCCAC CCACCCGCTC AGCCCCTACC TGGACGCCAG CCCCTTCCCX 1260 

CCGCACCACC CGGCGCTCGA CTCCGCITGG ACTGCCGCTG CCGCCGCCGC CGCCGCCGCC 1320 

TTCCG3AGCC TACCTCCGCC TCCGGGCTCQ GCCAGCCTGC CGCCCAGCGG GGCGCCGCTG 1380 

GGCCTGAGCA CTTTCCTCGG AGCGGCAGTG TTCCGACACC CAGCTTTCAT CAGCCCGGCA 1440

TTCGGCAGGC TCTTTTCCAC AATGGCCCCC CTGACCAGCG CGTCGACCGC GGCCGCGCTC 1500 

CTGAGACAGC CCACACCCGC CGTGGAGGGC GCAGTGGCAT CGGGCGCCCT GGCCGACCCG 1560 

GCCACGGCGG CCGCAGACAG ACGCGCCTCT AGCATAGCCG CGCTGAGGCT CAAGGCCAAG 1620 

GAGCACGCGG CGCAGCTCAC GCAGCTCAAC ATCCTGCCGG GCACCAGCAC GQGCAAGGAG 1680

GTGTGC

Seq ID NO: B42 Protein sequence
Protein Accession #: AAK93901.1

1 11 21 31 41 51

| | | | | |

MSNQYQEEGC SEHPECKSXS PTLtiSSYCID SILGRRSPCK MRLLGAAQSL PAPLT8RADP 60 

EKAVCGSPKS 99APFEAEUH LPPKLRRLYG PGGGRKLQGA AAAAAAAAAA AAAAATATAG 120 

PRGEAPFFPP PTARPGERPD GAGAAAAAAA AAAAAWDTLK ISQAPQVSIS RSKBYREHGA 180 

PFVPPPPALD ELOQPGGVTH PEERI/3VAG3 FGSAPAAGQG TGTEDPE5EI* LEDEEDBDEE 240

EELiiEDDEEE KbEDDARALL. KHPRRCPVAA TGAVAAAAAA AVATEGGEbS PKEELliLHPE 300 

DAEGKDGEDS VCLSAGfiDSE EGUjKRKQSR YHTTFTSYQL EELERAFQKT HYPUVFTREE 360 

LAHHIiDIaTSA RVQVWFQNRR AXHRKHEKAG AQTHPPGUF PGPLSATHPIj SPYLDA8PFP 420 

PHEPALDBAW TAAAAAAAAA FPSLPPPPGS A9LPPSGAPL GtSTFLOAAV FRHPAFI6PA 480 

FGRUSTMAP LTSASTAAAL* liRQ PTPAVEG AVASGALADP ATAAADRRAS SIAALRIiKAK 540 
EHAAGfcTQLN IUGTSTGKE VC 



Seq ID NO: 843 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 

| | | | | | 

CGCCTGCCCC CCS3GTTGCTT CTCTCTCTTC CCTTCTGCCT GCGTGCTCGC CCGTCTTTGG 60 

TTCOGCCGOT TCCGTGCGCC CTCTCTGGGC TTCGGCTTCA CCCTCOSTCT GGCTGTCGCT 120 

CTGCTCTCCT CCGCTCTCAT GCCCCTTTCA GCTCCTACTC TGGTGTCGTC CCCCTCCCTC 180 

GCTCGTTTCT CTTGTTCCTC TCCTTCCCGG TTOGTGTTCT GCTTCTTCGT GTTCCCTCTC 240 

OTCTGTGTTT GTGGGTCGTC CTCTGCGTCC TGCCTCTTTC TCTTGCGTTT CCGGGCGCTT 300 

CCTGTTTTGQ TOCCCCTGGT TCTCGCGOGG TTCTGTGTGC GCGGGCCGCC CCGTTCTTTC 360 

TTGGTGCTTT GCTCCTGCTC TCTGTCTGGT GTTCTCCaCC GTCTGTGTCT TTCTCCTTTT 420 

ATCTTCTGTT TTCTCTTTTT CTTGTTTCTT CTTGTTTTTT TTTGTCTTTG TTTTCTCXTA 480 

GCTGAGGTTG GGGAGAQATA ACGCTGTAAA CTTTTATTTT TCAGGAAATC TGGAAACCTA 540 

CAGTCTCCAT GCCTGCTCAG CCAAGAAGGA GCTCACTGTG GGCACCAGAG ACAGGGACCC 600 

AATGTGGAGA CCTGTGAGCC TGTGTCCGGC CCTGAACTCT CAAGCACAGG GCAGGCTTCC 660 

TGAGCATTGA AGAGAATATG TGGGAGAACA AAACAGAAAC TGAAAQAATA TGCAAGGTGT 720 

CTTTCTTGGA TGTTATXCCA TGATAGATAG TAGGGSCAGG AGTGAGAGAG GCTGACTAGG 780 

TCTGGACATG GAGGCTGGAA OAGTCAGGGT GTGATTCGGA GAGGCGCATG AGAAGGAAGG 840 

TGGATTTTAA QGCTGGAAAT CTGAGGGTCA GTGGTCCAAG TCACTCAGAG ACAGAATCAC 900 
AGCATAGCCC TTGCTGATGG CAA 

Seq ID NO: 844 DNA sequence 

Nucleic Acid Accession #: NM_014421.1 
Coding sequence: 718..1497 

1 11 21 31 41 51 

| | | | | | 

CCACGCOTCC GGCGGGAGCC CGOGGCGAGC GTAGCGCAAG TCCGCTCCCT AGGCATCGCT 60 

GOGCCGGCAG CGATTCGCTQ TCTCtTGTGA GTCAGGGGAC AACGCIT0GG GGCAACTGTG 120 

AGTGCBCOTG TQGGGGAC3CT CGATTCTCTT CAGATCTCGA GGATTOGGTC OGGGGACGTC 180 

ICCTGATCCG CTACTAAAGC GdOTGCTAAC TTTGAAAAGG AGCACTGTGT CCTGCAAAGT 240 

TTGACACATA AAGGATAG9A AAAGAGAGGA GAGAAAAGCA ACTGAGTTGA AGGAGAAGGA 300 

GCTGATGCOG GCCTCCTGAT CAATTAAGAG GAGAGTTAAA CCGCCGAOAT CCCGGCGGGA 360 

CCAAGGAGGT GCGGGGCAAG AAGGAAGGGA AGOGGTGCX3A TCCACAGGGC TGGSTTTTCT 420 

TGCACCTTGG GTCACGCCTC CTTGGOGAGA AAGCGCCTCG CATTTQATTG CTTCCAGTTA 480 

TTGCAGAACT TCCTOTCXZTG GTGGAGAAGC GGGTCTCGCT TGGGTTCCGC TAATTTCTGT 540 

CCTGAGGCGT GAGACTGAGT TCATAQGGTC CXGGGTCCCC GAACCAGGAA GGGTTGAGGG 600 

AACACAATCT GCAAGCCCCC GCGACCCAAG TGAGGGGCCC CGTGTTGGGG TCCTCCCTCC 660 

CTTTGCATTC CCACCCCTCC GGGCTTTGCE TCTTCCTGGG GACCCCCTCG CCGGGAGATG 720 

GCCGCGTTGA TGCGGAGCAA GGATTCGTCC TGCTGCX^GC TCCTACTGGC CGCGGTGCTG 780 

ATGGTGGAGA GCTCACAQAT CGGCAGTTCG CGGGCCAAAC TCAACTCCAT CAAGTCCTCT 840 

CTGGGCGGGG AGACGCCTGG TCAGGCOGCC AATCGATCTG CGGGCATGTA CCAAGGACTG 900 

GCATTCGGCG GCAGTAAGAA GGGCAAAAAC CTGGGGCAGG CCTACCCTTG TAGCAGTGAT 960 

AAGGAGTGTG AAGTTGGGAG GTATTGCCAC AGTCCCCACC AAQOATCATC GGCCTGCATG 1020 

GTGTG1CGGA GAAAAAAGAA GCGCTG0CAC CGAGATGGCA TGTGCTGCCC CAGTACCCGC 1080 

TGCAATAATO GCATCTGTAT CCCAGTTACT GAAAGCATCT TAACCCCTCA CATCCCGGCT 1140 

CTGGATGGTA CTCGG CACAO AGATCGAAAC CACGGTCATT ACTCAAACCA TGACTTGGGA 1200 

TGGCAGAATC TAGGAAGACC ACACACTAAG ATCTCACATA TAAAAGGGCA TGAAGGAOAC 1260 

CCCTGCCTAC GATCATCAGA CTGCATTGAA GGGTTTTGCT GTGCTCGTCA TTTCTGGACC 1320 

AAAATCTGCA AACCAGTGCT CCATCAGGGG GAAGTCTGTA CCAAACAACG CAAGAAGGGT 1380 

TCTCATGGGC TGGAAATTTT CCAGCGTTGC GACTGTGCGA AGGGCCTGTC TXGCAAAGTA 1440 

TGGAAAGATG CCAOCTACTC CTCCAAAGCC AGACTCCATG TGTGTCAQAA AATTTGATCA 1500 

CCATTGAGGA ACATCATCAA TTGCAGACTG TGAAGTTGTG TATTTAATGC ATTATAGCAT 1560 

GGTGGAAAAT AAGGTTCAGA TGCAGAAGAA TGGCTAAAAT AAGAAACGTG ATAAGAATAT 1620 

AGATGATCAC AAAAAGGGAG AAAGAAAACA TGAACTGAAT AGATTAGAAT GGGTGACAAA 1680 

TGCAGTGCAG CCAGTGTTTC CATTATGCAA CTTGTCTATQ TAAATAATGT ACACATTTGT 1740 

GGAAAATGCT ATTATTAAGA GAACAAGCAC ACAGTGGAAA TTACTGATGA GTAGCATGTG 1800 

ACTTTCCAAG AGTTTAGGTT GTGCTGGAGG AGAGGTTTCC TTCAGATTGC TGATTGCTTA 1860 

TACAAATAAC CTACATGCCA GATTTCTATT CAACGTTAGA GTTTAACAAA ATACTOCTAG 1920 

AATAACTTGT TATACAATAG GTTCTAAAAA TAAAATTGCT AAACAAGAAA TGAAAACATG 1980 

GAGCATTGTT AATTTACAAC AGAAAATTAC CTTTTGATTT GTAACACTAC TTCTGCTGTT 2040 

CAATCAAGAG TCXTGGTAGA TAAGAAAAAA ATCAGTCAAT ATTTCCAAAT AATTGCAAAA 2100 

TAATGGCCAG TTGTTTAGGA AGGCCTTTAG GAAGACAAAT AAATAACAAA CAAACAGCCA 2160 

CAAATACTTT TTTTTCAAAA TTTTAGTTTT ACCTGTAATT AATAAGAACT QATACAAGAC 2220 

AAAAACAGTT CCTTCAGATT CTAOGQAATG ACAGTATATC TCTCTTTATC CTATGTGATT 2280 

CCTGCTCTGA ATGCATTATA TTTTCCAAAG TATACCCATA AATTGTGACT AGTAAAATAC 2340 

TTACACAGAG CAGAATTTTC ACAGATGGCA AAAAAATTTA AAGATGTCCA ATATATGTGG 2400 

GAAAAGAGCT AACAGAGAGA TCATTATTTC TTAAAOATTG GCCATAACCT GTATTTTGAT 2460 

AGAATTAGAT TGGTAAATAC ATGTATTCAT ACATACTCTG TGGTAATAGA GACTTGAGCT 2520 

GGATCTGTAC TGCACTGGAG TAAGCAAGAA AATTGGGAAA ACTTTTTOGT TTGTTCAGGT 2580 

TTTGGCAACA CATAGATCAT ATGTCTGAGG CACAAGTTGG CTGTTCATCT TTGAAACCAG 2640 

GGGATGCACA GTCTAAATGA ATATCTGCAT GGGATTTGCT ATCATAATNT TTCCTATGCH 2700 

GNTGAATTCN GlGTGAGGTC CTOTGTCCGT CCTATCCTCA AATTAT T TAT TTTATAGTGC 2760 

TGAGATCCTC AAATAATCTC AATTTCGGAG GTTTCACAAA ATGGACTCCT GAAGTAGACA 2820 

GAGTAGTGAG GTTTCATTGC CCTCTATAAG CTTCTGACTA GOCAATGGCA TCATCCAATT 2880 

TTCTTCCCAA ACCTCTGCAG CATCTQCTTT ATTGCCAAAG GGCTAGTTTC GGTTTTCTOC 2940 

CAGCCATTGC GGTTAAAAAA TATAAGTAGG ATAACTTGTA AAACCTGCAT ATTGCTAATC 3000 

TATAGACACC ACAGTTTCPA AATTCTTTGA AAOCACTTTA CTACITTTTT TAAACTTAAC 3060 

TCAGTTCTAA ATACTTTGTC TGGAGCACAA AACAATAAAA GGTTATCTTA TAGTTGTGAC 3120 

TTTAAACTTT TGTAGACCAC AATTCACTTT TTAGTTTTCT TTTACTTAAA TCCCATCTGC 3180 

AGTCTCAAAX TTAAGTTCTC CCAGTAGAGA TTGAGTTTGA GCCTGTATAT CTATTAAAAA 3240 

TTTCAACTTC CCACATATAT TTACTAAGAT GATTAAGACT TACATTTTCT GCACAGGTCT 3300 

GCAAAAACAA AAATTATAAA CTAGTCCATC CAAGAAGCAA AGTTTGTATA AACAGGTTGC 3360 

TATAAGCTTG GTGAAATGAA AATGQAACAT TTCAATCAAA CATTTCCTAT ATAACAATTA 3420 

TTATATTTAC AATTTGGTTT CTGCAATATT TTTCTTATGT CCACCCTTTT AAAAATTATT 3480 

ATTTGAA3TA ATTTATTTAC AGGAAATGTT AATGAGATGT ATTTTCTTAT AGAGATATTT 3540 

CTTACAGAAA GCTTTGTAGC AGAATATATT TGCAGCTATT GACTTTGTAA TTTAGGAAAA 3600 

ATGTATAATA AGATAAAATC TATTAAATTT TTCTCCTCTA AAAACTGAAA AAAAAAAAAA 3660 
AAAAAAAAAA A 

Seq ID NO: 845 Protein sequence 
Protein Accession #: NP_055236.1 

1 11 21 31 41 51 

| | | | | | 

MAALMRSKDS SCCLKLLAAV LMVESSQIGS SRAKiaJSIXS SLGGETPGQA AKRSAQHYQG 60 

LAFGGSKKGK JILGQAYFCSS DKECEVGRYC HSPHQGSSAC MVCRRKKKHC HRDGHCCPST 120 

HQJNGICIPV TB8TLTFHIP ALDGTHHRDR NHGHYSNHDL GWQNLGRPRT KMSHIXGHEG 180 

DPCI^RSSDCr EGFCCARHFK TKICKFV1JIQ GEVCTKQRKK QSHGKEIFQR CDCAKGLSCK 240 
VW KDATYSSK ARLHVCQKI 
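
The "Coding sequence" entries in this listing give 1-based, inclusive nucleotide positions (for example 718..1497 in the nucleic acid record above), and each sequence row carries six blocks of ten residues followed by a running position number. The following is a minimal sketch, not part of the original listing, of how such a record could be cleaned, sliced at the stated coordinates, and translated with the standard genetic code. The function names are hypothetical, and the sketch assumes an error-free transcription: any character that is not A, C, G, T or N is simply dropped, so it should not be applied to rows that still contain misread characters.

```python
import re
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# listing's coding sequences use the standard code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_listing(text):
    """Join listing rows into one string; drop spaces, position numbers, and
    any other character that is not A, C, G, T or N."""
    return re.sub(r"[^ACGTN]", "", text.upper())


def extract_cds(seq, start, end):
    """Return the region start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]


def translate(cds):
    """Translate codon by codon, stopping at the first stop codon.
    Codons containing N (or incomplete codons) are reported as X."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Toy record (hypothetical, not taken from the listing): CDS at 3..20.
toy = "CCATGGCCGC GTTGATGTAA GG 22"
seq = clean_listing(toy)
print(translate(extract_cds(seq, 3, 20)))
```

A quick consistency check that follows from the coordinates alone: a coding region start..end spans end - start + 1 nucleotides, which should be a multiple of three and equal three times the protein length plus three for the stop codon. For 718..1497 this gives 780 nucleotides, that is, 259 residues plus a stop codon.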

Seq ID NO: 846 DNA sequence 

Nucleic Acid Accession #: NM_019885.1 

Coding sequence: 1..1339 

1 11 21 31 41 51 

| | | | | | 

ATGCTCTTTG AGGGCTTGGA tctggtgtcg gcgctggcca CCCTOGCCGC GTGCCTGGTG 60 

TCCGTGACGC TGCTGCTGGC GGTGTCGCAG CAGCTGTGGC AGCTGCGCTG GGCCGCCACT 120 

OGOGACAAGA GCTGCAAGCT GCCCATCCCC AAGGGATCCA TGGGCTTCCC GCTCATCGGA 180 

GAGAOCGGCC ACTGGCTGCT GCAGGGTTCT GGCTTCCAGT OGTOGOGGAG GGAGAAGTAT 240 

GGCAAOGTGT TCAAGACGCA TTXGTTGGGG CGGCCGCTGA TACGCGTGAC OGGCGCGGAG 300 

AACGTGCGCA AGATCCTCAT GGGCGAGCAC CACCTCGTGA GCACCGAGTG GCCTCGCAGC 360 

ACCCGCATGT tgctgggccc caacacggtg tccaattcca ttggcgacat ccaccgcaac 420 

AAGCGCAAGG TC-TTCTCCAA GATCTTCAGC CAGGAGGCCC TGGAGAGTTA CCTGCCCAAG 480 

ATCCAGCTGG tgatccagga cacactgggc GCCTGGAGCA GCCAOCCOGA GGCCATCAAC 540 

GTGTACCAGG AGGCGCAGAA GCTGACCTTC C3GCATGGCCA TCOGGGTGCT GCTGGGCTTC 600 

AaCATCCCTG AGGAGGACCT TGGGCACCTC TTTGAGGTCT ACCAGCAGTT TGTGGACAAT 660 

GTCTTCTOCC TGCCTGTCGA CCTQCCCTTC AGTGGCTACC GGCGGGGCAT TCAGGCTCGG 720 

CAGATCCTGC AGAAGGGGCT GGAGAAGGCC ATCOGGGAOA AGCTGCAGTG CACACAGGGC 780 

AAOGACTACT TGGACGCCCT GOACCTCCTC ATTGAGAGCA GCAAGGAGCA CGGGAAGGAG 840 

ATGACCATGC AGGAGCTOAA GGAGGGGACC CTGGAGCTGA TCTTTQOGGC CTATGCCAOC 900 

ACGGCCAGOG CCAGCAOCTC ACTCATCATG CAGCTGCTGA AGCACCOCAC TGTGCTGGAG 960 

AAGCTGCGGG ATGAGCTGGG GGCTCATGGC ATCCTGCACA GTGGCGGCTG CCCCTQCGAG 1020 

GGCACACTGC GCCTGGACAC GCTCAOrGGG CTGCGCTACC TGOACTGCGT CATCAAGGAG 1080 

GTCATGGGCC TGTTCACGCC CATTTCCGGC GGCTACCGCA CTGTGCTGCA GACCTTCGAG 1140 

CTTGATGGTT TCCAGATCCC CAAAGGCTGG AGTGTCATGT ATAGCATCOG GGACACCCAT 1200 

GACACAGCGC CCGTGTTCAA AGACGTGAAC GTGTTCGACC CCGATGGCTT CAGCCAGGCG 1260 

CGGAGCGAGG ACAAGGATGG CCGCTTQCAT TACCTCCCGT TCGGTGGCGG TGTCOGGACC 1320 

TGCCTGGGCA AGCAGCTGGC CAAOCTGTTC CTGAAGGTGC TGGOGGTGGA GCTGGCTAGC 1380 






WO 03/042661 



PCT/US02/36810 






ACCAGCCGCT TTGAGCTGGC TACAOGGACC TTCCCOCGCA TCACCTTGGT CCCOGTCCTG 1440 

CACCCCGTGO ATGGCCTCAG CGTCAAGTTC TTTGGCCTGG ACTCCAACCA GAACGAGATC 1500 

CTGCCGGAGA CGGAGGCCAT GCTGAGCGCC ACAGTCTAAC CCAAGACCCA CCCGCCTCAG 1560 

CCCAGCCCAG GCAGCGGGGT EGTGGTTGTG GGAGGTAGAA ACCTGTGTGT GGGAOGGGGC 1620 

CGGAACGGGG AGGGCGAGTG GCCCCCATAC TTGCCCTCCC TTGCTCOCCC TCCCTGGCAA 1680 

ACCCTACCCA AAGCCAGTGG GCCCCATTGC TAGGGCTGGG CTOCCCTTCT GGCTCCAGCT 1740 

TCCCTCCAGC CACTCCCCAT TTACCATCAG CTCAGCCCCTT GGGAAGGGCG TGGCAGGGGC 1800 

TCTGCATQCX: CGTGACftGTG TTAGGTGTCA GCGCGTGCTA CAGTGTTTTT GTGATGTTCT 1860 

GAACTGCTCC CTTCCCTCOQ TTCCTTTCGG ACCCTTTTAG CTGGOGTTGG GGGACGGGAA 1920 

GAGCCGTGCC CCCTTGGGCG CACTCTTCAG CGTCTCCTCC TCCTGCGCCC CCACTGOGTC 1980 

TGCCCAGGAA CAGCATCCTG GGTAGCTGAA CAGGAGTCAA CCTTGGOGGG GCGGGGGCTG 2040 

CGTCCAACCT GGAGATTGCC CTTCCCTATG CCACQOTTCC CACCCTCCXTT CACCAGTTTG 2100 

GACAATTTGA AATTACCTAT TGCTGCTACT TGTTCTGTCC TCTGACCTTG GGGCAAAGGA 2160 

GCCCCAGGCC CTGTCTOCCC AGCATCCTCC CTGOTGGCCC TGGGCAGGTG CACTGACACC 2220 

CCCACCTTCC CATCCCCTGC TGAACCAGGC CCTGTTACAC ACAGCCGCCT AAGGCCCGCG 2280 

GCTCATGTGC TGCCCGCCOC CAT AT TT ATT CACTGATAGA GAATCTTGGG GATGCTGGGG 2340 

TCTGGAGTGA ACATCTCCTC CCCTTCATGC OCTAGCCTGT OTTCTAGCTG TCCTGGOGAG 2400 

ACTTCTGTQA GTGAAGAGGA AGGGGTCTCT GGTCAAACCC AGOCOCCAGG GCCTAGGGTT 2460 

GAAAGCCTTC CCCGGCTCCG GGCATTATTT GGGTTTAATC TCGGAGCCTC ACTCCTGGAC 2520 

TGAAGTCCGG TGCCTCTGCC TTATCCCTGG TGGAGATGGA ATGTGGCCCA TTGCCTCCTC 2580 

CAAAAACCCT GATCAGGTAG ATTTGGAGGC GGCCACGATT TCCTGTTTGG 2640 

CCCCAGTGCA CTGGCCCTGA CTCCAGGCGT GAGTATGGGG AAGGATACGG 2700 

GTTCTTCTGA CGGGGAGCAA GGGCCTCCGT CTTCCCTTCC TTAACTCTCC CCCTTTGCCC 2760 

TCCGCCCTGA AAAAGGTGTC CTTGAAGTCC CTTCCACCTC TATGCCACTG TCTGCTTAGC 2820 

CCAGCTCAGG GGTGGSGAAG AGGOGAAAGC GTGGGGGAGG TGAGCGCAGC GGCAGTTCTG 2880 

CCTCGQAGCT GATTGCAGGG CCCTGTGTGG TCTODGGACA GCTGOGGGAA GGGTGCCGCA 2940 

GCTGAAGCTG AAGAGGCGGC TACGTGCGGT TTGTCAGGGG GATTGGGTTG AAAACTGGCC 3000 

AGTCGGGATG ACTGGGTGAA AOAGGAGTAG CTCCTGCCAC TGGCGTTTTG AGTGTTGGCA 3060 

ATTTGGGATQ CCTCCTGGGG AAGGTTTCCG GGCGTTTGGT GAGTCTCTAG ATTTTTCCTT 3120 

GCTTTCTGTG TTTATTG3TT TTTGATGTTG TAAAAGCAAT GAATCCCCTT TACAAGAAAA 3180 

TCGAAAACAC AGAAGAATGA AGGACATGCC AGTCCOCGAT OGCTGCTGTG AGCACCTCAG 3240 

TGGCTCCCTC AGACCAGATC CCGTAGGCAG CCCCACAGAC CGACCCTGAC OCCACTCACA 3300 

GCCACCCTGA AGATAGACTA TAGGAACQGG CCCATACCAC ACAGACTGCT CTCCAATGCC 3360 

TGAGTCTCAG ATGTTTCATT TATT T C CT AC TTTTCCACTA CTAAAAAACA GTGTGGAATA 3420 

GACATTATTG GCAAAATTGC TCATCCCTAA TCCTGAAAAA CAGGCCAGAA TOGGTAAAGA 3480 

CTTGTCAAAG CTTGCAACAT AGCXACATGG TGCACCCGGA CCTGTACOCC CTCCCCCCAA 3540 

CACAAAACCA GTGTCTGGGA GGTTCATTTT CCTTTAAACT GATCCAGCTG GCCCTGAACC 3600 

AATTGTTTTT GACTGAGTAT CTAGGAGAGC AGTAAGTGGA ACTTCAGACA AGCCCACTGG 3660 

GTCTGGTCCA GGTGAGGGGC AGGGGGCATG GGGCTGGGAG GTCTCAGGGG CCTTCCCTGG 3720 

GGGTGGCGAG CCTGGTAGGG GG CAGAGAAG GAAAAGCTGA GGGGGGTOCC TGTGAGG0AG 3780 

GAAAGAAGGA TCATTTGOOC CGCTGGGTCT CAAAGGCAGT GAGAAGAGAG CTGAAGAAAG 3840 

CTCTGGCTGG CTGACAGGAT CCCTGTGTTG TAATTQGTCC CTCCTTTCAG CTCTCTAGTG 3900 

AGATOCCOGr GTCTGTGCGT GTGCGTGTQT GTTTCATACA GCT AG CAT T A GATGGGTGAT 3960 

GTTTCTTACT TATCATCCCT AACTATTGCA ACTTGACCTr AAAAAGACAA AACCCCACAA 4020 

AACTCTTCCT GCCACGGGCT TGCAGATTOA AGCACTTTCG ATGTTGGGCG CTGGCGTTTG 4080 

AOCACCGTGA CCCTGCCCAG ATGGCTATAA TATTATTTTA TACACAAACC 4140 

TAAATGTTAT aattttgtgt CTGTCTTTAT AAACTATTAT AAQTACTATT 4200 

TTTGTTATAA TTCAAAATAG AT ATT T AGTA TAAAGTTTTT GCTGTTAAAT ATTTGTTATT 4260 

TAGTAAAATA TGAATTTTGC TCTATTGTAA ACATGGTTCA AAATATTAAT ATGTTTTTAT 4320 

CACAGTCGTT TTAATATTQA AAAAGCACTT GTGTGTTTTO TTTTGATATG AAACTGGTAC 4380 

CGTC3TGAGTG TTTTTGCTGT OGTGGTTTTA ATCTGTATAT AATATTCCAT GTTGCATATT 4440 



Seq ID NO: 847 Protein sequence 
Protein Accession #: NP_063938.1 



1 
I 

MLFEGLX&V8 
ETQHWLTjQGS 
TRMIiLGPNTV 
VXQSAQKIiTP 
QILQKGUEKA 
TABASTBLIM 
VMKLFTPISQ 



HPVDGIiSVKF 



11 
I 

ALATIiAACLV 
GFQSSEREKY 
SN3IGDIHRN 
RMAIRVLU3F 
IREKIiQCTQG 
QLLKHPTVIiE 
GYRTVLQTFE 
YliPFGGGVHT 
FGLDSNQNEI 



SVTLLIAV8Q 
GNVFKTHLJliG 



31 
I 

QLHQ1JOTAAT 
KPL1KVTGAE 
HEALESYliPK 
FEVYQQFVDIT 



KDYLDAT iTTTtli 
KLRUSLRAEG 
LDGFQIPKGW 
CLGKHLAKLF 
bPEJTEAHLSA 



ILHSGGCPCH 
SVMYSIRDTH 
XdCVXiAVSLAS 
TV 



41 
1 

KDKSCKLMP 

snntiaiMQHB 

IQIjVIQDTLR 
VFSliPVDiLPF 
MTMQKUOX3T 
GTLMJ3TLSG 
DTAFVFKBVH 
T8RFBLATHT 



51 
I 

KGSMGFFX.IG 
HLVSTEWFRS 



Seq ID NO: 848 DNA sequence 

Nucleic Acid Accession #: AB040527.1; AL136582.1 
Coding sequence: 94..2319 



1 
I 

GCGGCTGCGG 
TGAGGGAAGC 
CGACGAAGTC 
TGGTGGCTAT 
GGGTCCAGGC 
CCAGACCCTG 
CCGTGCTGCC 
CGCTG CCCGT 
CTCAGGAGAG 
GCCACCCCTT 



11 
I 

GGTTCCTGGT 
TTCAGCGTGC 
GGGGAGGAAG 
GGCACCCTCA 
CTTCGCATCT 
GTGGAGGCAT 
AACGTAGCCC 
ACCGGCTTCA 
GATACCCAGC 
GCTTCTCCGC 



21 
1 

GCTGAGGACG 
AATCGGAAAG 
AGATGGTTGA 
CCAGCTTTGA 
TATOGAATGA 
TGC3U3CTGOA 
GOGCCGCOGC 



CCACGACCTA 
AGACCTCCCA 



31 
I 

GACGCCATTG 
CTACAGTGTT 
AGGCAACGAC 
CATCCATATC 
GCCCTGGGAA 
TCCGGAAACA 
CTCCAACCGT 
CGCTAGCCAC 
CGCCGCCGAG 
GATGTTAGTC 



41 

I 

GAGTTCCCGA 



TATGAAGAAT 
CTC3U3AGCCT 
CTGGAAAACC 
CTTGCCAATG 



COGGTGGCCA 
GCTCRGGGGC 
ACCAGTAAGA 



6GYRHGIQAR 
LELIFAAYAT 
IiRYI/DCVIKB 
VFDPDRFfiOA 
FPRTTLVPVli 



51 



GAAGCATGGC 
ATGAGGGTAG 



TCGGAAGCTT 
CTGTGCTGGC 
AGACGGCCGC 
CCGCTGCCGC 
CGCOGCAGGT 
CCACCCCTGA 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 






COAGGCTCCG GCAACCTCCG CACAGTCCCA GACAGGCTCC CCGGCCCAQG AGGCTGCTAC 
TGAGGGCCCT AGTAGCGCCT GTGCTTTCTC TCAGGCTCCG TGTGOCAGGG AGGTQGACE3C 
CAACCGGCCC AGCACAGCCT TCCTGGGCCA GAATGATGTC TTCGATTTCA CTCAGCCGGC 
AGGTGTCAGT GGCATGGCCT TCCCGCGCCC CAAGAGACCT GCCCCAGCCC AAGAGGCTGC 
CACAGAGGGC CCCAGTGCTG CCTCTGGTGT GCCCCAGACG GGACCIGGCA GGGAGGTGGC 
AGCCACCCGG CCCAAGACCA CCAAGXOGGG GAAGGCQCTG GCCAAGACTC GGTGGGTGGA 
GCCTCAGAAT GTTGTGGCAQ CAGCTGCTGC CAAGGCCAAG ATQaCCACGA GCATCCCTGA 
GCCGGAGGQT GCAGCTGCTG CCACTGCTCA GCACAGTGCT GAGCGCTGGG CCAGGATGGG 

AGGCAAGAGG accaagaagt ccaagcacct ggatgatgag TATGAOAGCA GGGAGGAGGA 

GAGAGAGACT CCCGCGGTCC CACCCACCTG GAGAGCATCA CAGOCCTCAT TGACGOTGOG 
GGCTCAGTTG GCCCCTCGGC CCCCGATGGC CGCGAGGTCC CAGATACCCT CAAGGCACGT 
ACTGTGCCTG CCCCCCCGCA ACGTGACCCT TCTGCAGGAQ ASGGCAAATA AGTTGGTGAA 
ATACCTGATG ATTAAGGACT ACAAGAAGAT CCCCATCAAG CGCGCAGACA TGCTGAAGGA 
TGTCATCAGA BAATATGATG AACATTTCCC TGAGATCATT GAACGAGCAA CGTACACCCT 
GGAAAAGAAG TTTGGGATCC ACCTGAAGGA GATCGACAAG GAAGAACACC T GT AT ATTCT 
TGTCTGCACA COGGACTCCT CAGCTCGCCT CCTTGGAAAA ACCAAGGACA CTCCCAGGCT 
GAGTCTCCTC TTGGTGATTC TGGGCGTCAT CTTCATGAAT GGCAACCGTG CCAGCGAGGC 
TGTCCTCTGG GAGGCACTAC GCAAGATGGG ACTGCGCCCT GGGGTGAGGC ACCCATTCCT 
GGGCGATCTG AGGAAGCTCA TCACftGATGA CTTTGTGAAG CAGAAQTACC TGGAATACAA 
GAAGATCCCC AACAGCAACC CACCTGAGTA TGAATTCCTC TGGGGCCIGC GAGCCCGCCA 
TQAOACCAGC AAGATGAGGG TCCTGAGATT CATCGCCCAG AATCAGAACC GAGACCCCCG 
GGAATGGAAG OCTCATTTCT TGGAGGCTGT QGATGATGCT TTCAAGACAA TGGATGTGGA 
TATGGCOGAG GAACATGCCA GGGCCCAGAT OAGGGCCCAG ATGAATATCG GGGATGAAGC 
GCTGATTGQA CGGTGGAGCT GGGATGACAT ACAAGTCBAG CTCCTGACCT GGGATGAGGA 
OGGAGATTTT GGCGATGCCX GGGCCAGGAT CCCCTTTGCT TTCTGGGCCA GATACCATCA 
GTACATTCTG aatagcaacc gtgccaacag gagggccacg tggagagctg GCQTCAGCAG 

TGGCACCAAT GGAQGGGCCA GCACCMCGT CCTAGATGGC CCCAGCACCA GCTCCACCAT 
CCQGACCAGA AATGCTGCCA GAGCTGGCGC CAGCTTCTTC TCCTGGATCC AGCACC3GTTG 
ACGAACTGCA GCGATCTTAC TGGCCAAGCC AGAGCGCCTC CTCTCAGATT CCTTCTCQAC 
ACAGCACCCT AGGCGGCTTC TTCCTGTCAG TCGGAGGTGG CATGCAAGAT GAAGCTCTCT 
TTGCTCTTCC TGCTTTCATT TTGTGCXTTT CCTTGTGTTT TCATGTTTTG GGTATCAGTG 
TTACATTAAA GTTGCAAAAT TAAA 

Seq ID NO: 849 Protein sequence 
Protein Accession #: BAB33378.1; CAB66517.1 

1 11 21 31 41 51 

| | | | | | 

MABG3FSVQS ESYSVEDMDE GSDEVGEEEM VEQNDYEEFG AFGGYGTLTS FBIHIIiRAFG 60 

SLGPGLRILS HEFHELKNPV LAQTWEALQ LDPETIiANBT AARAAWVARA AABNRAARAA 120 

AAAARTAFSQ WASKRVATP QVSGEDTQPT TYAAEAQOFT PSPPLASPQT SQM&VTSKHA 180 

APEAPATSAQ SQTGSFAQBA ATEGPS9ACA FSQAPCAREV DANRPSTAFL GQNDVFDFTQ 240 

PAGVSGMAFP RtKRPAPAQH AATEGFSAAS GVPQTGPGRE VAATRPKTTK SGKALAKTHW 300 

VEPQNWAAA AAKAKMATSI PEPEGAAAAT AQHSAEPWAR MGGKRTKKfiK HLDDEYE8SE 360 

EERETPAVPP TWRASQPSLT VRAQIAFRPP MAPRBQIP3R HVLCLPPRNV TLLQERANKL 420 

VKYLMIKDYK KIPIKRADML KDV1REYDEH PFEIIERATY TLEKKFGIHIi KEIDKKEHLY 480 

XLVCTRD8SA RIiIiGKTKDTP RLSUJjVILG VIFHNGNRAS EAVLWEALRK HGLRPGVEHP 540 

PLGDLRKLIT UDFVKQKYLE YKKIPNSNPP BYBFLHGLRA RHETSKMHVL RFIAQNQNRD 600 

PRFWKAHFLE AVDDAPKTMD VDMAEEHARA QMRAQHHIGD KAI/IGSWSWD DJQVKLLTWD 660 

EDGDEGDAWA RIPPAFWARY HQYILNSNRA NRRA.TWRAGV SSGTKGK3AST SVLDGPSTSS 720 
TIRTHKAARA GASFFSWIQH R 

Seq ID NO: 850 DNA sequence 

Nucleic Acid Accession #: XM_084965.1 

Coding sequence: 356..2014 

1 11 21 31 41 51 

| | | | | | 

GACCTAGCGT GTGCTCAGCT CTGGACAAGA CATGGATGTA GGGAGACATG CTTCCTGCCT 60 

GCTGGQCTTA GTGCACCAAT QCTAAGGACA GACATGAAGG GCTTGGGAAA GGCATGAAGA 120 

TGCAGGGAGT CTAGGAGAGA GGATAAGAAA AACAAGCATT TTGTGGTGGG TTGAATTGCG 180 

TTGrCGTTTT TAGAGGCATT TCGTGATCCA TQGTTGOGCT CCCGTGAAGA GCGAGTCTAG 240 

AGAAGGAATC CGAAGCGGCG GCGQCGAGCG CGGCTCAGGT TGATTTAGAA TACGGOTQAC 300 

AGTGGCCTGG CGCGAGCCCA CTGCTGACQA AAGCGGCTTA TCCCGCQCGG TTTCCATGGA 360 

GACGAGCCGG AGCCGCGGCG GCOGCGGGGC TGTCAGCGAG CGCGGCGGAG CTGGCGCGTC 420 

CGTGGGGGTC TGCAQQAGGA AGGGGGAGGC CGGQGCCGGG ACOGGQACCC TCGCGGCAGA 480 

CATGGACTTG CATTGTGACT GTGCCGCCGA AACGCCQGCC GC0GAGC0GC CGTCQGGGAA 540 

GATTAATAAA GCTGCCTTCA AATTATTCAA GAAGAGGAAA TCGGGTGGCA CCATGCCCAG 600 

CATTTTTGGG GTCAAAAACA AAGGGGACGG GAAAAGdCQ GGTCCGACGG GGCTOQTGAG 660 

OAGCAGGACC CACGACGGAC TTGCCGAGGT GCTQGTGGTG GAGAGCGGCA GOAAGGAGGA 720 

GGCGCGCGGC GGGGGCGACA GCGGCGGGGG CGGCGGGGGG CGGCCGAACC CGGGGCCCCC 780 

CAOAGGCGCA GGGCCCGGCG QGGGCTCGCT CGCCAQCAGC TCGGTGGCCA AGTCGCACAG 840 

CTTCTTCTCG CTGCTGAAGA AGAAOQGGOG CTCGGAAAAC GGCAAGGGAG AGCCTGTGGA 900 

OSCQAGGAAG GCGGGCGGCA AAGAAAAGCG GQGQCTGCGG GGGCTGTTCA GGGGCATGCG 960 

CTGGCACAGG AAAGACAAGC GGGCCAAGGG GGAGGCCGCG GAGGGGOGCG OGCOGGGGOG 1020 

COGCTTGATC CTACCOGGCT OGCTCACGGC CMCCTGGAG TGCGTCAAGQ AGGAGACGCC 1080 

CAGAGCCGCQ CGCGAGCCGG AGGAGCCCAG CCAGGACGCG CCGCGAGACC CAGCAGQCTG 1140 

1GGAGATATT ATTGCAGACC AAGAGGAAGA GGCftGGTCCC AGCTGTQACA AGCATGTCCC 1200 

CGGGCCAGGC AAGCCGGCTC TOTCTAAAAA GAAOCCCGGC GTGGTGGCCT ACKZAAGGAGG 1260 

CGGGGAAGAG ATGGCCAGCC OGQACGAGGI GGACGACACC TATCTACAGG AGTTCTGGGA 1320 

CATGCTCTGC CAGACCGAGG AGCAGGGACC CGAGTCCCAG GAGGGCGGGG CTAAGGTGGC 1380 

AGCTGCGCTG GAAACCAAGG TGGTGCCCGA GACCCCCAAA GACACCAGGT GTGTGGAAGC 1440 

GacCAAOGAC GCGTCCTCGG ^CAAGCGCAG GAGGCTGAAC CGGATTCOCA TCGAGCGCCA 1500 

TCCTAAGGAG QAGCCCAAGC ACOCGGAGAA GGAGCAGCAG QAAGGCGTCC CCAACAGCGA 1560 






CGAGGGCTAC 
GAAGGCGGGC 
CCCGGACGGA 
GTCCCGGTTA 
CTTGCTGAAG 
CCATCCCGTG 
TTCCAAAGTG 
AACGGCCTGC 
CCOATGCCAA 
AAGAAAGTCT 

TCTCCTCTCA. 
AAGTTTTTCT 
TGCAGAAAAC 
CAOTGTCTAG 
ACGTTTCCTT 



TGGGACTCCA 
ATCCCCCGGG 
AGTCCAGCAA 
AAGCCCGTAT 
GACTCTAAGA 
GTGCACCAGC 



CACGACAGTG 
GGAATACAAC 
TTTAGOACAC 
CCTGAGCAGG 
AGATAABTCC 
TCCTTTTGCT 
AGAACTTAGC 
AGGCCTGAAG 



CCACGCCAGG 
ATAGCTACAG 
CCCTTCCTGG 
CTCCAGGCAC 
TCCCTATTAG 

aaccctccag 

OAGTCAGCAA 
CCAAAAAGTT 

TTTtCcctgG 

CACCOGTCCC 
GACCGGATAA 
CTGAGAATTA 
TTAATATACr 
CAAAOCTAAG 
GAACCGCTGA 



CCCAGAGGAA 
OGGGGACGCG 
AGGGAAGGAC 
CATCACCTGT 
CATCAAGCAC 
GAGTGAGATG 
CDGGGGCTTG 
GTOAGGTCTT 
AAACCACTAA 
CCGCCCTQCT 
CACCAGAAAG 
TTTTCAAGCA 
GAACACTTGG 
TAAGAGTCAT 
AGAACCAGAG 
C CTTTCCAGT* 



GACAGCTCGA 
CTCTATGATC 
AAOGAGGAGA 
CCACTGCGAA 
CTGACCAACC 
CCCAGAACAA 
GCTGGGACCA 
CCAGGCCAAG 
AGTAAGTTTT 
CCAGAGCGTG 
.AGGGATGCTA 
CTTTTTTCTT 
AAGTCACCTT 
GOCTGGATAT 
GAGATCTCCT 

r 



GCAGCGGGAA 
T CT ATGCT GA 
CGTCCrCCCT 
CAOCAGGCAG 
TTCCATCTAG 
AAATCCCGGT 
CCATCAGAGC 
GTGGATGGGC 
GCTTTTCCTA 
GACCGAGGAG 
CACGGGGGTT 
TTTTACCTTT 
TACXTGCCTT 
TGGGATAAGC 
CTTCCAAGAC 



1620 
1660 
174 0 
1800 

iaeo 

1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



Seq ID NO: 851 Protein sequence 
Protein Accession #: XP_084965.1 



1 
I 

HETSRSRGGG 
GKINKAAFKL 
EBPRGGGDSG 
VDASHAGGRQ 
TPHAAREPEE 
GGGEEMASPD 
EAAKDASSVK 
□KXAGIPRD8 
GSLIrKDSKIP 
RATACHDSAK 



11 

! 

GAVSERGGAG 
FKKHKSGGTM 



21 
1 

ASVGVCRRKA 
PSIFGVKNKG 
PPRAAGPGGG 



KRGliRGLFSG 
P8QDAPRDPA 
EVBDTYLOEP 

RRRLNRIPIE 
YSGDAI.YDLY 
ISXKHLTTsfliP 
KL. 



GCGDIIADQE 
WDMLSQTEBQ 



ADPD GSPATL 
SSEPWHQQP 



31 41 

I I 

EAGAGTGTIiA ADMDLHCDCA 
DQKSfiGPTGb VRSRTHDGLA 
SJUASSSVAKS ESFFSLLKKN 
KAEAAEGRAP GGGLXLPGSL 
EEAGPSCDKH VPGPGKPAIiS 
GPEPQBGAAK VAAALETKW 
BKHQQBGVPN SDBGYWDSTT 

pggkdheets slsrckpvsp 
srskmprtki pvskvlvrrv 



51 

I 

AETPAAEPPS 
EVLVLBSGRK 



TASLECVKEE 
XKNFGWAYQ 
PBTPKDTRCV 



GTTTCPI>RTP 
HNRQIAGTTI 



Seq ID NO: 852 DNA sequence 
Nucleic Acid Accession #: FGENESH predicted 
Coding sequence: 1..2016 

1 11 21 31 41 51 

| | | | | | 

ATGGAGACGA GCCGGAGCCG CGGCGG0QQC GGGGC7GTCA GCGAGCGCGG CGGAGCTGGC 60 
GCGTCCGTGG GGGTCTGCAG GAGGAAGGCG GAGGOOGGGG CCGGGAOOGG OACCCTOGCG 120 
GCAGACATGG ACTTGCATTG TGACTGTGCC GCGGAAACQC CGGCCGOCGA GCCGCCGTCG 180 
GGGAAGATTA ATAAAGCTGC CTTCAAATTA TTCAAGAAGA GGAAATCGGG TGOCACCATG 240 
CCCAGCATTT TTGGGGTCAA AAACAAAGGG GAOGGGAAAA GCTCGGGTCC GACGGGGCTG 300 
GTGAGGAGCA GGACCCACGA CGGACTTGCC GAGGTGCTGG TGCTGGAGAG CGGCAGGAAG 360 
GAGGAGCCGC GGGGCGGGGG CGACftGCGGC GGGGGCGGCG GGGGGCGGCC GAACGCGGGG 420 
CCOCCCAGAG CCGCAGGGCC OGGCGGGGGC TCCCTCGCCA GCAGCTCGGT GGCCAAGTOG 480 
CACAfiCTTCT TCTOGCTGCT GAAGAAGAAC GGGCGCTGGG AAAAOGGCAA GGGAGAGCCT 540 
GTGGACGCGA GCAAGGCCGG GGGCAAACAA AAGOGGGGGC TGGGGGGGCX GTTCAGCGGC 600 
ATGCGCTGGC ACAGGAAAGA CAAGCGGGCC AAGGOGGAGG CCBCGGAGGG GCGCGCGCCC 660 
GGGGGCGGCT TGATCCTACC CGGCTC3CTC ACGGCCAGGC TGGAGTGOTT CAAGGAGGAG 720 
ACGCCCAGAG CCGCGCGCGA GCCGGAGGAG CCCAGCCAGG ACGCCCCGCG AGACCCAGCA 780 
GGTOAGCCCG CAGGGGGAGA GGAGGTOCCC GCCCCCGCCG ACCGCSCCCC AGCGOQGAGC 840 
TGCCGAGAGG CAGAGGGCCT CGCG CACCCC GGCGACAQCG GCGCCCGGGG AGAGGAOGCC 900 
GOGGGGCATC GGOGCGCOGA GCCGGGGCCC GGGGAGGTCC GCACGGCAGA GGACGCTTCC 960 
AGGACGGGGG COGtTCOCGT AAAGACBGTC CCCCTTGTGG ACTCCGAAGG CGGCAGCQQC 1020 
CGGGCGGC CG GCGCCCCAGA CCCTGCCTCT GTCGATCCAC CCTCAGACCC GTCGGCAGAT 1080 
CGTATTTGTT TGATGTTTTC TGACGTGACT TCACTGAAAA GCTTTGACTC TCTTACAGGC 1140 
TGTGGAGATA TTATTGCAGA CCAAGAGGAA GAGGCAGGTC CCAGCTGTGA CAAGCATGTC 1200 
CCCGGGOCAG GCAAGCCGGC TCTGTCTAAA AAGAACCCCG GCGTGGTGGC CTACCAAGGA 1260 
GGCGGGGAAG AGATGGCCAG CCCGGACGAG GTGQACGACA CCTATCTACA GGAGTTCTGG 1320 
OACATGCTCT CCCAGACOGA GGAGCAGGGA CCCGAGCCCC AGGAGGGCGC GGCXAAGGTG 1380 
GCAGCTGCGC TGGAAACCAA GGTGGTGCCC GAGACCCCCA AAGACACCAG GTGTGTGGAA 1440 
GCGGCCAAGG AGGCGTCCTC GGTCAAGCGC AGGAGGCTCA ACCGGATTCC CATCGAGCCC 1500 
CATCCTAAGG AGGAGGGCAA GCACCCGGAG AAGGAGCAGG AGGAAGGCGT CCCCAACAGC 1560 
GftCGAGGGCT ACTGGGACTC CACCACGCCA GGCCCAGAOG AAGACAGC1C GAGCAGCGGG 1620 
AAGAAGGCGG QCATCCCGCG GGATAGCTAC AGCGGGGACG CGCTCTATGA TCTCTATGCT 1680 
GADCOGGACG GAAGTCCAGC AACCCTTOCT GGAGGGAAGG ACAACGAGGA GACGTCCTCC 1740 
CTGPTCCCGGT TAAAGCCCGT ATCTCCAGGC ACCATCACCT GTCCACTG03 AACACCAGGC 1800 
AGCTTGCTGA AGGACTCTAA GATCCCTATT AGCATCAAGC ACCTGACCAA CCTTCCATCT 1860 
AGCCATCCCG TGGTGCADCA GCAACOCTCC AGGAGTGAGA TGCCCAGAAC AAAAATCCCG 1920 
GTTTCCAAAG TGCTGGTCCG CAGAGTCRGC AACCGGGGCT TGGCTGGGAC CACCATCAGA 1980 
GCAACQGCCT GCCACGACAG TGCCAAAAAG TTGTGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NO: 853 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

| | | | | | 
METSRSRGGG GAVSERGGAG ASVGVCRRKA EAGAGTGTTA ADMDLHCDCA AETPAAEPPS 60 
GKXNKAAFKt* FKKHKfiGGIM PSEFGVXNKO D3KSSGPTGL VRSRTUDGliA EVI»VI.HSGRK 120 
EEF5GGGD8G GGGGGRPNPG PPRAAGPGGG SLA888VAKS KSPPSLTiKKN GRSEKGKGEP 180 
VDASKAGGKQ KRQLRGLFSG MRKHRKDKHA KAEAABGRAP GGOLILPGfiL TASLECVKEE 240 
TPRAAREPEE P SQDAPRDP A GEPAGGEEVP APADRAPARfi CREAEGLAHP GDTGARGEDA 300 






AGHRRAEPGP GEVRTAEDAS RTGAVPVKTV PL.VDSEGGSQ RAPAAPDPAS VDPPSDPSAD 360 

RICLMPSDVT SLKSFDSLTG CGDXIADGBE EAGPSCDKHV PGPGKPAL9K KNPGWAYQG 420 

GGKEMASPDE VDDTYLQEPW DPHiSQTEEQQ PEPQEGAAKV AAALBTKWP ETPKDTRCVE 480 

AAKDASSVKR RRLNRIPIEP HFKEEPKHPE KEQQEOVPNS DEGYWDSTTP GPEEDSS8SG 540 

KKAGIPRnSY SGDALYDLXA DPD6SPATLP GGKDNEET5S 1*SRUCPVSPG TITCPIiRTPG 600 

9LLKDSKXPI SIKHLTHLPS SRPWHQQPS RSEMPRTKIP V g KVXiVRRVS NRGLAGTTIR 660 
ATACHDSAIOC L 



Seq ID NO: 854 DNA sequence 
Nucleic Acid Accession #: NM_014138.1 
Coding sequence: 60..854 

1 11 21 31 41 51 

| | | | | | 

CTQCAGAGAC TTCCCAGGAA GGTCCAGCGC CCTCTCAGCC TTCGTACTCA GAACAGCCGA 60 

TGATGGGCCT CAGTAACCTQ AGCCCCGGTC CTGGCCCCAG CCAGGCCGTG CCTCTCCCAG 120 

AGGGGCTGCT CCGCCAGCGG TACAGAGAGG AGAAGACCCT GGAAGAGCGG CGGTGGGAGA 180 

GGCTGGAQTT CCTTCAQAGG AAGAAAGCAT TCCTGCGOCA TGTGAGGAGG AGACAC0GCG 240 

ATCACATGGC CCCCTATGCT GTTGGGAGGG AAGCCAGAAT CTCCCCATTA GGTGACAGAA 300 

GTCAGAATCG ATTCCGATGT GAATGTCQAT ACTGCCAGAG CCACAGGCCG AATCTTTCTG 360 

GGATCCCTGG GGAGAGTAAC AGGGCCCCAC ATCCCTCCTC CTGGGAGACG CTGQTGCAGG 420 

GCCTCAGTGG CITGACTCTC AGCCTAQGCA CCAACCAGCC OGGGCCTCTG CCTGAAGCGG 480 

CACTOCAGCC ACAGGAGACA GAGOAGAAGC GCCAGCGAGA GAGGCAGCAG GAGAGCAAAA 540 

TAATGTTTCA GAGGCTGCTC AAGCAOTGGT TAGAGGAAAA CTGAGACBTG CACCCCCATG 600 

GGATGGAGAC COGAAGGGAC TCAGACGGAG CtJGCCGTGTT GGCAGGGCCT GGGTGTGGGC 660 

OCATTTTGGG GACCAAACAG CAAGCTGtfGG TCGGATGAGT G0CAGGACCT GTGTACCGGG 720 

ACACGTGGGA Gf CCTCGCAG CATGATGCTT GACTGACCCG AGGAAGGTCC TCATGTTTCG 780 

TGCCTGTCAT TCTCGGATGG CTGTGAGGCA TTOCTTGGCA AGGGACGCTG CBTACCAGCG 840 

GTCCTCACCG CATCTCACAT GGCTCCTGTG ATGCATOTTG TOGCTTTCCC AOCCGGOATC 900 

TCCATCTCTC TTCCCTTCCT GCTGTCAGTA AGAGATCACA TGTCTGTGTA OTGTGAATGC 960 

CTTGTCGCTG TCCTGTGCTT TTGCACCATT GAGTTGACTG CCTCTGAGAA GCAGCACTAG 1020 

GCCTGTTGAA ATGCAATGTG CTGCCCTGAG ATCCAGTTTC AAGAATGGGC AGGTAAACGC 1080 

AGTGTGGGAA AGGAATGTQG AATGAGAACI TGGTGGTTCA GCGCTGTACT ATTTGTGTAA 1140 

ATGTTTACGT ATQTGATAAG CTACATGTAT GTAAATGTTG CAATACCOCT AACAGTCGAG 1200 

TAGTAGTCTC CCTTACAGGA ATTTTTOACG GGGTTCCTCA TCATCAATAC CAAATAAATA 1260 

TATGTAGGAA TGCJAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1320 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1380 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO: 855 Protein sequence 
Protein Accession #: NP_054857.1 

1 11 21 31 41 51 

| | | | | | 

MMGLSH1»SPG PGPSQAVPLP EGLLRQRYRE EKTLEERRWB RLEFLQRKKA FLRHVRRRHR 60 

DHMAPYAVGR EARISPLGDR SONRERCECR YCQSHRPNL8 GIPGESNRAP EPSSHETLVQ 120 
GLSGIVTIiSUG TNQPGPIiPEA ALQPQETEEK RQRERQQESK IMFQRI.LXQW LEEtf 



Seq ID NO: 856 DNA sequence 
Nucleic Acid Accession #: NM_000025.1 
Coding sequence: 198..1424 

1 11 21 31 41 51 

| | | | | | 

GCTACTCCTC OCCCAAGAGC GGTGGCAOCG AGGGAGTTGG GGTGGGGGGA GGCTGAGCGC 60 

TCTGGCTGGG ACAGCTAGAG AAGATGGCCC AGGCTGGQGA AGTCGCTCTC ATGCCTTGCT 120 

GTCOCCTCCC CTGAGCCAGG TGATTTGGGA GACCCCCTCC TTCCTTCITT OCCTACCGCC 180 

CCAOGCQCGA CCOGGGGATG GCTCCGTGGC CTCAOGAGAA CAGCTCXCTT GCCCCATGGC 240 

CGGACCTCCC CACCCTGGCG CCCAATACCG CCAACAOCAG TGGGCTGCCA GGGGTTOCGT 300 

GGGAGGCGGC CCTAGCCGGG GCCCTQCTGG OGCTCK3CGGT GCTGGCCACC GTGGGAGGCA 360 

ACCTGCTGGT CATCGTGGCC ATOGCCTGGA CTOGGAGACT CCAGACCATG ACCAACGTGT 420 

TCQTGACTTC GCTGGGCGCA GC03ACCTGG TGATGGGACT CCTGGTGGTG OOGCCSGGCGG 480 

CCACCTTC3GC GCTGACTGGC CACTGGCCGT TGGGOGCCAC TGGCTGCGAG CTGTGGACCX 540 

CGGTGGAOGT GClGXG!TGTG AC3CGCCAGCA TCGAAACCCT GTGCGCCCTG GCCGTGGACC 600 

GCTACCISGC TGTGACCAAC CCGCTOOGTT AOGGCGCACI GGTCACCAAG CGCTGCGCCC 660 

GGACAGCTGT GQTCGTGGTG TQOGTCGTGT CGGGCGCGGX GTCGTTTGCG COCATCATGA 720 

GCCAGTGGTG GCGCGTAGGG GCCGADGCOG AGGCGCAGOG CTGCCACTCC AACCCGCGCT 780 

GCTGTGCCTT CGCCTCCAAC ATQCCCTACX3 7X3CTGCTGTC CTCCICCGTC TCCTTCTACC 840 

TTCCTCTTCT OGTGATOCTC TTCGTCTACG CGCGGGTTTT OGTGGTGGCT ACGCGCCAGC 900 

TGCQCTTtSCT GCGCGGGGAG GTGGGCCGCT TTGCGGCCGA GGAGTCTCCG CCGGCGCCGT 960 

OGCGCTCTCT GGCCCCGGCC COGGTGGGGA CGTGCGCTCC GCCCGAAGGG GTGCCCGCCT 1020 

GCGGCCGGCG GCCCGOGOGC CTCCTGOCTC TCOGGGAACA CCGGGCCCTG TGCACCTTGG 1080 

GTCTCATCAT GGGCACCTTC ACTCTCTGCT GGTTGCCCTT CTTTCTGOCC AACGTGCTGC 1140 

GCGCCCTGGG GGGCCCCTCT CTAGTCCCX3G GCCOGGCTTT CCTTGCGCTG AACTOGCTAG 1200 

GTTATGOCAA TTCTGCCTTC AACCCGCTCA TCTACTGCCG CAGCOCGGAC TTTOGCAGCG 1260 

CCTTCCGCCG TCTTCTGTGC CGCTGOGGCC GTCBCCTGCC TCCGGAGCCC TGCQCCGCX3G 1320 

CCOGCCCGOC CCTCTTCCCC TCGGGCGTTC CTGCGGCX30G GAGCAGCCCA GCGCAGCCCA 1380 

GGCTTTGCCA ACGGCTCGAC GGGGCTTCTT GGGGAGTTTC TTAGGCCTGA AGGACAAGAA 1440 

GCAACAACTC TGTTGATCAG AACCTGTGOA AAACCTCIGG CCTCTGTTCA GAATGAGTOC 1500 

CATGGGATTC CCCGGCTGTG ACACTCTACC CTCCAGAACC TGACGACTGG GCCAIGTGAC 1560 

OCAAGGAGGG ATCCTTACCA AGTOQQTXTT CACCATCCTC TTGCTCTCXG TCTGAGAGAT 1620 

GTTTTCTAAA CCCCAGCCTT QAACTICACT CJCtCOCTCAG TGCTAGTGTC CAGGIGCCGT 1680 

GGAGCAGCAG GCTGGCTTTG GTAGGOGCAC CCATCAOCCG GCTTGCCIGT GCAGTCftGTG 1740 

AGTGCTTAGG GCAARGAGAG CTCCCCTGOr TCCATTCCTT CTGCCACCCA AACCCTGATG 1800 




AGACCTTAGT 
AAGATTTGGG 
CTTTAGCCAT 
TTTTGGGAGC 
TTTGCAATCA 
GTTTGTTTTC 
CAGAGGCAGT 
AATGAAAAGT 
ACCCTTCCTG 
GGACTTGG AC 
GCATTTGTCC 
TGCGTGTGTG 
TCAAATOTCT 
TTCCCACTCA 
TTAC 



GTTCTCCAGG 
GTTTTATCTC 
GGAATGGCTC 
TCCAGGGTTC 
GATAAATAAA 
TTTTCATAAT 
AAATTAGGCC 
CTGTCTOOAC 
AAACTCTTGA 
AGAGGGCCCA 
CCTCCATGCT 
tgtgtgtgtq 
TGTGTCATAA 

ccttccccag 



CGATCTACCT 
ATAAGAAGGT 
TATCACTGAA 
CCACTTACTC 
TAATCCTCAC 
GAAGGGGAGT 
AATCCAOTTG 
CAGGGGGATG 
ATAATCCAGT 
TGTGTCTGGA 
ATATATTCTG 
CTACAAAAAT 



CAGGCTGAGA 
ATTACTGCTC 
CACAGCAGTG 
GAACCATTAG 
TGCAGTTCAT 
CCTTCCCTT C 
TCTTTTCTTC 
GAGCTTGAGC 
CCATTGAGTA 
GGCTQGCTGT 
GGTGCCTTAC 
GGCACAGGCA 
ATGTTTCCCA 
CTGTATTATC 



GCAGCAGGGT 
TCAAGCAGTG 
TCAGAAGGAC 
AACAGATCCC 
CCTCGGCCCA 
TACTCTGCGC 
CTAATCTTCA 
CTTTOATATC 
GCAAAGCCAC 
GGCCAGGTTT 
ATGGTGTGTG 
CAAAGCATTG 

atXmrccAC 

CTCTTAAAOT 



AGAAAAGACC 
GCCTCTCXCA 
TTCGCCAGGO 
TTCTTITCCT 
CTTTOCCTCC 
TGGCTTTTGA 
TCAAACAAAA 
TTGCTCCCCC 
GCTCCCCaCA 
AGGGCAGGGG 
TGTGTGTGTG 
CTTGGGTTGG 
AACCTCTACC 
AAAACTGGAG 



I860 
1920 
198 0 
2040 
2100 
2160 
2220 
22 SO 
2340 
2400 
2460 
2520 
2580 
2640 



Seq ID NO: 857 Protein sequence 
Protein Accession #: NP_000016.1 



1 
1 

MAPVfPHENSS 
AIAHTPEIjQT 
VTASIETI*CA 
GADABAQRCH 



FTLCWI.PFFL 
CRCGRRUPPE 



11 
I 

LAPWPDLPTI, 
MTNVFVTS1A 
LAVDHYLAVT 
8NPRCCAFAS 
FPAPSRSLAP 
ANVLRALGGP 
9CAAARPALF 



21 
I 

APNTANTSGIi 
AADLVHGLLV 
NPLRYGALVT 
NMPYVLLSSS 
AFVGTCAPPE 
SLVPGPAFIA 
PSGVPAARfiS 



31 
I 

PGVEWfiAALA 
VPPAATIiAIiT 
KKCASTAWIi 
VSPYIiPLLVM 
GVPACGRRPA 
LNWIiGYANSA 
PAQPRlCQRIi 



41 
1 

GAIXALAVIiA 
GHWPLGATGC 
WWSMVSF 
IiFVYAKVFW 
RLLFLREHRA 
FNPI>IYCRSP 
CGASWGVS 



51 
1 

TVGGNLlrVIV 
ELKTSVDVLC 
APIM9QWHEV 
ATRQkRIrLRG 
E#CT1«G1jIMGT 
DFKSAFRRLXi 



Seq ID NO: 858 DNA sequence 

Nucleic Acid Accession #: NM_032553.1 

Coding sequence: 37..1038 



l 

CACCATTAGG 
AGGCCAGATG 
ATTCTTGTGC 
AAAGAAACAA 
QTTCTTTCCT 
GGTCTCTGCA 
TTGGTCTGCA 
TGCAAACAQA 
TGTGTACTCT 
TTTGTGGATC 



GTTTTATCAC 
TTGAAGATGA 
AGTTTTCCTT 
GTGATTCTAA 
CCAGTCATAT 
CATGAGAGCA 

acacctgaat 
cagaacatat 
agttctgctc 



11 
I 

CAAAGATAOT 
QAGACAATAC 
CAGGTCTCAT 
AACGAGCTGT 
TGCCACTGAG 

TCAGTGTGCG 
AATATGAGCT 
TTCCACTCCT 
TTCCTACCAG 
TTGGGTTTGT 
TGCAAGATAA 
TTCTAACCTG 
TAGATTTCCT 
TATTTCATTC 
ACTACTTTTC 
TCCAACTCCA 
TATGCTAAAA 
CTGCAATAOC 
TATCTTACTQ 
CCACGATTAT 



21 
I 

TTCTCTAGAG 
AGATTTTCGA 
AGGGAATATA 
GATATTTATG 

gatcttctac 
ctacctqaag 

ACGATTTTGG 
GTACATCAGC 
CAGAACCAGT 
GAATGTCAAC 
AACTCCGCTT 
ATATCCCATG 
TGCAGGGGTA 
GGTGAACJTCC 
TGTGGCATTG 
CACTAATGAG 
TGCAAAATCC 
CAAAAAACCA 
CAAGCCACAG 
CTATGGGGAA 
TGATOTTGAC 



31 
1 

AGAATCATGC 
TACTTTATTT 
TTAGCCCTGT 
ATAAACTTA3 
TACT TGAAT C 
TATGTCAACA 
TTTCTCATGT 
ATTGCTGGCT 
GATBATACCT 
CTGGCCCAGT 
CTGATTGTCC 
GCCCAAGATC 
TTCCTAATTT 
AATGAAATTA 
TGTCTTGCTA 
TTCOGAAGAC 
TTTGTGAGTA 
AACTGAATGT 
GGAAGAACTT 
TTCACTTCTT 
ATGTCCATGT 



41 
I 

GTGCTAATTA 
ATQCAGTGAC 
GGGTATTCTA 
CCATTGCTGA 
ATGACXGGCC 



AGCCCTTTOG 
GGCTGATCAT 
CTGGCAATAG 
CCGTTGTTAT 
TATATTGTAC 
TTGGAGAGAA 
GCTTTGCACC 
AAAGCTGCCT 
GTCTGAATTC 
GGCTTTCAAG 
ACCAXACAGC 
GACCTGAAAT 
GCAAAACAAC 
CAAAGCAGGA 
AGTAATTTTT 



51 

i 

CACGTGTACC 
ATACACT <3T C 
TGGTTATATG 
CTTACTACAA 
ATTTGGGCCT 
CATCTACTTC 
CTTCCATGAC 
CTGCCTTQCC 
GACCAAATGC 
GATGACCATT 
CTGGAAGAOG 
ACAGAAAGGC 
TTATCATTTC 
AGCCAGAAGG 
ATGTCTTGAC 
ACAAGATTTG 
TTCCACCATG 
GCAAGTACAT 
ACAGCTT7TC 

CCTATTTGGA 
CTTCAAGT 



Seq ID NO: 859 Protein sequence
Protein Accession #: NP_115942.1

1 11 21 31 41

| | | | |

MPANYTCTRP DGDMTDFRYF IYAVTYTVIL VPGLIGNILA LWVPYGYMKE 
LAIADLLQVIi SLPIiHIPYYI. NHEWPFGPGL- CMFCF*LKYV KMYASIYFIiV 

MYPFRFHDCK OKYDLYTSIA CJHTjIICLACV LFPLfcRTSDD TSGURTKCFV 

QSWMMTIGE LIGFVTPiiLl VLYCTWKTVIi SIiQDKYPMAQ DLGBKQKAIJC 

ICPAPYHFSF PLDFLVK8NE IKgCLARHVI LlFHSVAIid. ASLHSCIiDPV 

RHIiSRQDXiHD 9IQLHAKSFV fiNHTASTHTP ELC 

Seq ID NO: 860 DNA sequence

Nucleic Acid Accession #: CAT cluster 



GATTCGGATT 
CCCGAGGGTT 
TCATATATAA 
1ACATTTTAA 
AAGAGTGGAC 
CCCTTTTTAT 
TCrAATTATT 
TGCCTCCAGG 
AATTTTCTCA 



11 
I 

TTAATCCGAC 
AGACATTTAA 
TTAATTTTCA 
GGAATTTATA 
TTGGTATCCG 
TTTCATCTTC 
TAGAACACTG 
TGCOAGTTTA 
ATTCGAAAAC 
ATTTCCTGTC 



21 
\ 

TCACTATAGG 
CTTGAGTCTT 
TCTGOGCATA 
CATTTGCATC 
TCTGATTATA 
TCTTTCTCTA 
TCTCCGAGTA 
TGTGTGTTTA 
TATTTTCCTT 
CTTGGTTTTT 



31 
1 

GAATTTTGGC 
TTCATATCAC 
TATTTCCAGA 
TQAAATTGTA 
TrTTCTTTOT 
GTGTTTATAA 
TGTGTT TAT T 
TGTTTCCTAT 
CAGCACTCAG 
CGTGTTCTTA 



41 
1 

CCTCGAGGCG 
GGTATGCTTT 
TTAGAAATCA 
TTCATTATCT 
TGGTAAAGAG 
TCTTATTTTT 
TGGGTTTGTG 
TTGAGGTTGG 
AAATTTTTAT 
ACCTAGAATT 



60 
120 
100 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



51 
1 

TKRAVIFMIN 
ClGVHfiFWFL 
DUTJUSVN1A 
MlI/TCAGVFIi 
IYYFSTNEFR 



51 
I 

AAGAATTCGG 
TATTTIACCC 
TTTCTTTTCA 
TCTAGAAACC 
OCTCTTTTTT 
ATGAATGTAG 
TTTTGAACAT 
ACACTTAGAA 
CAAAOATTAT 
CAATTGTTTA 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 






WO 03/042661 



ATGTTAGACC TTACACATCT TCAATGTATC ATATATATTT CACTCATATT ATTTATTGTC 
ATATCTTTTG GACTATTGTG TGAAAGCAAT AAATGTATAT TACTTTAAAA AAAAAAAAAA 
AAAACTHCAT TTTCAAATTT AATACACACA AATTGTGCAT ACOCACACAC ATATACCACA 
GATATATTCA TTTTCAGGAT T T CTTTAAGT GTTATTTTAA AAATAATCAT ATTCT TATTT 
TQTGAACTGT GGGATTTCTT GAAACTTTTT GAAAACTCTC ATTAGTCTGT GTTT GTO TQG 
TTTCTTTAAA TTATCTCTTC CTTCAGAATT ACTTTTGTGT TTTGAATTTG AGTCTTTGCC 
ATTCATAAGG CGGGCTTTCC TTGAATACCA GATAACTCTG AGTTTCCTGT TCATATTTAA 
GAATAAATAA CCTAATTTGA AAAAAAAAAA AAAAA

Seq ID NO: 861 DNA sequence
Nucleic Acid Accession #: NM_014522.1
Coding sequence: 846..3911

1 11 21 31 41 51

| | | | | |

CTGGTGGTCC AGTACCTCCA AAGATATGGA ATACACTCCT OAAATATCCT GAAAACTTTT 60 
TTTTTTCAGA ATCCTTTAAT AAGCAGTTAT GTCAATCTGA AAGTTGCTTA CTTGTACTTT 120
ATATTAATAG CTATTCTTGT TTTTCTTATC CAAAGAAAAA TCCTCTAATC CCCTTTTCAC 180 
ATQATAGTTG TTACCATGTT TAGGCATTAG TCACATCAAC CCCTCTCCTC TCCCAAACTT 240 
CTCTTCTTCA AATCAAACTT TATTAGTOCC TCCTTTATAA TGATTCCTTG CCTCGTTTTA 300 
TCCAGATCAA TTTTTTTTCA CTTTGATGDC CAGAGCTGAA GAAATGGACT ACTGTATAAA 360 
TT AT TCATTG CCAAGAGAAT AATTGCATTT TAAACCCATA TTATAACAAA GAATAATGAT 420 
TATATTTTGT GATTTOTAAC AAATACCCTX TATTTTCCCT TAACTATTGA ATTAAATATT 480
TTAATTATTT GTATTCTCTT TAACT AT CT T GGTATATTAA ASTATTATCT TTTATATATT 540 
TATCAATGGT GGACACTTTT ATAGGTACTC TGTOTCATTT TTGATACTGT AGGTATCTTA 600 
TTTCATTTAT CTTTATTCTT AATGTACGAA TTCATAATAT TTGATTCAGA ACAAATTTAT 660 
CACTAATTAA CAOAOTGTCA ATTATGCTAA CATCTCATTT ACTGATTTTA ATTTAAAACA 720 
GTTTTTGTTA ACATGCATGT TTAGGGTTGG CTTCTTAATA ATTTCTTCTT CCTCTTCTCT 780 
CTCTCCTCTT CTTTTGGTCA GTGTTGTGOG GGTTAATACA ACAAACTGTA AGAAGTGTAC 840 
CTGGTATGGA CTTGTTGTCC GGGAGGTACA TTTTCGCGGT CCTGCTAGCA TGCGTGGTGT 900 
TCCACTCTGG OGCCCAGGAQ AAAAACTACA CCATCCGAGA AGAAATGCCA GAAAAOGTCC 960 
TGATAGGCGA CTTOTTOAAA GACCTTAACT TGTCGCTGAT TCCAAACAAG TCCTTGACAA 1020 
CTOCTATGCA QTTCAAGCTA GTGTACAAGA CCGGAGATGT QCCACTGATT CGAATTGAAG 1080
AGGATACTGG TGAGATCTTC ACTACTGGCG CTCGCATTGA TCGTGAGAAA TTATGTGCTG 1140 
QTAiCCCAAG GGATGAGCAT TGCTTTTATG AAGTGGAGOT TGCCATTTTG COGGATGAAA 1200 
TATTTAGACT GGTTAAGATA CGTTTTCTGA TAGAASATAT AAATOATAAT GCACCATTGT 1260
TCCCAGCAAC AGTTATCAAC ATATCAATTC CAGAGAACTC GGCTATAAAC TCTAAATATA 1320 
CTCTCCCAGC GGCTGTTGAT CCTGACGTAG GAATAAACGG AGTTCAAAAC TACQAACTAA 1380
TTAAGAGTCA AAACATTTTT GGCCTOGATG TCATTGAAAC ACCAGAAGGA GACAAGATQC 1440 
CACAACTGAT TGTTCAAAAG GAGTTAGATA GGGAAGAGAA GGATACCTAC GTGATGAAAG 1500 
TAAAGGTTGA AGATGGTGGC TTTCCTCAAA GATCCAGTAC TGCTATTTTG CAAGTGAGTG 1560 
TTACTGATAC AAATGACAAC CACCCAGTCT TTAAGGAGAC AGAOATTGAA GTCAGTATAC 1620 
CAGAAAATGC TCCTGTAGGC ACTTCAGTGA CACAGCTCCA TGCCACAGAT GCTGACATAG 1680 
GTGAAAATGC CAAGATCCAC TTCTCTTTCA GCAATCTAGT CTCCAACATT GCCAGGAGAT 1740 
TATTTCACCT CAATGCCACC ACTGGACTTA TCACAATCAA AGAACCACTG GATAGGGAAG 1800
AAACACCAAA CCACAAGTTA CTGGTTTTGG CAAGTGATGG TGGATTGATG CCAGCAAGAG 1860
CAATGGTGCT GOTAAATGTT ACAGATGTCA ATGATAATGT CCCATCCATT GACATAAGAT 1920 
ACATCQTCAA TGCTGTCAAT GACACAGTTG TTCXTTCA6A AAATATTCCA CTCAACACCA 1980
AAATTGCTCT CATAACTGTG ACGGATAAGG ATGCGGACCA TAATGGCAGG GTGACATGCT 2040 
TCACAGATCA TGAAATCCCT TTCAGATTAA GGCCAGTATT CAGTAAICAG TTCCTOCTGG 2100 
AGACTGCAGC ATATCTTGAC TATGAGTCCA CAAAAGAATA TGCCATTAAA TTACTGGCTG 2160 
CAGATGCTGG CAAACCTCCT TTGAATGAGT CAGCAATGCT CTTCATCAAA GTGAAAGATG 2220 
AAAATGACAA TGCTCCAGTT TTCACOCAGT CTTTCGTAAC TGTTTCTATT CCTGAGAATA 2280 
ACTCTCCTGQ CATCCftGTTG ACGAAAGTAA GTGCAATGGA TGCAGACAGT GGGCCTAATG 2340 
CTAAGATCAA TTACCTGtZTA GGCCCTGATG CTCCACCTGA ATTCAGCCTG GATTOTCGTA 2400
CAGGCATGCT GACTGTAGTG AAGAAACTAG ATAGAGAAAA AGAGGATAAA TATTTATTCA 2460 
CAATTCTGGC AAAAGATAAC GGGGTACCAC CCTTAACCAG CAATGTCACA GTCTTTGTAA 2520 
GCRTTATTGA TCAGAATGAC AATAGCCCAG TTTTCACTCA CAATGAATAC AACCTCTATG 2580 
TCCCAGAAAA CCTTCCAAGG CATGGTACAG TAGOACTAAT CACTGTAACT GATCCTGATT 2640 
ATGGAGACAA TTCTGCAGTT ACGCTCTCCA TTTTAGATGA GAATGATGAC TTCACCATTG 2700 
ATTCAGAAAC TGGTGTCATC CGACCAAATA TTTGATTTGA TAOAGAAAAA CAAGAATCTT 2760 
ACACTTTCTA TGTAAAGGCT GAGGATGGTG GTAGAGTATC ACGTTCTTCA AGTGOCAAAG 2820 
TAACCATAAA TGTGGTTGAT GTCAATGACA ACAAAOCAGT TTTCATTGTC OCTCCTTCCA 2880
ACTGTTCTTA TGAATTGGTT CTACOGTCCA CTAATCCAGG CACAGTGGTC TTTCAGGTAA 2940 
TTGCTGTTGA C3VATGACACT GGCATGAATG CAGAGGTTCG TTACAGCATT GTAGGAGGAA 3000 
ACACAAGAGA TCTGTTTGCA ATCGACCAAG AAACAGGCAA CATAACATTG ATGGAGAAAT 3060 
GTGATGTTAC AGACCTTGGT TTACACRGAG TGTTGGTCAA AGCTAATGAC TTAGGACAGC 3120 
CTGATTCTCr CTTCAGTGTT GTAATTGTCA ATCTGTTCGT GAATGAGTCG GTGACCAATG 3180 
CTACACTGAT TAATGAACTG GTGCGCAAAA GCACTGAAGC ACCAGTGACC OCAAATACTG 3240 
AGATAGCTOA TGTATCCTCA CCAACTAGTG ACTATGTCAA GATCCTGGTT GCAGCTGTTG 3300 
CTGGCAOCAT AACTGTCGTT GTAGTTATTT TCATCACTGC TGTAGTAAGA TQTCGCCAGG 3360 
CZVCCACACCT TAAGGCTGCT CAGAAAAACA AGCAGAATTC TGAATGGGCT ACCCCAAACC 3420 
CAGAAAACAG GCAGATGATA ATQATGAAGA AAAAGAAAAA GAAGAAGAAG CATTCCCCTA 3480 
AGAACTTGCT GCTTAATTTT GTCACTATTG AAGAAACTAA GGCAGATGAT GTTGACAOTG 3540 
ATGGAAACAG AGTCACACTA GACCTTCCTA TTGATCTAGA AQAGCAAACA ATGGGAAAGT 3600 
ACAATTGGGT AACTACACx:T ACTACTTTCA AGCCCGACAG CCCTGATTTG GCCCGACACT 3660 
ACAAATCTGC CTCTCCACAG CCTGGCTTGC AAATTCAGCC TGAAACTCCC CTGAATTCGA 3720 
AGCACCACAT CATCCAAGAA CTGCCTCTCX3 ATAACACCTT TGTGGOCTGT GACTCTATCT 3780 
CCAAG^GTTC CTCAAGCAGT TCAGATCOCT ACAGCGTTTC TGACTGTGGC TATCCAGTGA 3840 
CGAOCTTCGA GGTACCTGTG TCOGTACACA CCAGACCGGT AGQTATCCAA GTTTCTAACA 3900 
CAACTTTCTA ACTATTTTTT TATTATTATT TTCAGTTGAT GTAGAACTTT ACAAAATCTA 3960 
TTGACTTCAA AGAGGGATCA AAACAATCAT ATTCTACAGA TGTACCCAAT AGATATATGG 4020 
ATTCAATTAA GTTTGGTAGA AGATGAGAAC AAAATAACTA CTGATTTAGG AAAATTGGAT 4080 
GCAGAATAAT AATTATAGTA GGGGCAATTT TGTCTGTAOA TGGCAGTATG ACAATTCTTG 4140 






PCT/US02/36810 



CTAGAGAATA 
TTGTGTGCAT 
TCTGATAAAT 
TGTQGCTGTT 
GGTCCTCATA 
AAATCTACAG 
TCATCTXTAA 
GCATATTTAT 



TATTGAAAAA 
GAGGATCAGA 
AGCTCTAAAT 
CACAGAAATT 
AAGAAAAOTG 
TCACACAATA 
GCCAGAGAAT 
GCAAQGTATr 



AACTTCAACA 
ATAGTCTGGG 
ACTAATGATA 
CAGCACCAAG 
TCGTTTTCAG 
TAACAAGAAT 
GAAATATATA 
TTGAGCCCTT 



CAAAGGGTTG 
CTAGATACAT 
TTGAGAAGCC 
TTATTCCCCC 
ATTAGOAACT 
GGGATTAGAA 
TGAGGTCTCT 
CAGAAGACAT 



TAGCACTGTC 
CACATTAAAG 
TAGCTTCACT 
CMACTCTAC 
CAAAATTATT 
AAATGAAAGC 
GGATAGCTAT 
TCT 



Seq ID NO: 862 Protein sequence
Protein Accession #: NP_055337.1



1 
I 

KDIjLSGTYIP 
MQPKL.VYKTG 
RIjV KI RfTi IB 
6QHIFGLDVI 
BTNDNHPVFK 
HLHAXTGLI T 
VNPVNDTVVTi 
AAYIiDYESTK 
PGIQDTKVSA 
lAKDNGVPPL 
DNSAVTL.S1L 
INWDVNDNK 
m>LFAir>QST 
iiSIEIiVRKST 
HLKAAQKNKQ 
NUVTIjDZiFIE 
HI IQELPIiDN 
P 



11 
} 

AVIiliACWFM 
DVPIilRIEBD 
DINDNAPLFP 
ETPEGDKMPQ 
ETEIEVSIPE 
IKBPLDREET 



EVA3KIXAAD 
KOADSGPNAK 
TSNVTVFVSI 
DENBDFTIDC 
FVFTVPPSNC 
GNITLMEKCD 
EAPVTPNTEI 
NSEWATPNPE 
LEBQTHGKYN 
TFVACDSISK 



21 
I 

SGAQEKNYTI 
TGEIPTTGAR 
ATVINISIPE 
IiIVQKELiDRE 
NAPVGTSVTQ 
PHnKLIiVXAS 
AL.ITVTDKDA 
AGKPPIjMQSA 
TNYLLGPDAP 
IDOKDNSPVF 
QTGVIRFHIS 
SYKLVIaPSTN 
VTDLGIjHRVIi 
ADVSSPTSDY 
KRQMIMHKKK 
WVTTPTTFKP 



31 
I 

REEWPENVIiI 
rDHKKLCAGI 
NSATHSKYTL 
EKDTYVMKVK 
LHATDADIGE 
DGGlMPARAM 
DHNGRVTCFT 
MLFIKVKDEN 
PBFSLDCRTG 
THNEYNFYVP 
PDREKQESYT 
PGTWFQfVlA 
VKAND&GQPD 
VXIIiVAAVAQ 
KKKKKHSPKN 
BSPDLAKHYK 
VfiDCGYPVTT 



41 
I 

GDLIiKDLltfLS 
PRDEHCPYEV 
PAAVDPDVGI 
VEDGGPPQRS 
NAXIHFSFSN 
VIiVNVTDVHD 
DEEIPFRLRP 
DNAPVFTQSF 
MIjTWKKLDR 
ENLiPRHGTVa 
FYVKAEDGGR 



SLFSWJVN1* 
TITVWVIPI 
LIiIMFVTIEE 
SASPQPAFQI 
FEVPV5VHTR 



CTCAGTACCA 
CTTTTCAGAA 
TGGGAAAATC 
CAGGCCTTCA 
TTXSGTGCATC 
CTACTCATTC 
TTAAATATXT 



SI 
I 

LIPNKSIVTTA 
EVAILPDEIF 
NGVQNYEIiIK 
STAlIiQVSVT 
LVSNIARRIiF 
NVPSIDZRYT 
VFSNQFLI»BT 
VTVSIPENNS 
EKEDKYLFTI 
LITVTDPDYG 
VSRSSSAKVT 
VRYSXVGGNT 
FVNESVTHAT 
TAWRCRQAP 
TKADDVDSDG 
0PBTP1J»SKH 
PVGIQVSNTT 



4200 
4260 
4320 
4380 
4440 
4500 
4560 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 



Seq ID NO: 863 DNA sequence
Nucleic Acid Accession #: XM_059180.2
Coding sequence: 276..3740

1 11 21 31 41 51

| | | | | |

OCOGCGGCCS CGGAGTATCC TGGAGCTGCA GACAGTGCGG GCCTGCGCCC AGTCCCGGCT 60

GTCCTCGCCG CGACCCCTCC TCAGCCCTGG GCGCGCGCAC GCTGGGGCCC CGCGGGGCTG 120 

GCCQCCTAGC GAGCCTGCCG GTCGACCCCA GCCABCGCAG CGACGGGGCG CTGCCTGGCC 180

CAGG03CACA CGGAAGTGCG CTTCTCTGAA GTAGCTTTGG AAAGTAGAGA AGAAAATCCA 240 

GTTTGCTTCT TGGAGAACAC TGGACAGCTG AATAAATGCA GTATCTAAAT ATAAAAGAGG 300 

ACCGCAATGC CATGGCTTTC TGTGCTAAAA TGAGGAGCTC CAAGAAGACT GAGGTGAACC 360

TGGAGGCCCC TGAGCCAGGG GXGGAAGTGA TCTTCTATCT GTCGGACAGG GAGCCCGTCC 420 

GGCTGGGCAG TGGAGAGTAC ACAGCAGAGG AACTGTGCAT CAGGGCTGCA CAGGCATGCC 480 

QTATCTCTCG TCTTTGTGAC AACCTCTTTG CCCTGTATGA CGAGAACACC AAGCTCTGGT 540 

ATGCTCCAAA TOGCACCATC ACGGTTGATG ACAAGATGTC CCTCCGGCTC CACTAOCGGA 600

TGAGGTTCTA TTXCACCAAT TGGCATGGAA CCAACGACAA TGAGCAGTCA GTGTGGCGTC 660

ATTCTCCAAA GAAGCAGAAA AATGGCTAOG AGAAAAAAAA GATTCCAGAT GCAACCCCTC 720 

TCCTTGATGC CAGCTCACTG GAGTATCTGT TTGCTCAGGG ACAGTATGAT TTGGTGAAAT 780

GCCTGGCTCC TATTCGAGAC CCCAAGACCG AGCAGGATGG ACATGATATT GAGAACGAGT 840 

GTCTAGGGAT GGCTGTCCTG GCCATCTCAC ACTATGCCAT GATGAAGAAG ATGCAGTTGC 900 

CAGAACTGCC CAAGGACATC AGCTACAAGC GATATATTCC AQAAACATTG AATAAGTCCA 960

TCAGACAGAG GAACCTTCTC ACCAGGATGC GGATAAATAA TGTTTTCAAG GATTTCCTAA 1020 

AGGAATTTAA CAACAAGACC ATTTGTGACA GCAGCGTGTC CACGCATGAC CTGAAGGTGA 1080

AATACTTGGC TACCTTGGAA ACTTTGACAA AACAOTACGG TGCTGAAATA TTTGAGACTT 1140 

CXZATGTTACT GATTTCATCA GAAAATGAGA TGAATXGGTT TCATTCGAAT GACGGTGGAA 1200 

ACGTTCTCTA CTACGAAGTG ATGGTGACTG GGAATCTTGG AATCCAGTGG AGGCATAAAC 1260

CAAATGTTGT TTCTGTTGAA AAGGAAAAAA ATAAACTGAA GCGGAAAAAA CTGGAAAATA 1320 

AACACAAGAA GGATGAGGAG AAAAACAAGA TCCGGGAAGA GTGGAACAAT TXTTCTTACT 1380 

TCCCTGAAAT CACTCACATT GTAATAAAGG AGTCXGTGGT CAGCATTAAC AAGCAGGACA 1440 

ACAAGAAAAT GGAACTGAAG CTCTCTTCCC ACOAGGAGGC CTTSTCCTTT GTGTCCCTGG 1500 

TAGATGGCTA CTTCCGGCTC ACAflCAGATG CCCATCATTA CCTCTGCACC QAOSTGGCCC 1560

CCCCGTTGAT CGTCCAGAAC ATACAGAATG GCTGTCATGG TUDCAATCTGT ACAGAATACG 1620 

CCATCAATAA ATTGCGGCAA GAAGGAAGCG AGGAGGGGAT GTACGTGCTG AGGTGGAGCT 1680

GCACCGACTT TGACAACATG CTCATGACCG TCACCT0CTT TGAGAAGTCT GAGGAGGTGC 1740 

AGGGTGCCCA GAAGCAGTTC AAGAACSTTC AGATCGAGGT GCAGAAGGGC CGCTACAGTG 1800

TGCACGGTTC GGACCGCAGC TTCCCCAGCT TGGGAGACCT CATGAGCCAC CTCAAGAAGC 1860

AGATCCTGCG CACGGATAAC ATCAGCTTCA TGCTAAAACG CTGCTGCCAG CCCAAGCCCC 1920 

GAGAAATCTC CAACCTGCTG GTGGCTACTA AGAAAGCCCA GGAGTOGCAG CCCGTCTACC 1980 

CCATGAGCCA GCTGAGTTTC GATCGGATCC TCAAGAAGGA TCTGGTGCAG GGCGAGCACC 2040 

TTGGGAGAGG CACGAGAACA CACATCTATT CTGGGACCCT GAT GOAT TAG AAGGATGACG 2100 

AAGGAACTTC TGAAGAGAAG AAGATAAAAG TGATCCTCAA AGTCTTAGAC CCCAGCCACA 2160

GGGATATTTC CCTGGCCTTC TTCGAGGCAG CCAGCATGAT GA3ACAGGTC TGCCACAAAC 2220 

AGATE COT GTA CCTCTATGGC GTCTGTGTCC QCGACGTGGA GAATATCATG GTGGAAGAGT 2280 

TTGTGGAAGG GGGTCCTCTG GATCTCTTCA TGCACCGGAA AAGCGATGTC CTTACCACAC 2340 

CATGGAAAT7 CAAAGTTGGC AAACAGCTGG CCAG^GCCCT GAGCTACTTG QAGGATAAAG 2400 

80 ACCTGQTCCA TGGAAATGTG TGTACTAAAA AOCTCCTCCT GGOOCGTGAG GQCATCGACA 2460 

GTGAGTGTGG CCCATTCATC AAGCTCAGTG ACCCOGGCAT CCCCATTACG GTGCTGTCTA 2520 

GGCAAGAATG CATTGAAGGA ATCCCATGGA TTGCTCCTGA GTGTGTTGAG GACTCCAAGA 2580

ACCTQAGTGT GGCTGCTGAC AAGTGGAGCT TXGGAACCAC GCTCTGGGAA ATCTGCTACA 2640 

ATGGCGAGAT CCCCTTGAAA GACAAGACaC TQATTGAGAA AGAGAGATTC TATGAAAGCC 2700 
GGTQCAGGCC AGTGACACCA TCATGTAAGG AGCTGGCTGA CCTCATGACC CGCXGCATGA 2760

ACTATGACCC CAATCAGAGG CCTTTCXTCC GAGCCATCAT GAGAGACATT AATAAGCTTG 2820 

AAGAGCAGAA TCCAGATAXI GTITCAQAAA AAAAACCAGC AACXGAAGTG GACCCCACAC 2880 

ATTTTGAAAA GCGCTTCCTA AAGAGGATOC GTGACTXGGG AGAGGGCCAC TTTGGGAAGG 2940 

TTGAGCTCTG CAGGTAIGAC CCOGAAGGGG ACAAIACAGG GGAGCAGGTG GCTGTTAAAT 3000

CTCTOAAGCC TGAGAGTGGA GGTAACCACA TAGCTGATCT GAAAAAGOAA ATCGAGATCT 3060

TAAGGAACCT CTATCATGAG AACATTGTOA AGTACAAAGG AATCTGCACA GAAGACGGAG 3120 

GAAATGGTAT TAAGCTCATC ATGGAATTTC TGCCTTCGGG AAGCCTTAAG GAATATCTTC 3180

CAAAGAATAA GAACAAAATA AACCTCAAAC AGCAGCTAAA ATATGCCGTT CftGATTTGTA 3240 

AGGGGATGGA CTATTTGGGT TCTCGGCAAT ACGTTCACCG GGACTTGGCA GCAAGAAATG 3300 

TCCTTGTTGA GAGTGAACAC CAAGTGAAAA TTGGAGACTT OGGTTTAACC AAAGCAATTG 3360 

AAACCGATAA GGAGTATTAC ACCGTCAAGG ATGACCGGGA CAQCCCTGTG TTTTGGTATG 3420 

CTCCAGAATG TTTAATGCAA TCTAAATTTT ATATTGCCTC TGACGTCTGG TCTTTTGGAG 3480 

TCACTCTQCA TGAGCTGCTG ACTTACTGTG ATTCAGATTC TAGTCCCATG GCTTTGTTCC 3540 

TGAAAATGAT AGGCCCAACC CATGGCCAGA TGACAOTCAC AAGACTTGTG AATACGTTAA 3600 

AAGAAGGAAA ACGCCTGCCG TGCCCACCTA ACTGTCCAGA TGAOGTTTAT CAACTTATGA 3660 

GGAAATGCTG GGAATTCCAA CCATCCAATC GGACAAGCTT TCAGAACCTT ATTGAAGGAX 3720 

TTGAAGCACT TTTAAAATAA GAAGCATGAA TAACATTTAA ATTCCACAGA TTATCAAGTC 3780 

CTTCTCCTGC AACAAATGCC CAAGTCATTT TTTAAAAATT TCTAATGAAA QAAQTTTGTG 3840 

TTCTGTCCAA AAAGTCACTG AACTCATACT TCAGTACATA TACATGTATA AGGCACACTG 3900 

TAGTGCTTAA TATGTGTAAG GACTTCCTCT TTAAATTTGG TAOCAGTAAC TTAGTGACAC 3960 

ATAATGACAA CCAAAATATT TGAAABCACT TAAGCACTCC TCCTTGTGGA AAGAATATAC 4020 

CACCATTTCA TCTGGCTAGT TCACCATCAC AACXGCATTA CCAAAAGGGG ATTTTTGAAA 4080 

ACGAGGAGTT GACCAAAATA ATATCTGAAG ATGATTGCTT TTCCCTGCTG CCAGCTGATC 4140 

TGAAATGTTT TGCTGGCACA TTAATCATAG ATAAAGAAAG ATTGATGGAC TTAGCCCTCA 4200 

AATTTCAGTA TCTXTACAGT ACTAGAOCAT GCATTCTTAA AATATTAGAT ACCAGCTA3T 4260 

ATATATTGTT TCTGTACAAA AATGACTOTA TTCTCTCTACC AGTAGOACTT AAACTTTGTT 4320 

TCTCCAGTGG CTTAGCTCCT GTTOCTTTGG GTQATCRCTA GCACCCATTT TTGAGAAAGC 4380 

TGGTTCTACA TGGGGGGATA GCTGTGGAAT AGATAATTTG CTGCATGTTA ATTCTCAAGA 4440 

ACTAAGCCTG TGCCAGTGCT TTCCTAAGCA QTATACCT XT AATCAGAACT CATTCCCAGA 4500 

ACCTGGATGC TATTACACAT GCTTTTAAGA AACGTCAATG TATATCCTTT TATAACTCTA 4560 

CCACTTTGGG GCAAGCTATT CCAGCACTGG TTTTGAATGC TGTATGCAAC CAGTCTGAAT 4620 

AOCACATADB CTGCACTGTT CTTAGAGGGT TTCCATACTT ACCACCGATC TACAAGGGTT 4680 

GATCCCTGTT TTTACCATCA ATCATCACCC TGTGGTGCAA CACTTGAAAG ACCCGGCIAG 4740 

AGGCACTATG GACXTCAGGA TCCACTAGAC AGTTTTCAGT TTGCTTGGAG GTAGCTGGGT 4800 

AATCAAAAAT GTTTAQTCAT TGATTCAATG TOAACGATTA CGGTCTTTAX GACCAAGAGX 4860

CTGAAAATCT TTTTGTTATG CTGTTTAGTA ITOGXTXGAT ATTGTTACTX TTCACCTGTT 4920

GAGCCCAAAT TCAGGATTGG TTCAGTGGCA GCAATGAAGT TGCCATTTAA ATTTGTTC3VT 4980 

AGCCTACAIC ACCAAGGTCT CTGTGTCAAA CCTGTGQCCA CTCTTATATGC ACTTTGTTTA 5040 
CTCTTTATAC AAATAAATAT ACTAAAGACT TT 

Seq ID NO: 864 Protein sequence
Protein Accession #: A39577

1 11 21 31 41 51

| | | | | |

KQTUmCBDC MAMAFCAKMR SSXXTBVWLE APEPGVEVIF YXiSDHEPliRL GSGEYTAEEI. 60 

CIRAAQACRI SPLCHNIjFAL. YDEHTKLKYA FNRTITVDDK. MSLRLHYHMR FYFTNWHGTU 120 

DNEQSVWHHS PKKQKNGYEK XKIPDATPliL DASBLEYIiFA Q GQ YDIjVKCTj APTRDPKTEQ 180

DGHDIENECL GMAVLAISHY AMHKKKQItFE UPKDIEYKRY IPETLNKSIR QRNUjTRMIU 240 

NNVFKDFLKE FNNXTICD33 VSTHDl^KVKY lATLETI/THH YGAEIFBTSM LLIS8BNEHK 300 

KFHSNDGGNV LYYEVMVTGN LGIQWBHKFN WSVEKEKNZ LKRKKLEHKD KKDEEKHKIR 360 

EBWNNFSFFP EITHTVjCKES WSINKQDNK KKEUCLgSHE EALSFVSLVD GYFRIVTADAH 420 

HYLCTDVAPP LIVHNIQNGC HGPICTBYAI NKLSQEGSEB GMYVLRWSCT DFDN2LMTVT 480 

CFEKCEQVQG AQKQFKHFQI EVQKGRYSU5 GSDR&FPSLG DLMSHLKKQI LRTDNISFML 540 

XRCCQPKPRE IHNLLVATKK AQEWQPVYPK SQLSFDRILX KDL.VQOEHIX5 RGTRTHIYSG 600 

TUiDY KDDEG TSEBKKIKVI LKVLDPSHRD 1SCAFFSAAS WMHQVSHKHI VYI*YGVCVRD 660 

VRNIMVEEFV EGGPLDLPMH RKSDVliTXPK KFKVAKQLAS ALSYliKDKDIi VHGHVCTKNL 720 

LLAREGIDSE CGPFIKLEDP GIPITVLSRQ ECXERlPHIA PECVEDSKNI* SVAADKMSFG 780 

TTLHEICYMG EIPLKDKTXjI EKBHFYKSRC UPVTPSCKBI. ADUiTHCMMY DPKQRP FFRA 840 

IMRDIKKLEE QNPDIVSRKK MQPTHVDPTH PEKRFLKRIS DLGEGHFGKV ELCRYDPEOH 900 

TGEQVAVKSL KPESGGNHIA DLKKEIEILR NI»YHEKTVKY KGICTEDGGW GIFJ»IMBETjP 960 

SG9I.KBY1*PK KKNKINLKQQ LKYAVQICKG MDYLGSRQYV HPDIAAKNVL VESEHQVKIG 1020

DSGLrTKAIET DKEYYTVKDD RD8PVFHYAP ECLMQSKFYX ASDVWSFGVT LHETiTtTYCDfi 1080 

DSSPMALFUC KIGPTHGQMT VTHLVNTLKE GKRUCPPNC PDEVYQLMRK CWEPQPSHRT 1140 
fiPQNLIEGFE AX.I.K 

Seq ID NO: 865 DNA sequence

Nucleic Acid Accession #: NM_004867.1

Coding sequence: 140..931

1 11 21 31 41 51

| | | | | |

GATCCCAGAC CTCGGCTTGC AGIAGTGTIA GACTGAAGAT AAAGTAAGTG CTGTTTGGGC 60 

TAACAGGATC TCCXCTTGCA GTCTGCAGCC CAGGACGCTG ATTCCAGCAG CGCCTTACCG 120 

CGCAGCCOGA AGATTCACTA TGGTGAAAAT CGOCXTCAAT ACCCCTACCG CCGTGCAAAA 180

GGAGGAGGOG DGGCAAGACG XGGAGGCCCT CCTGAGCCGC AOGGTCAOAA CTCAGAXACT 240 

GACCGGCAAG GAGCTCCGAG TTGCCACCCA GGAAAAAGAG GGCTCCTCTG GGAGATGTAT 300 

GCTTACTCTC TTAGGCCTTT CATTCATCTT GGCAJ3GACTT ATTGTTGGTG GAGCCTGCAT 360 

TTACAAGTAC TTCATGCCCA AGAGCACCAT TTACCQTGGA GAGATGTGCT XTTTTGAXXC 420 

TGAGGATGCT GCAAATTCCC TTCGTGGAGG AGAGCCTAAC TTCCTGCCTG TGACTGAGGA 480 

GGCXGACATT CGTGAGGAT3 ACAACATTGC AATCATTGAT GTGCCTGTCC CCAGTTTCTC 540 

TGATAGTGAC CCXGCAGCAA TTATTCATGA CXITGAAAAG GGAATGACTG CTTACCTGGA 600 

CTTGTTGCTG GGGAACTGCT ATCTGATGCC CCTCAATACT TCTATTGTTA TGCCTCCAAA 660 
AAATCTGGTA GAGCTCTTTG GCAAACTGGC GAGTGGCAGA TATCTGCCTC AAACTTATGT 720

GGTTCGAGAA GACCTAGTTG CTGTGGAGGA AATTCGTGAT GTTAGTAACC TTGGCATCTT 780

TATTTAGCAA CTTTGCAATA ACAGAAAGTC CTTCCGCCTT CGTCGCAGAG ACCTCTTGCT 840 

GGGTTTCAAC AAACGTGCCA TTGATAAATG CTGGAAQATT AGACACTTCC CCAACGAATT 900 

TATTGTTGAG AOCAAGATCT GTCAAGAGTA AGAGGCAACA GATAGAGTGT CCTTGGTAAT 960

AAGAAGTCAS AGATTTACAA TATGACTTTA ACATTAAGGT TTATGGGATA CTCAAGATAT 1020 

TTACTCATGC ATTTACTCTA TTGCTTATGC CGTAAAAAAA AAAAAAAAAA AAAAAAAAAA 1080 
AA 

Seq ID NO: 866 Protein sequence
Protein Accession #: NP_004858.1

1 11 21 31 41 51

| | | | | |

HVXIAFNTPT AVQKEEARQD VEALLSH.TVR TQILTGKELR VATQEKEG9S QRCMLTTiTiffTi 60

SFILAGLIVG GACIYKYFMP KSTIYRGEMC FFDSEDPAKS I/RGGEFHFI»P VTEEKDIRED 120 
DNIATIDVPV PSF6DSDPAA I IHDFKKGMT AYLDDLLGHC YLMPLKTSIV MPPKWLVEIiP 180 
GKLAfiGRYLP OTYWREDLV AVEEIRDVSN LGIFIYQLCN NRKSFRLHRR DLLLGFNKRA 240 
IDKCWKIRHF ENEFIVETKI CQE 






Seq ID NO: 867 DNA sequence

Nucleic Acid Accession #: XM_083862.1

Coding sequence: 121..813

1 11 21 31 41 51

| | | | | |

ACGCGGGAGG TCTGAGCTGT GGGCTGAGGC AGCGCAGCCG CTGCGCCAGG GTGCGCGATG 60 

CCTTGAACCT GGGAAACTAT GTGAAGCAAC ACTCTGGATT TTGAAAGACA TCTTTTCATC 120 

ATGGGACAGC AAATTTCGOA TCAGACACAQ TTGGTTATTA ACAAOTTACC AGAAAAAGTA 180 

GCAAAACATG TTAOSTTGGT TCGAGAGAGT GGCTCCTTAA CTTATGAAGA ATTTCTCGGG 240

AGAGIAGCTG AGCTTAATGA TGTAACGGCT AAAGTGGCTT CTGGCCAGGA AAAACATCTT 300

CTCTTTOAGG TACAACCTGG GTCTGATTCC TCTQCTTTTT GGAAAGTGGT TGTACGGGTG 360

GTCTGTACCA AGATTAACAA AAGCAGTGGC ATTGTGGAGG CATCAGOGAT CATGAATTTA 420 

TACCAGTTTA TTCAACTTTA TAAAGATATC ACAAGTCAAG CAGCAGGAGT ATOOGCACAG 480

AGCTCCACCT CTGAAGAACC TGATGAAAAC TCATCCTCTG TAACATCTTG TCAGGCTAGT 540

CTTTGGATGG GAAGGGTGAA GCAGCTGACC GATGAGGAGG AGTGT TGT AT CTGTATGGAT 600 

GGGCGGGCTG ACCTCATCCT GCCTTGTGCT CACAGCTTTT GTCAGAAGTG TATTGATAAA 660 

TGGAGTGATC GACACAGGAA T TG CCCT ATT TGTCGCCTAC AGATGACTGG AGCAAATGAA 720 

TCTTGGGTGG TATCAGATGC ACCCACTGAA GATGATATGG CTAACTATAT TCTTAACAT3 780 

GCTGATQAGG CAGGCCAGCC CCACAGQCCA TGACCTTGAA GTGAAAGTCT TCTGTTGCTA 840

TTGTGGGCTC AAATATTTGG TCATGGGGGA AGAATGTAGG GTTGTGGCAC TGGCACAGAC 900 

ACAGGAAAAT CCATTTTCCC CACTCTTTTA TTTTTGCTAT TCTGATCATT TGTCCCCCTT 960 
TTAAAAATAA ACTTCCCATG TCTTCCAAAA AAAAAAAAAA AAAAA 

Seq ID NO: 868 Protein sequence
Protein Accession #: XP_083862.1

1 11 21 31 41 51

| | | | | |

WGQQISDOTQ 1.VINKLPEKV AKHVUiVRES G9I»TYKKFIiG UVAELNDVTA KVASGQEKHL 60

liPBVQPGSDS SAFWKWVRV VCTKDHC9SG IVfiASRIMNIi YQFI(]LYKDI TSQAAGVIAQ 120 

SSTSEEFDEN SSSVTSCQAS OH4GKVKQLT DEEBCCICMD GRADL.ILPCA HSFCQKGIDK 180 
HSDRHKSTCPr CRLQMTGANE SWWSDAPTE DDMANYILN5? ADSAGQPHRP 

Seq ID NO: 869 DNA sequence

Nucleic Acid Accession #: NM_002975.1
Coding sequence: 180..1151

1 11 21 31 41 51

| | | | | |

OGACCAACGG ACCGGACAGA GACGAGGAGA GGAACAQGAA GAGAGAAGCT GGGAGAATCG 60 

GGAACCTGGa GGCTAGTGAC CTGCACACAG GGCAGGGGCA CTCGGCAGTT CCCAGAGGCC 120 

ACCCCTCCCA CCCCAGACAT CCAGACATCT OGAACXXTGG GTGCCAAGAG TCGAGCTTAA 180

TGCAGGCAGC CIGGCT3TTG GGGGCTTTGG TGGTCCCCCA GCTCTTGGGC TTTGGCCATG 240 

GGGCTCGGGG AGCAGAGAGG GAGTGGGAGG GAGGCltSGGG AGGTGCCCAG GAGGAGGAGC 300
GGGAGAGGGA GGCCCTGATG CTGAAGCATC TGCAGGAAGC CCTAGGACTG CCTGCTGGGA 360 
GGGGGGATGA GAATCCTGCC GGAACTGTTG AGGGAAAAGA GGACTGGGAG ATGGAGGAGG 420 
ACCAQGGGGA GGAAGAGGAG GAQGAAGCAA CGCCAACCCC ATCCTCCGGC CCCAGCCCCT 480 
CTCCCACCCC TGAGQACATC GTCACTTACA TCCTGGGCCS CCTGGCCGGC CTGGACGCAG 540 

GCCTGCAOCA GCTGCACGTC CGTCTGCAOG CGTTGGACAC CCGCGTGGXC GAGCTGACCC 600
AGGGGCTGCG GCAGCTGCGG AAOGCGGCAG GCGAGACCCG CGATGCCGTG CAAGCCCTGC 660 
AGGAGGCGCA GGGTCQCGCC GAGOGCGAGC ACGGCCGCTT GGAGGGCTGC CTGAAGGGQC 720 
TGCGCCTGGG CCACAAGTGC TTCCXGCTCT CGCGCGACTT CGAAGCTCAG QCGGCGGCGC 780 
AGGCGCGGTG CACOQCGCGG GGCGGGAGCC TGGCGCAGCC GGCAGACCGC CAGCAGATOG 840 

AGGCGCTCAC TCGGTAOCTG CGCGCGGCGC TCGCTCOCTA CAACTGGCCC GTGTGGCTGG 900
GCGTGCACGA TC5QGCQCG CC GAGGGCCTCT ACCTCTTCGA AAACGGCCAG CGCGTGTGCT 960 

TCTTCGCCTG GCATCGCTCA CCCCGCCCCG ASCTCGGCGC CCAGCCCAOG GCCTCGCCGC 1020 

ATCCGCTCAG CCCGGACCAG OCCAAGGGTG GCACGCTCGA GAACTGCGTG GCGCAGGCCT 1080 

CTQACGACGG CTCCTGGTGG GACCACGACX GCCAGCGGCG TCTCTACTAC GTCTGCGAGT 1140

TCCCCTTCTA GOGGGGCOGQ TACCCCGCCT CCTTGCCCAT CCCACCACCC GGCCTTTCCC 1200

TGCGCCGTGC CCACCCTCCT C3CGGAATCGC CCTTCCCTTC CTGGCCAGGA ATGGCAGCGT 1260 

CXTCCCCGAC CCCCAGTCTG GGCGCTTCTG GGAGGGCTCT TGCGGTGCCG GCACTCCTCC 1320 

TTGTTAGTGT CTTTCCTTGA AGGGGCGGGC ACCAGGCTAG GTCCGGTGCC AATAAATCCT 1380
TGTGGAATCT GAAAAAAAAA AAAAAAA 
Seq ID NO: 870 Protein sequence
Protein Accession #: NP_002966.1

1 11 21 31 41 51

| | | | | |

MQAAWLLGAL WPQLLQFGH GARGAEREWE GGWGGAQEEE RliREALMLKH LQEALGLPAG 60 

RGDENPAGTV EGKSDWEMEE DQGEKEEBEA TPTPSSGPSP BPTPEDIVTY ILGRLAGLDA 120 

GIjHQLHVRLH ALDTRWHLT QGLRQLRNAA GDTHDAVQAL QEAQGHAERE HGRLEGCLKG 180 

LRLGHKCPLL SRDFBAOAAA QARCTARGGS LAQPADRQQM EALTRYUtAA LAPYNWPVWI* 240 

GVHDKRAEGL YJ^PEHGQRVS FFAWHRSPRP ELGAQPSASP HPLSPDQPWG GTLENCVAQA 300 
SDDGSWMDHD CQRRIjYYVCE FPP 

Seq ID NO: 871 DNA sequence

Nucleic Acid Accession #: BC000839.1

1 11 21 31 41 51

| | | | | |

CTGGCCAGGG CGAClGGCGG ATAAGGTCTT GTGCGTGGCC TCGAGGCTTA AAAGTAGCAG 60 

TQGGGCTTTQ TGAAGGACAA AATGGCGATG GCGGGCCGTG TAGGTCCCCC TTCCTATGAT 120 

GAGGACCTTT TCACAGACCT GTACTGAGCT GCGTGAGGAT AAGTAACrCT GAGGAGATGG 180 

GOCCTGCAAG CCrCCTTCTT AGCCGTCTGT TCABAAAATA GCGTTTTCGA AATGCCCTGA 240 

GTTGACCrAA TGTCTTATTG GGCTCCTGTC TGCAGGATTT ACGGGCAOGT TGGAACCGAA 300 

GAGAGCTCTG TTGTTGCAAT GTTCAGCCCA CAAGAGCTTA CTGGTGAAGG AATGGGACAA 360 

GACCCATCTT TATGCAAAGC CAQCGTTACA GTAATGTTCC AGCATCTCAT AATCTATCCT 420 

GGGGAAITCA GCTGCCTCCC AGGGTGAATA CAGGTATTCC TGATGACAGT CTGCCTCTAT 480 

CTTACAGAGC AGCTTGTTGC TATATACCAT TGAAAAGCCT TCAGAGCTGA GAGGTACTAC 540 

TAACCAATAA CCTGCTTGGC TCAAAGGGOC AGCACCTTCT CTCTAAAGCC CAAGAGGAGT 600 

TTGAGGAAAA CTAGGTGTCT GTGTTCACTC CAGGCTGAAG TTACAGGTCT GAGCAAATAA 660 

GGTGTATAAA AAATGGAATC TGTCTTGGAG GACATCAGAA GGTGAATTTT CCAAGTTCTT 720 

GGACAAOCTA GCTGTTGAAA AGCTTTCTGG GTTTGGGGGG TATTTCAGAT GTAOCTTAAA 780

GTGTTAGCAG ATACAGATTA AQACACTGGG AGCCAATGAA AGAGCAGTTG AQGGTTTGCT 840

GTGTATCACA TTTCTGTATT TTATCACCCC CTTCCTGCAA CATTATTTAT CTGGAATCTA 900 

CC1GOCCTTT TGTTTTTTAG ATACAAGGGC TTGGTTTTGT TAOCCAGGCT GGTTTCAAGG 960 

CCATAGCTTT AAGAGATCCT CTCACCACAO ATTTCCAAAG TGCTBGGATT GCAGGTGTGA 1020 

TTCATGGCAC CCAGACTTTG CTGCCTTTCT TACATGATCC AGGCCCAGAA CCCAAACTCA 1080 

GGCACXGTAT AGATGACCaC TTTOGTAAAC TACTGACCTA GCTTGTTGCC AATTGTTGAT 1140 

TGAACTTCOC ATAACTCCAC TTCGTGTCTC TTCCTCTGTA TACAGCCACC TTCTGTTCCC 1200 

QTCATGAGCC TTTAGGTCTC CATTTGCATA TTGCAAATAC TATGTTCCAT GTAGGTAGCT 1260 

CATTCAGGGC CTTGCTCTTC ACTTCAAAAA AGGTTCCCTT GAGGACTGGC TGTCAATTTG 1320 

TGTTGCTGTG TTGGTTGTTG ATGAAAATAA TAAAATGATT GATTACATAA AAAAAAAAAA 1380 
AAAAAAA 

Seq ID NO: 872 DNA sequence

Nucleic Acid Accession #: CAT cluster

1 11 21 31 41 51

| | | | | |

TGCTAGTAGG GCCTGGGTTA ATCGGOCGAQ GGTGGCTTCG TGGTCTTTAT AGCTGTTACT 60 

CTTTGTACTT GTCTTTTTCT TTTAlTSJTCT TTTGAGCGAT TGTGCGAACA TAGCATAGCA 120 

CGCACTATGG CACGAGGCTC GTGCTGCCTG GCCAGGGCGA CTGGCGGATA AGGTCTTGTG 180 

CGTOQCCrCG AGGCTTAAAA GTAGCAGTGG GGCTTlGTGA AGGACAAAAT GGCGATGGOG 240 

GGCCGTGTAG GTCCCCCTTC CTATOATGAG GACCTTTTCA CAGAGCTGTA CTGAQCTCOG 300 

TGAGGATAAA TAACTCTGAG GAGATQGGCC CTGCAAGCCT CTTGCTTAGC CGTCTGTTCA 360 

GAAAATAGCG TTTTOGAAAT GCCCTG&GTT GACCTAATGT CTTATTGGGC TCCTGTCTGC 420 

AGGATTTACG CGCACGTTGG AACGGAAGAG AGCTCTGTTG TTGCAATGTT CAGCOCACAA 480 

GAGCTTACTG GTGAAGGAAT GGGACAAGAC CCATCT TTAT GCAAAGCCAG CGTTACAGTA 540 

ATGTTOCAGC ATCTCATAAT CTATCCTGGG GAATTCAGCT GCCTCCCAGG GTGAATACAG 600 

GTATTCCTGA TGACAGTCTG CCTCTATCTT ACAGAGCAGC TTGTTGCTAT ATACCATTGA 660 

AAAGCCTTCA GAGCTGAGAG GTACTACTAA CCAATAACCT GCTTGGCTCA AAGGGCCAGC 720 

ACCTTCTCTC TAAAGCCCAA GAGGAGTTTG AGGAAAACTA GGTGTCTGTG TTCACTCCA0 780 

GCTGAAGTTA CAGGTCTGAG CAAATAAGGT GTATAAAAAA TGGAATCTGT CTTGGAGGAC 840 

ATCAGAAGGT GAATTTTCCA AGTTCTTGGA CAAOCTAGCT GTTGAAAAGC TTTCTGGGTT 900 

TGGGGGGXAT TTCAGATGTA CCTTAAAGTG TTAGCAGACA CAGATTAAGA CACTGGGAGC 960

CAATGAAACA GCAGTTGAGG GTTT9CTGTG TATCACATTT CTGTAlrlTTA TCACCOOCTT 1020 

CCTGCAACAT TATTTATCTG GAATCTAOCT GCCCTTTTGT TTTTTAGATA CAAGGGCTTG 1080

GTTTTGTTAC CCAC3GCTGGT TTCAAGGCCA TAGCXTTAAG AGATCCTCTC ACCACAGATT 1140 

TCCMABGTGC TGGGATTGCZA GGW3TGATTC ATGGCAOCCA GACTTTGCTG CCTTTCTTAC 1200 

ATGATCCAGG CCCAGAACCC AAACTCAGGC AGTGTATAGA TGAOCACTTT OGTAAACTAC 1260 

TGACCTAGCT TGTTGCCAAT TGTTGATTGA ACTTOOCATA AGTCCACTTC GTGTCTGTTC 1320 

CTCTGTATAC AGCXACCTTC TGTTCCCGTC ATGAGOCTTT AGGTCTCCAT TTGCATATTQ 1380

CAAATACTAT QTTCCATGTA GGTAGCTCAT TCAGGGGCTt GCTCTTCACT TCAAAAAAGG 1440 

TTCCCTTGAG GACTGGCrrGT CAATTTGTGT TGCTGTGTTG GTTOTTGATG AAAATAATAA 1500

AATGATTGAT TACATAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAACAAA CACAAAAAAC 1560 
CGCOG0GCTT TTTCCCQGGC GCACAAAGTT ATAAAACGCC GTCCATC 

Seq ID NO: 873 DNA sequence

Nucleic Acid Accession #: NM_000222.1

Coding sequence: 22..2952

1 11 21 31 41 51

| | | | | |

GATCCCATCG CAGCTACCGC GATGAGAGGC GCTCGCGGCG CCTGGGATTT TCTCTGCGTT 60 
CTGCTCCTAC TGCTTCGCGT CCAGACAGGC TCTTCTCAAC CATCTGTCGAG TCCAGGGGAA 120 
CCETCTCCAC CATCCATCCA TCCAGGAAAA TCAGACTTAA TAGTCCGCGT GGGCGACOAG 180
ATTAGGCTGT TATGCACTGA TCCGGGCTTT GTCAAATGGA CTTTTGAGAT CCTGGATGAA 240
ACGAATGAGA ATAAGCAGAA TGAATGGATC ACGGAAAAGG CAGAAGCCAC CAACACCGGC 300
AAATACAOGT GCACCAACAA ACACGGCTTA AGCAATTCCA TTTATGTGTT TGTTAGAGAT 360
CCTGOCAAGC TTTTCCTTGT TGACOGCTCC TTGTATGGGA AAGAAGACAA CGACACGCTG 420
GTCCGCTGTC CTCTCACAGA CCCAGAAGTG ACCAATTATT CCCICAAGGG GTGCCAGGGG 480
AAGCCTCTTC CCAAGGACTT GAGGTTTATT CCTGACCCCA AGGOGGGCAT CATGATCAAA 540
AGTGTGAAAC GCGCCTAOCA TCGGCTCTGT CTGCATTGTT CTOTGGACCA GGAGGGCAAG 600
TCAGTGCTGT CGGAAAAATT CATCCK3AAA GTGAGGCCAG CCTTCAAAGC TGTGCCTGTT 660
GTGTCTGTGT CCAAAGCAAG CTATCTTCTT AGGGAAGGGG AAGAATTCAC AGTGAOGTGC 720
ACAATAAAAG ATGTGTCTAG TTCTQTGTAC TCAACGTGGA AAAGAGAAAA CAGTOAGACT 780 
AAACTACAGG AGAAATATAA TAGCTGGCAT CACX3QTGACT TCAATTATGA ACGTCAGGCA 840 
ACGTTGACTA TCAGTTCAGC GAQAGTTAAT GATTCTGGAG TGTTCATGTG TTATGCCAAT 900 
AATACTTTTG GATCAGCAAA TGTCACAACA ACCTTGGAAG TAGTAGATAA AGGATTCATT 960
AATATCTTCC CCATGATAAA CACTACAGTA TTTGTAAACG ATGGAGAAAA TGTAGATTTG 1020
ATTGTTGAAT ATGAAGCATT CCCCAAACCT QAACACCAGC AGTGGATCTA TATGAACAGA 1080
ACCTTCACTG ATAAATGGGA AGATTATCCC AAGTCTGAGA ATGAAAGTAA XATCAGATAC 1140 
GTAAGTGAAC TTCATCTAAC GAGATTAAAA GGCACCGAAG GAGGCACTTA CACATTCCTA 1200 
GTGTCCAATT CTGACGTCAA TGCTGCCATA GCATTTAATG TTEATGTGAA TACAAAACCA 1260 
GAAATCCTGA CTTACGACAG GCTOGTGAAT GGCATGCTCC AATGTGTOGC AGCAGGATTC 1320
CCAQAGDCCa CAATAGATTG GTATTTTTGT CCAGGAACTG AOCAGAGATG CTCTGCTTCT 1380
GTACTGCCAG TGOATGTGCA GACACTAAAC TCATCTGGGC CACCGTTTGG AAAGCTAGTG 1440 
GTTCAGAGTT CTATAGATTC TAGTGCATTC AAGCACAATG GCAOGGTTGA ATGTAAGGCT 1500 
TACAAOGATG TGGGCAAGAC TTCTGCCTAT TTTAACTTTG CATTTAAAGG TAACAACAAA 1560
GAGCAAATCC ATCCCCAC2AC CCTGTTCACT CCTTTGCTGA TTGGTTTCGT AATCGTAGCT 1620
GGCATGATQT GCATTATTGT GATGATTCTG ACCTACAAAT ATTTACAGAA AOCCATGTAT 1680 
GAAGTACAGT GGAAGOTTGT TGAGGAGATA AATGGAAACA ATTATGTTTA CATAGACCCA 1740 
ACACAACTTC CTTATGATCA CAAATGGGAG TTTCCCAGAA ACAGGCTGAG TTTTGGGAAA 1800
ACCCTGGGTG CTGQAGCTTT CGGGAAGGTT GTTGAGGCAA CTGCTTATGG CTTAATTAAG 1860
TCAGATGC3GG CCATGACTGT CGCTGTAAAG ATGCTCAAGC CGAGTGCCCA TTTGACAGAA 1920
CGGGAAGCCC TCATGTCTGA ACTCAAAGTC CTGAGTTACC TTQGTAATCA CATGAATATT 1980
GTGAATCTAC TTGGAGCCTG CACCATTGGA GGGCCCACCC TGGTCATTAC AGAAIATTGT 2040
TGCTATGGTG ATCTTTTGAA TTTTTTGAGA AGAAAACGTG ATTCATTXAT TIGTTCAAAG 2100 
CAGGAAGATG ATGCAGAAGC TGCACTTTAT AAGAATCTTC TGCATTCAAA GGAGTCTTOC 2160 
TGCAGOGATA GTACTAATGA GTACATGGAC ATGAAACCTG GAGTTTCTTA TGTTGTCCCA 2220

ACCAAGGOCG ACAAAAGGAG ATCTGTGAGA ATAGGCTCAT ACATAGAAAG AGATQTGACI 2280 
CCCGCCATCA TGGAGGATGA CGAGTTGGCC CTAGACTTAG AAGACTTGCT GAGCTTTTCT 2340 
TACCAGGTGG CAAAGGGCAT GGCTXTCCTC GCCTCCAAGA ATTGTATTCA CAOAGACTTG 2400 
GCAGCCAGAA ATATCCTCCT TACTCATGGT CGGATCACAA AGATTTGTGA TTTTGGTCTA 2460 
QCCAGAGACA TCAAGAATGA TTCTAATTAT GTGGTTAAAG GAAACGCTCG ACTACCTGTG 2520
AAGTGGATGG CACCTGAAAG CATTTTCAAC TGTGTATACA CGT1TGAAAG TGACGTCTGG 2580
TCCTATGGGA TTTTTCTTTG GGAGCTGTTC TCTTTAGGAA GCAGCCCCTA TCCTGGAATG 2640 
COGGTCOATT CTAAGTTCTA CAAGATGATC AAGGAAGGCT TCCGGATGCT CAGCCCTGAA 2700 
CACGCACCTG CTGAAATGTA TGACATAATG AAGACTTGCT GGGATGCAGA TCCCCTAAAA 2760 
AGACCAACAT TCAAGCAAAT TGTTCAGCTA ATTOAGAAGC AGATTTCAGA GAGCACCAAT 2820
CATATTTACT CCAACTTAGC AAACTGCAGC COCAACCGAC AGAAGCCCGT GGTAGACCAT 2880 
TCTGTGCGGA TCAATTCTGT CGGCAGCACC GCTTCCTCCT CCCAGCCTCT GCTTGTGCAC 2940 
GACGATGTCT GAGCAGAATC AGTGTTTGGG TCACCCCTCC AGGAATGATC TCTTCTTTTG 3000 
GCTTCCATGA TGGTTATTTT CTTTTCTTTC AACTTGCATC CAACTCCAGG ATAGTGGGCA 3060
CCCCACTGCA ATCCTGTCTT TCTGAGCACA CTTTAGTGGC CGATGATTTT TGTCATCAGC 3120

CACCATCCTA TTGCAAAGGT TCCAACTGTA TATAXTCCCA ATAGCAACGT AGCTTCTACC 3180 
ATGAACAGAA AACATTCTGA TTTGGAAAAA GAGAGGGAGG TATGGACTGG GGGCCAGAGT 3240 
CCTTTCCAAG GCTTCTCCAA TTCTGCCCAA AAATATGGTX GATAGTTTAC CTGAATAAAT 3300 
GGTAGTAATC ACAGTTGGCC TTCAGAACCA TCCATAGTAG TATGATGATA CAAGATTAGA 3360 
AGCTGAAAAC CTAAGTCCTT TATGTGGAAA ACAGAACATC ATTAGAACAA AGGACAGAGT 3420
ATGAACACCT GGGCTTAAGA AATCTAGTAT TTCATGCTGG GAATGAGACA TAGGCCATGA 3480 
AAAAAATGAT CCCCAAGTGT GAACAAAAGA TGCTCTTCTG TGOACCACTG CATGAGCTTT 3540 
TATACTACCG ACCTGGTTTT TAAATAGAGT TTGCTATTAG AGCATTGAAT TGGAGAGAAG 3600 
GCCTCGCTAG CCAGCACTTG TATATACGCA TCTATAAATT GTCOGTGTTC ATACATTTGA 3660 
GGGGAAAACA CCATAAGGTT TCGTTTCTGT ATACAACCCT GGCATTATGT CCACTGTGTA 3720
TAGAAGTAGA TTAAGAGCCA TAXAAGTTTG AAGGAAACAG TTAATACCAT TTTTTAAGGA 3780
AACAATATAA OCACAAAGCA CAGTTTGAAC AAAATCTCCT CTTTTAGCTG ATGAACTTAT 3840 
TCTGTAGATT CTGTGGAACA AGCCTATCAQ CTTCAGAATG GCATTGTACT CAATGGATTT 3900 
GATGCTGTTT GACAAAGTTA CTGATTCACT GCATGGCPCC CACAGGAGTG GGAAAACACT 3960 
GCCATCTTAG TTTGGATTCT TATGTAGCAG GAAATAAAGT ATAGGTTTAG CCTCCTTCGC 4020
AGGCATGTCC TGGACACCGG GCCAGTATCT ATATATGTGT ATGTACGTTT GTATGTGTGT 4080 
AGACAAATAT TTGGAGGGGT ATTTTTGCCC TGAGTCCAftG AGGGTCCTTT AGTACCTGAA 4140 
AAGTAACTTG GCTTTCATTA TTAGTACTGC TCTTGTTTCT TTTCACATAG CTGTCTAGAG 4200 
TAGCTTACCA GAAGCTTCCA TAGTGGTGCA GAGGAAGTGG AAGGCATCAG TCCCTATGTA 4260 
TTTGCAGTTC ACCTGCACTT AAGGCACTCT GTTATTTAGA CTCATCTTAC TGTACCTGTT 4320
CCTTAGACCT TCCATAATGC TACTGTCTCA CTGAAACATT TAAATTTTAC CCTTTAGACT 4380 
GTAGCCTGGA TATTATTCTT GTAGTTTACC TCTTTAAAAA CAAAACAAAA CAAAACAAAA 4440 
AACTCCCCTT CCTCACTGCC CAATATAAAA GGCAAATGTG TACATGGCAG AGTTTGTGTG 4500 
TTGTCTTGAA AGATTCAGGT ATGTTGCCTT TATGGTTTCC CCCTTCTACA TTTCTTAGAC 4560 
TACATTTAGA GAACTGTGGC CGTTATCTGG AAGTAACCAT TTGCACTGGA GTTCTATGCT 4620
CTCGCACCTT TCCAAAGTTA ACAGATTTTG GGGTTGTGTT GTCACCCAAG AGATTGTTOr 4680
TTGCCATACT TTGTCTGAAA AATTCCTTTG TGTTTCTATT GACTTCAATG ATAGTAAGAA 4740 
AAGTGGTTGT XAGTTATAGA TOTCTAOGTA CTTCAGGGGC ACTTCATTGA GAGTTTTGTC 4800 
TTGCCATACT TTGTCTGAAA AATTCCTTTG TGTTTCTATT GACTTCAATG ATAGTAAGAA 4860 
AAGTGGTTGT TAGTTATAGA TGTCTAGGTA CTTCAGGGGC ACTTCATTGA GAGTTTTGTC 4920
AATGTCTTTT GAATATTCCC AAGCCCATGA GTCCTTGAAA ATATTTTTTA TATATACAGT 4980
AACTTTATGT GTAAATACAT AAGCGGCGTA AGTTTAAAGG ATGTTGGTGT TCCACGTGTT 5040
TTATTCCTGT ATGTTGTCCA ATTGTTGACA GTTCTGAAGA ATTC 
Seq ID NO: B74 Protein sequence
Protein Accession #: NP_000213.1



1 
I 

MRGAHGANDF 
FGFVKHTFEI 



11 
I 

LCVKLLIjIiRV 
IiDETNENKQN 
JDTLVRCPIrTD 






RliCLHCSVDQ 
SVYSTWKRHN 
VTTTLEWDK 
DYPKSENESN 
LVNGMLQCVA 
SAFKHNGTVE 
MILTYKYLQK 
QKWEATAYG 
TIGGPTLVIT 



AFIASKNCIH 
IFNCVYTFES 
DIMKTCWBAD 
GSTASSSQPIj 



SQTKLQKKYN 
GFJNIFPMXM 
IRYVSET. T HT.T 
AGFPBPTXDH 
CKAYNDVGXT 
PMYEVQWKW 
LIKSDAAMTV 
EYCCYGDLLN 
WPTKADKRH 
RDIiAARNIIiL 
DVWSYGIFfcW 
PLKRPTFKQI 
LVHDDV 



21 
I 

QTGSSQPSVS 
EWITEKRERT 
PEUTNYSLKG 
ILKVRPAPKA 
SWHHGDFNYE 
TTVFVNDGEN 
RLKGTEGGTY 
YFCPGTEQBC 
9AYFNFAFKG 
EBINGNNYVY 
AVKMLKPSAH 
FLRRKRD3FI 
SVH1GSYIER 
THGRITKICD 
EUSLGSSPY 
VQLIEKQISE 



31 
I 

NTGKYTCTNK 
COGKPIiPKDIi 
VPWSVSKAS 
RQATIiTISSA 
VDLIVEYEAF 
TFLVSKSDVN 
SASVLPVDVQ 



IDPTQLPYDH 
LTEREAIiMSE 
CSKQEDHAEA 
DVT P AI MED D 
FGLARDIKND 
PGKPVDSKFY 
STHHIYSNLA 



41 
I 

PGKSDItlVKV 
HGIsSNSIYVF 
StFIPDPKAGI 
YIiLREGEEPT 
KVHDSGVFWC 
PKPEHQQWIY 
AAIAFNVYVN 
T&N9SQPFFG 
LFTPlJilGW 
KWEFPRKRLtS 
LKVLSYLGNH 
ALYKNLliHSK 
EiliALDIiEDUi 
SNYWKGNAR 
KMIKEGFRML 



51 
1 

GDEIHLIiCTD 
VRDPAKLFLV 
MIKSVKRAYH 
VTCTIKDVSS 
YANNTFGSAN 
MNRTFTPKWE 
TKPEILT5TDR 
KLWQS3IDB 
IVAGMMCIIV 
FGKTLGAGAF 
MNIVNIiLGAC 
ESSCSDSTNE 
SFSYQVAKGM 
IiPVKWMAPES 
SPEHAFABMY 
VDHSVRINSV 



60 
120 
180 
240 
300 
360
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



Seq ID NO: B75 DNA sequence
Nucleic Acid Accession #: NM_004456
Coding sequence: 58..2298

1 11 21 31 41 51

| | | | | |

GAATTCCGGG COACGCGCGG GAACAA03CG AGTOGGCGCG CGQGACGAAG AATAATCATG 60 

GC3CCAGACXG GGAAGAAATC TGAGAAGGGA CCASTTTGTT GGCGGAAGCO TGTAAAATCA 120 

GAGTACATGC GACTGAGACA GCTCAAGAGG TTCAGACGAG CTGATGAAGT AAAGAGTATG 180 

TTTAGTTCCA ATCGTCAGAA AATTTTGGAA AGAACOGAAA TCTTAAAOCA AGAATGGAAA 240 

CAGCGAAGGA TACAGCCTGT GCACATCCTG ACTTCTGTGA GCTCATTGCG CQGGACTAGQ 300 

GAGTGTTCGG TGACCAGTGA CTTGGATTTT CCAACACAAG TCATCCCATT AAAQAClCTG 360 

AATGCAGTTO CTTCAGTACC CATAATGTAT TCTTGGTCTC CCCTACAGCA GAATTTTATG 420 

GTGGAAGATG AAACTGTTTT ACATAACATT CCTTATATGG GAGATGAAGT TTTAGATCAG 480 

GATGGTACTT TCATTGAAGA ACTAATAAAA AATTATGATG GGAAAGTACA CGGGGATAGA 540

GAATGTGGGT TTATAAATGA TGAAATTTTT GTOQAGTTGG TGAATGCCCT TQGTCAATAT 600 

AATGATGATG ACGATGATGA TGATGGAGAC GATCCTGAAG AAAGAGAAGA AAAGCAQAAA 660 

GATCTGGAGG ATCACCGAGA TGATAAAGAA AGCCGCCCAC CTCGGAAATT TCCTTCTGAT 720 

AAAATTTTGG AGGCCATTTC CTCAATGTTT CCAGATAAGG GCACAGCAGA AGAACTAAAG 780 

QAAAAATATA AAGAACTCAC CGAACAGCAG CTCCCAGGCG CACTTCCTCC TGAATGTACC 840

CCCAACATAG ATGGACCAAA TGCTAAATCT GTTCAGAGAG AGCAAAGCTT ACACTCCTTT 900 

CATACOCTTT TCTGTAGGOG ATGTTTTAAA TATGACTGCT TCCTACATCC TTTTCATGCA 960 

ACACGCAACA CTTATAAGCG GAAGAACACA GAAACAGCTC TAGACAACAA ACCTTGTGGA 1020 

CCACAGTGTT ACCAGCATTT GGAGGGAGCA AAGGAGTTTG CTGCTGCTCT CACCSCTQAG 1080 

CGGATAAAGA CCCCACCAAA ACGTCCAGGA GGCCGCAGAA GAGGACGGCT TCCCAATAAC 1140 

AGTAGCAGGC CCAGCACCCC CACCATTAAT GTGCTGGAAT CAAAGGATAC AGACAGTGAT 1200 

AGGGAAGCAQ GOACTGAAAC GGGGGGAGAG AACAATGATA AAGAAGAAGA AGAGAAGAAA 1260 

GATGAAACTT CGAGCTCCTC TGAAGCAAAT TCTCG3TGTC AAACAOCAAT AAAGATGAAG 1320 

CCAAATATTG AAOCTCCTGA GAATOTGGAG TGGAGTGGTG CTGAAGCCTC AATGTTTAQA 1380

GTCCTCATTG GCACTTACTA TGACAATTTC TGTGCCATTG CTAGGTTAAT TGGGACCAAA 1440 

ACRTGTAOAC AGGTGTATGA GTTTAGAGTC AAAGAATCTA GCATCATAGC TOCTGCTCCJC 1500 

GCTGAGGATG TGGATACTCC TGCAAGGAAA AAQAAGAGGA AACACCGGTT GTGGGCTGCA 1560 

CACTGCBGAA AGATACAGCT GAAAAAGGAC GGCTCCTCTA ACCATGTTTA CAACTATCAA 1620 

CCCTGTGATC ATCCACGGCA GCCTTGTGAC AGTTCGTGCC CTTGTGTGAT AGCACAAAAT 1680 

TTTTGTGAAA AGTTTTGTCA ATGlAGTTCA GAGTGTCAAA ACQGCMTTCC GGGATQCCGC 1740 

TGCAAAGCAC AGTBCAACAC CAAGCAGTGC GCGTGCTAOC TGGCTGTCCG AGAGTGTGAC 1800 

CCTQACdCT GTCTTACTTG TGGAGCCGCT GACCATTGGG ACAGTAAAAA T3TGTCCIGC 1860 

AAGAACTGCA GIATTCAGCG GGOCTCCAAA AAGCATCTAT TGCTGGCACC ATCTGACGTG 1920 

GCAGGCTGGG GGATTTTTAT CAAAGATCCT GTGCAGAAAA ATGAATTCAT CTCAGAATAC 1980 

TGTGQAGAGA TTATTTCTCA AOATGAAGGT GACAGAAQAG GGAAAGTGXA TGATAAATAC 2040 

ATGTGCAGCT TTCTGTTCAA CTTGAACAAT QATZXTGIGG TGGATGCAAC CCGCAAGGGT 2100 

AACAAAATTC GTTTTGCAAA TCATTCGGTA AATCCAAACT GCTATGGAAA AGTTATGATG 2160 

GTTAACGGTQ ATCACAGGAT AGGTATTTTT GCCAAGAGAG CCATCCAGAC TGGCGAAGAG 2220 

CTGTTTGTTG ATTACAGATA CAGCCAGGCT GATGCCCTGA AGTATGTCGG CATCGAAAGA 2280 

GAAATGGAAA TCCCTTGACA TCTGCTACCT CCTCCCCCTC CTCTGAAACA GCTGCCTTAG 2340 

CTTCAGGAAC CTCGAGTACT GTGGGCAATT TAGAAAAAGA ACATGCAGTT TGAAATTCTG 2400

AATTTGCAAA GTACTGT7AAG AATAATTTAT AGTAATGAGT TTAAAAATCA ACTTTTTATT 2460 

GGCTTCTCAC CAGCTGCAAA GTGTTTTGTA CCAGTGAATT TTTGCAATAA TGCAGTATGG 2520
TACATTTCTC AACTTTGAAT AAAGAATACT TQAACTTGAA AAAAAAAAAA AAAAAA 

Seq ID NO: B76 Protein sequence
Protein Accession #: NP_004447

1 11 21 31 41 51

| | | | | |

MGQTGKKSEK GPVCWRKKVK RFRRADBVKS MPeSNRQKIIi ERTEIUJQEW 60 

KQRRIQFVHI LTSVSSLRGT EBCSVTSDLD FPTQVIPLKT MIAVASVPtM YSWSPLQONF 120 

MVBDETVLHW IPYMGDEWIJD QDGTFIEETjI KKYDGKVHGD RECGFINDEI FVHLVNALGQ 180 

YNDDDDDDDG DDPEERBEKQ KDIiEDHHDDK ESRPPRKFPS DKlLEAISSM FPDKGTAEEI. 240 

KKKYKELTEQ QLPGALPPEC TPNIDGPNAK SVQRBO/SliHS FHTLFCHRCF KYDCFLHPFH 300

ATPKTYKRKBT TETAUTOKFC GPQCYOjHIiBG AKEFAAAIiTA ERIKTPPKRP GGRRRGRLPN 360 
K8SRPSTPTS
KPNIEPPENV 
PAEDVDTPPR 
KPCEKFCQC3 
CKNCSIORGS 
YHCgFLFNIiBT 
ELFVDlCRYSQ 



NVI/ESKDTDS 



KXKRKHRlrWA 
5EOQNHFE>GC 
KKELIrLAPSD 
NDFWDATKK 
ADALKYVGIE 



DREAGTETGG 
RVLIGTYYDN 
AHCRKIQUCK 
RCKAQCNTKQ 
VAGWGIFIKD 



EKNDKEEEEX KDETSSSSEA NSRCQTPIKM 
FCAIARIxIOT KTCRQVYEFR VKESSIIAFA 
DGSSNHVYNY QPCDHPRQPC D5SCPCVIAQ 
CPCYTAVRBC DFDLiCXjTCGA ADHWDSKNVS 
FVQKNEFISS YCGEIISQDE ADRRGKVYDK 
VNPNCYAKVM MVNGDHRIGI FAKRAIQTGE 



420 
480
540 
600 
660 
720 



REHJSIP 



Seq ID NO: B77 DNA sequence
Nucleic Acid Accession #: NM_007015
Coding sequence: 2..1005



ATnACA.OA.GA 
TGCAGCCCCC 
AAGGTGGGAG 
GCCTTCTACT 
ATCAATOGGA 
TTTAAAATGG 
ACAGGAATTC 
ATTCCTGAGG 
ATGCCAGTCA 
GACAACAGCT 
CTTAAACCAA 
GTTOCAACTA 
CTGAATAATG 
CCTTATCATC 
ATCTGTTGTA 
GGGGQCTATT 
ATGCCATGTA 
CAGGTGCTGT 
ATGCTGATGG 
TAACAGAATT 
AGTTCAAGTC 
CATAAGTCrX 
AGTTTGCC 



11 
I 

ACTCCGACAA 
CGGCOTACGC 
CCGTGGTCCT 
TCTGGAAGGG 
AACTACAAGA 
GAAGTGGAGC 
GTTTTGCTGG 
TGGGCGCCGT 
AATATGAAOA 
TCTTGAGTTC 
CCTATCCAAA 
CCACAAAAAG 
AAACCAGACC 
AGCAGGAAGG 
TAG AAT GEAG 
ACCCATGGCC 
OCTGGTGGGT 
AAAATAAGAA 
GACCATAAAA 
TTTTTAATCQ 
TAAAATGCCA 
OCCTTGCTTG 



21 
\ 

AGTTCCCATT 
TACGCTGACG 
CATTTCGGGA 
GAGCOACAQT 
TGGGTCAATG 
TGAAGAAGCA 
AGGAGAGAAG 
GACCAAACAG 
AAATTCTCTT 
TAAGGTGTTA 
AGAAATCCAG 
ACCACACAGT 
CAGTGTTCAA 
GGAAAGCATG 
GCGGAGCTAC 
TXAlAATTAT 
GGCCCGTATC 
CTAGCTGAAQ 
TATTTTTACA 
TTTTCCAGAA 
TAACC C CGTT 
CATCTTCCAA 



31 
I 

GCCCTGGTQG 
GTGAAGCCCT 
GCTGTQCTGC 
CACATTTACA 
GAAATAQACG 
ATTGCAGTTA 
TGCTACATTA 
AGCATCICCT 
ATCTGGGTGG 
GAACTCTGCG 
AGQGAAAGAA 
GGACCACGGA 
GAGGACTCAC 
ACATTCOACC 
ACCCACTGCC 
CAAGGGIGCC 
TTGGGCATGG 
AQACAACCAA 
CGCAGCCTGA 
CTTTAGTATA 
ATTTGTTATT 
AGCTATTTCG 



41 
I 

GACCTGATGA 
CCAGCCCGGC 
TGCTCTTTGG 
ATQTCCATTA 
CTGGGAACAA 
ATGATTTCCA 
AAGCGCAAGT 
CCAAACTGGA 
CTGTAGATCA 
GTGACCTTCC 
GAGAAGTGGT 
GCAACCCAGG 
AAGCCTTCAA 
CTAGACTGGA 
AGAAGATCTG 
GTTCGGCCTG 
TGTGAAATCA 
AGAAGCATTA 
GOGGTTATTC 
TGCAAATGCA 
TTTTATTTGC 
AAATAAACAC 




GGCCATCGGG 
CACCATGAGT 
CTTGGAGAOC 
GAATGGCATC 
GAAGQCTCGT 
AGGCAAGATC 
GCCTGTGAAG 
TATTTTCTGG 
AAGAAAAATT 
CGCTGGAAGA 
TCCTGATAAT 
TCACGAAGGA 
TGAACCCCTG 
CAGAGTCATC 
CTTCATATAT 
AGGCAGGTTG 
TTGACACTCT 
CTGAAAGGGT 
ATTGATTTGC 
QAAAATTTAC 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080
1140 
1200 
1260 
1320 



Seq ID NO: B78 Protein sequence
Protein Accession #: NP_006946

1 11 21 31 41 51

| | | | | |

MTEETfiDKVPI ALVGPDDVEF CSPPAYATLT WPSSPARLIi KVGAWIjISG AVLLLFGAIG 
AFYPWKGSDS HIYNVHYTMS IKGKLQDGSM EIDAGNNLET FKMGSQAEEA IAVHDFQNGI 
TGIRFAGGEK CYIKAQVKAR IPEVGAVTKQ SISSKLEGKI MPVKYEENSIi IWVAVDQPVK 
DMfiFLSSKVIi HLCGDLPIFW LKPTYPKEIQ RERREWRKE VPTTTKRPHS GPRSNPGAGR 
IiNMETRPSVQ EDSQAFNPDN PYHQQEGESM TFDPRLDHEG ICCIECRRSY THCQKICEPL 
GGYYPWPYNY QOCSSACRVI MPCSWWVARI LGMV 

Seq ID NO: B79 DNA sequence
Nucleic Acid Accession #: NM_012449.1
Coding sequence: 66..1085



1 
I 

COGAGACTCA 
AATTAATGGA 
GGAQAAATTT 
AAAGACCTOT 
CAGAACTTCA 
CTATTATAGC 
CAACTTCCCA 
CAATGGTTTC 
TCCAACTTCA 
TAACAAGAAA 
GTCTGTC3TA 
AGGTCCAACA 
ATGTGTCTCT 
CATCTGTGAG 



ATATAAAACA 
TTGXTGTCCT 
AGATTAGACA 
TGTAGAATTA 
TCAAGTTTGT 



11 
I 

CGGTCAAGCT 
AAGCAGAAAA 
AGAAGAAGAC 
GCTTTTGCAT 

gcacacacag 

ATCTCTGACT 
TCAACAATAT 
CATCACTCTC 
TAATGGAACC 
GCAGTTTGGG 
CCCAATGAGG 
AAATAAAQAA 
GQGAATTGTG 
TGACTCTTTG 
TCTACTGGGC 
ATTTGTATGG 
GAT ATT T AAA 
TGGTTGGGAA 
CTGTTTACAC 
ATTTGT TAAT 



21 
I 

AAGGCGAAGA 
GACATCACAA 
GATTATTTGC 
TTGCACCAAA 
GAACTCTTTC 
TTTCTTTACA 
TTTTATAAAA 
TTGGCATTGG 
AAGTATAAGA 
CTTCTCAGTT 
CGATCCTACA 
GATGCCTGGA 
GGATTGGCAA 
ACATGGAGAG 
ACAATACACG 
TATACACCTC 
AGCATACTAT 
GACGTCACGA 
ACATTTTTGT 
AAAATOATTA 



31 
I 

GTGGGTGX3CT 
ACCAAGAAGA 
ATAAGGACAC 
CAGCCCATGC 



41 
I 

GAAGCCATAC 
ACTTTGGAAA 
GGGAGAGACC 
TGATGAATTT 



CTCTTCTGAG 
TTCCAATCCT 
TTTACCTGCC 
AGTXTCCACA 
TCTTTTTTGC 
GATACAAGTT 
TTGAGCATGA 
TACTGGCTCT 
AATTTCACTA 

cattgatttt 

CAACTTTTAT 
TCCTGCCATG 
AAATTAACAA 
TCAATATTGA 
TTCAAGGAAA 



GGAAGTAATT 
GGTCATCAAC 
AGGTGTGATA 
TTGGTTGGAT 
TGTACTGCAT 
GCTAAACTGG 
TGTTTGGAGA 
GTTGGCTGTG 
TATPCAGAGC 
TGCCTGGAAT 
GATAGCTGTT 
CTTGAGGAAG 
AACTGAGATA 
TATATTTTAT 
AAAAAAAAAA 



60 
120 
180 
240 
300 



51 
I 

TATITTATAG 
ATGAAGCCTA 
AGCATGCTAA 
GACTGCCCTT 
AAAATAGCTG 
CACCCTTTAG 
AAAGTCTTGC 
GCAGCAATTG 
AAGTGGATGT 
GCAATTTATA 
GCATATCARd 
ATGGAGATTT 
ACATCTATTC 
AAGCTAGGAA 
AAGTGGATAG 
TTCCTTCCAA 
AAGATACTGA 
TGTTCCCAGT 
CACCAACATT 



60 
120 
180
240 
300 
360 
420 
480 
540
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NO: B80 Protein sequence 
Protein Accession #: NP_036581.1

1 11 21 31 41 51 

| | | | | |

MESRKDITNQ EBLWKMKFRR NI1EEDDYI1HK DTGKTSMLKR P\TLi LHL-HQT A HADBFDCPSE 60
IiQHTQEX»FPQ HHIiPlKIAAI IASLTCFIiYPL laREVIHPTtAT SEQQYFYKXP ILVIKKVLPK 120



WO 03/042661 



VSITLLALVY LPGVXAAIVQ LHNOTKYKKP PHWLDKWMLT RKQPGLLSFF FAVLHAIY8L 180

SYFMRRSYRY KXLNWAYQQV aQNKEDASflE HDVWRMBIYV SLGIVGLAIL ALLAVrgiFB 240 

VSDSLTMEB? HYIQSKLGIV SU^LGTIBAL IFAWNKWIDI KQFVWYTPPT FM LAV? LP XV 300 
VLIFKSILFL PCLRKKILKI RHGWSDVTKI NKTBICSQL 

Seq ID NO: B81 DNA sequence
Nucleic Acid Accession #: NM_000684
Coding sequence: 87..1520

1 11 21 31 41 51

| | | | | |

TGCTACCCGC GCCCGQGCTT CTGGGGTGTT OCCCAACCAC GGCCCAGCCC TGCCACACCC 60 

CCCGCCCCCG GCCTCCGCAG CTCGGCATGG GCGCGGGGGT GCTCGTCCTG GGCGCCTCCG 120 

AGCOCGGTAA CCTGTCGTCG GCCGCACCGC TCCCCGACGG CGCGGCCACC GCGGCGCGGC 180 

TGCTGGTGCC CGCGTCGCCG CCCGCCTC6T TGCTGCCTCC OGCCAGCGAA AGCCCCGAGC 240 

CGCTGTCTCA GCAGTGGACA GCGGGCATGG GTCTGCTGAT GGCGCTCATC GTGCTGCTCA 300 

TCGTGGCOQG CAAT.GTGCTG GTGATCGTGG CCATCQCCAA GACGCCGCGG CTGCAGACGC 360 

TCACCAACCT CTTCATCATG TCOCTGGCCA GCGCCGACCT GGTCATGGGG CTGCTGGIGG 420 

TGCCGTTCGG GGCCACCATC GTGGTGTGGG GCCGCTGGGA GTACGGCTCC TTCTTCTGCG 480 

AGCTGTGGAC CTCAGTGGAC GTGCTQTGCG TGACGGCCAG CATCGAGAOC CTGTGTQTCA 540 

TTGCOCTGGA COGCTACCTC GCCATCACCT CGCOCTTCCG CTACCAGAGC CTGCTGACGC 600 

GCGCGCGGGC GCGGGGCCTC GTGTGCACCG TGTGGGCCAT CTCGGCCCTG GTGTCCTTCC 660 

TGCCCATCCT CATGCACTGG TGGCGGGCGG AOAGCGACGA GGCGCGCCGC TGCTACAACS3 720

ACCCCAAGTG CTGCGACTTC GTCACCAACC GGGCCTACGC CATCGCCTCG TCCGTAGTCT 780

CCTTCTACGT GCCCCTGTGC ATCATGGCCT TCGTGTAGCT GCOGGTGTTC OGCGAGGCCC 840 

AGAAGCACGT GAAGAAGATC GACAGCTGCG AGCaCCGTTT CCTCGGCGGC CCAGCGCGGC 900

CGCOCTCGCC CTCGCCCTCG CCCGTCCCGG CGCCCGCGCC QCCGCCCGGA CCCCCGCGCC 960 

OCGCCGCCGC CX3CCGCCACG GCCCCGCTGG qCAACGGGCG TGCGGGTAAG CGGCGGCCCT 1020 

CGCGCCTCGT GGCCCTACGC GAGCAOAAGG CGCTCAAflAC GCTGGGCATC ATCRTGGGCG 1080 

TCTTCACaCT CTGCTGGCTG CCCTTCTTCC TGGCCAACGT GGTGAAGGCC TTCCACCOCG 1140 

AGCTGGTGCC CQACCGCXTTC TTCGTCTTCT TCAACTGGCT GGGCTACGCC AACTCGGCCT 1200 

TCAACCCCAT CATCTACTGC OGCAGCCCCG ACITCGGCAA QGCCTXCCAG GGACTGCTCT 1260 

GCTGCGCGCG CAGQGCTGCC CGCCGOCGCC ACGCGACCCA CGGAGACCGG COGCGCGCCT 1320 

CGGGCTGTCr GGCCOGGCCC GGACCCCCGC CATOGOCCGG GGCOGCCTCG GACGACGACG 1380 

AOGAOGATGT CGTCGGGGCC ACGCCGCCCG CGCGCCTGCT GGAGCCCTGG GCCGGCTGCA 1440 

ACGGCX3GGGC GGCGGCGGAC AGCGACTCGA GCCTGGACGA GCCGTOCCGC CCCGGdTCG 1500 

CCTCGGAATC CAAGGTGTAG GGCCCGGCGC GGGGCGCGGA CTCCGGGCAC GGCTTCCCAG 1560 

GGGAAOGAGG AGATCT3TGT TTACTTAAQA OCGATAGCAG GTGAACTCGA AGCCCACAAT 1620 

CCTCGTCTQA ATCATCCGAG GCAAAGAGAA AAGCCACGGA OCGTTGCACA AAAAGGAAAG 1680
TTTOGGAAGG GA7GG3AGAG TGGCTTGCTG ATGTTCCTTG TTO 

Seq ID NO: B82 Protein sequence
Protein Accession #: NP_000675.1

1 11 21 31 41 51

| | | | | |

MGAGVLVLOA SEPGNLSSAA PLPDOAATAA RLkVFASPPA SULPPASESP KPLSQQttTAG 60 

MBIiIMAlflVL LIVAGNVLVX VAIAKTPRJiQ TLTZSILFIMSI. ASADLVMSLL WPFGATIW 120 

WGRWEYG3FF CELWTBVDVX, CVTASIBTLC VIAlDRYliAl TfiPFRYQSTJa THARAIBSLVC 180 

TVWAISALVS FXiPXlKEWHR AEBDEA33CY NDBXCCDFVT NRAYAlASSV VSFYVPLCIM 240 

AFVYXrRVFRE AQKQVKKIDS CERRFLGGPA RPPSPSPfiPV PAPAPPPGPP RPAAAAATAP 300 

LANGRAGKRR PSRIiVALREQ KALKTtfillM GVFTLCWUPF FIiAKWKAFH RFL,VPDEX^V 360 

FFHHLGYAKS AFNPIIYCR3 PIXFRKAFQGX* LCCARRAARR RHATHGDRPR ASGCLARPGP 420 
PPSPGAASDD DDDDWGATP PARLLEFWAG CNGGAAADSD SGIiDEPCRPG FASE3KV 

Seq ID NO: B83 DNA sequence

Nucleic Acid Accession #: NM_000729.2

Coding sequence: 2-. 421 

1 11 21 31 41 51 

| | | | | |

GGCTGAGCTG CCGGGCTGCT CCGGTTGGAA ACOCCAAGOC AGCTGOOGTC CTAATCCAAA 60 

AGOCATGAAC AQCGGCGTGT GCCTC5TGCGT GCTGATGGCG GTACTGGCGG CTOGCGCCCT 120 

GACGCAGCCG GTGCCTCCCG CAGATCCCX5C GGGCTCCGGG CXGCAGCGGG CAGAGGAGGG 180 

GCCCCGTAGQ CAGCTGAGGG TATCGCAGAG AAOGGATGGC GAGTCCCGA0 CQCACCTGGG 240 

CGCOCIGCTG GCAAGATACA TCCAGCAGGC CCGGAAAGCT CCTTCTGGAC GAATGTCCAT 300 

OQTTAAGAAC CTGCAGAACC TGGACCCCAG CCACAGGATA AGTGACCGGG ACTACATGGG 360 

CTGOA1GGAT TTTGGCCGXC GCAGTGCCGA GGAGTATGAG TACCCCXCCT AGAGGACCCA 420 

GCCGCCATCA GCCCAACGGA AGCAACCTCC CAACCCAGAG GAGGCAGAAT AAGACAACAA 480 

TCACACTCAT AACTCATTOr CTGTGGAGTT TGACATTGAA TGTATCTATT TATTAAGTTC 540

TCAATGTGAA AATTGTGTCT GTAAGATTGT CCAGTGCAAC CACACACGCT CAOCAGAAGT 600 

TGTGGAAACT GAAGACAAAA CTGTTTTCTT CATCTGTGAC TCCTGTTCTG AAAATGTTGT 660 
TATGCTATTA AAGTGATTTC ATTCTGCC 

Seq ID NO: B84 Protein sequence 
Protein Accession #: NP_000720.1

1 11 21 31 41 51

| | | | | |

MNSGVCLCVL MAVLAAGALT QPVPPADPAG SGLQRAEBAP RRQLRV6QRT DGESRAHLGA
LTiARYIQQBR KAPSORMSIV KNLQNLDPSH RISBRDYMGW HDFORRSAEE YEYPS






PCT/US02/36810 



TABLE 74 Gene Sequences Up- or Down-Regulated in Cancer
Angiogenesis






A1 DNA SEQUENCE
Gene name: endothelial differentiation, sphingolipid G-protein-coupled receptor, 1
Unigene number: Hs.154210
Probeset Accession #: M31210
Nucleic Acid Accession #: M31210
Coding sequence: 251-1396



TCTAAAGGTC GGGGGCAGCA GCAAGATGCG AAGCGAGCCG TACAGATCCC GGGCTCTCCG 60 

AACGCAACTT CGCCCTGCTT GAGCGAGGCT GOGGTTTCCG AGGCCCTCTC CAOCCAAGGA 120 

AAAGCTACAC AAAAAGCCTG GATCACTCAT CGAACCACCC CTGAAGCCAG TGAAGGCTCT 180

CTCGOCTOGC CCTCTAGCGT TCGTCTGGAG TAGCGCCACC CCGGCTTCCT GGGQACACAG 240 

GGTTGGCACC ATGGGGCCCA CCAGCGTCCC GCTGGTCAAG GCCCADCGCA GCTCGGTCTC 300 

TGACTAOGTC AACTATGATA TCATCGTCCG GCATTACAAC TACACGGGAA AGCTGAATAT 360 

CAGCGCGGAC AAGGAGAACA GCATTAAACT GACCTCGGTG GTGTTCATTC TCATCTGCTG 420 

CTTTATCATC CTGGAGAACA TCTTTGTCTT GCTGACCATT TGGAAAACCA AGAAATTCCA 480

CCGACGCAXG TACTATTTTA TTGGCAATCT GGCCCTCTCA GAOCTGTTGG CAGGAGTAGC 540

CTACACAGCT AACCTGCTCT TGTCTGGGGC CACCACCTAC AAGCTCACTC CCGCCCAGTG 600 

GTTTCTGCGG GAAGGGAGTA TGTTTGTGGC CCTGTCAOCC TCCGTGTTCA GTCTCCTCGC 660 

CATCGCCATT GAGCGCTATA TCACAATGCT GAAAATGAAA CTCCACAACG GOAGCAATAA 720 

CTTCCGCCTC TTCCTGCTAA TCAGCGCCTG CTGGGTCATC TCCCTCATCC TGGGTGGCCT 780 

GOCTATCATG GGCTGGAACT GCATCAGTGC GCTGTCCAGC TGCTCCACCG TGCTGCCGCT 840

CTACCACAAO CACTATATCC TCTTCTGCAC CACGGTCTTC ACTCTGCTTC TGCTCTCCAT 900 

CGTCATTCTG TACTGCAOAA TCTACTCCTT GGTCAGGACT CGGAGCCGCC GCCTGAOQTT 960 

CCGCAAGAAC ATTTCCAAGG CCAGCCGCAG CTCTGAGAAT GTGGCGCTGC TCAAGAOCGT 1020 

AATTATCGTC CTGAGCGTCT TCATCQCCTG CTGGGCACCG CTCTTCATCC TGCTCCTGCT 1080 

OGATGTGGGC TGCAAGGTGA AOACCTGTGA CATCCTCTTC AGAGCGGAGT ACTTCCXGGT 1140 

GTTAGCTGTG CTCAACTCCG GCAOCAACCC CATCATTTAC ACTCTGACCA ACAAGGAGAT 1200

GCGTCGGGCC TTCATCCGGA TCATGxCCTG CTGCAAGTGC CCGAGCGGAG ACTCTGCTGG 1260 

CAAATTCAAG CGACCCATCA TCGCCQQCAT GGAATTCAGC OGCAGCAAAT CGGACAATTC 1320 

CTCCCACCCC CAGAAAGACG AAGGGGACAA CCCAGAGACC ATTATGTCTT CTGGAAACGT 1380 

CAACTCTTCT TCCTAGAACT GGAAGCTGTC CAOCCACCGG ARGCGCTCTT TACTTGGTCG 1440 

CTGGCCACCC CAGTGTTTGG AAAAAAATCT CTGGGCTTCG ACIGCTGCCA GGGAGGAGCT 1500

GCTGCAAGCC AGAGGGAGGA AGGGGGAGAA TACGAACAGC CTGGTGGTGT CGGGTGTTGG 1560 

TGGGTAGAGT TAGTTCCTGT GAACAATGCA CTGGGAAGGG TGGAGATCAG GTCCOGGOCT 1620 

GGAATATATA TTCTACCCCC CTGGAQCTTT GATTTTDCAC TGAGCCAAAG GTCTAGCATT 1680 

GTCAAGCTCC TAAAGGGTTC ATTXGGCCCC TCCTCAAAGA CTAATGTCCC CATGTGAAAG 1740 

CGTCTCTTTG TCTGGAGCTT TGAGGAGATG TTTTCCTTCA CTTTAGTTTC AAACCCAAGT 1800

GAGTGTGTGC ACXTCTGCTT CTTTAGGGAT GCCCTGTACA TOCCACACCC CACOCTCCCT 1860 

TCCCTTCADCA CCCCTCCTCA ACGTTCTTTT ACTTTATACT TTAACTACCT GAGAGTTATC 1920 

AGAGCTGGGG TTGTGGAATG ATCGATCATC TATAGCAAAT AGGCTATGTT GAGTACGTAG 1980 

GCTGTQGGAA GATGAAGATG GTTTGGAGGT GTAAAACAAT GTCCTTCGCT QAGGCCAAAG 2040 

TTTCCATGTA AOCGGGATCC GTTTTTTGGA ATTTGGTTGA AGTCACTTTG ATTTCTTTAA 2100 

AAAACATCTT TTCAATGAAA TGTGTTAOCA TTTCATATOC ATTGAAGCG3 AAATCTGCAT 2160 

AASGAAGCCC ACTTTATCTA AATGATATTA GCCAGGATCC TTGGTGTCCT AGGAGAAACA 2220 

GACAAGCAAA ACAAAGTGAA AACCGAATGG ATTAACTTTT GCAAACCAAG GGAGATTTCT 2280 

TAGCAAATGA GTCTAACAAA TATGACATCC GTCTTTCCCA CTTTTGTTGA TGTTTATTTC 2340 

AGAATCTTGT GTGATTCATT TCAAGCAACA ACATGTTGTA TTTTGTTGTO TTAAAAGTAC 2400 

TTTTCTTGAT TTTTGAATGT ATTTGTTTCA GGAAGAAGTC ATTTTATGGA TTTTTCTAAC 2460 

CCGTGTTAAC TTTTCTAGAA TCCACCCTCT TGTGCCCTTA AGCATTACTT TAACTGGTAG 2520 

GGAAC3CCAG AACTTTTAAG TCCAGCTATT CATTAGATAG TAATTGAAGA TATGTATAAA 2580 

TATTACAAAG AATAAAAATA TATTACTGTC TCTTTAGTAT GGTTTTCAGT GCAATTAAAC 2640 

CGAGAGATGr CTTGTTTTTT TAAAAAGAAT AGTATTTAAT AGGTTTCTGA CTTTTGTGGA 2700 
TCATTTTGCA CATAGCTTTA TCAACTTTTA AACATTAATA AACTGATTTT TTTAAAG 



A2 Protein sequence
Gene name: endothelial differentiation, sphingolipid G-protein-coupled receptor, 1
Unigene number: Hs.154210
Probeset Accession #: M31210
Protein Accession #: AAA52336
Signal sequence: none found
Transmembrane domains: 50-71, 92-110, 122-140, 160-177, 202-222, 251-269, 283-301
Cellular Localization: plasma membrane



1 11 21 31 41 51

| | | | | |

MGPTSVPLVK AHRSSVSDYV NYD11TOHYN YTGKLHXSAD KENBIKLTSV VFILJCCFII 60 

IiEMIFVLLTI WKTKKFHEPM YYFIGJJLAliS DLLAGVAVTA NLLLSGATTY KLTPAQWFLR 120 

EGSMFVALSA SVFSItLAlAI ERYITMLKMK LHNG8NKFHL FLLISACWVI BLILGGLPIH 180

GWHCISALSS CSTVLPXiYHK HYILFCTTVF TIiItLLSIVTIj YCRXYSliVRT R3RRIjTP51KN 240

1SKASRSSBH VAUiKTVlXV LSVFIACWAP LFIliU^DVG CKVKTCDIliF HAEYFLVLAV 300 

LNSGTNPIIY TLTHKEHRSA PIRIMSCCKC PSGDSAGKFK RPIIAGMEFS RSKSUNSSHP 360 
QKDKGDNPET IMSSGDJVNSS 3 

A3 DNA SEQUENCE
Gene name: G protein-coupled receptor 51
Unigene number: Hs.198612
Probeset Accession #: AA452928
Nucleic Acid Accession #: NM_004624.1
Coding sequence: 1-2826 (underlined sequences correspond to start and stop codons)






1 11 21 31 41 51

| | | | | |

ATQQCTTCCC CGCQGAGGTC CGGGCAGCCA GGGGGGCCGC OGCCGCCGCC ACCGCCGCCC 60

GCGCGCCTGC TACTGCTACT GCTGCTGCCG CTGCTGCTGC CTCTGCCGCC CGGGGCCTGG 120 

GGCTGGGCGC GGGGCGCCCC CCGGCCGCCG CCCAGCAGCC CGCCGCTCTC CATCATGGGC 180 

CTCATGCCGC TCACCAAGGA GGTGGCCAAG GGCAGCATCG GGCGCGGTGT GCTCCCOGCC 240 

GTGGAACTGG CCATCGAGCA GATGCGCAAC GAGTCACTCC TGCGCCCCTA CTTCCTCGAC 300 

CTGCGGCTCT ATGACACGGA GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTACGATGCA 360 

ATAAAATACG GGCCGAAOCA CTTGATQGTG TTTGGAGGOG TCTGTCCATC CGTCACATCC 420 

ATCATTGCAG AQTCCGTCCA AGGCTGGAAT CTGGTGCAGC TTTCITTTGC TGCAACCACG 480 

CCTGTTCTAG CCGATAAGAA AAAATACCCT TATTTCTTTC GGACOGTCCC ATCAGACAAT 540 

QOGGTGAATC CAGCCATTCT GAAGTTGCTC AAGCACTACC AGTGGAAGCG CGTGGGCACG 600

CTGACGCflAG ACGTTCAGAG GTTCTCTGAG GTGCGGAATG AOCTGACTGG AGTTCTGTAT 660 

GGCGAGGACA TTGAGATTTC AGACACCGAG AGCTTCTCCA ACGATOCCTG TACCAGTGTC 720 

AAAAAGCTGA AGGGGAATGA TGTGCGGATC ATCCTTOGCC AGTTTGACCA GAATATGGCA 780 

GCAAAAGTGT TCTGTTGTGC ATACGAGGAO AACATGTATG GTAGTAAATA TCAGTGGATC 840 

ATTCCGGGCr GGTACGAGCC TTCTTGGTGG GAGCAGGTGC ACACGGAAGC CAACTCATCC 900 

CGCTGCCTCC GGAAGAATCT GCTCGCTGCC ATGGAGGGCT ACATTGGCGT GGATTTCGAG 960 

CCCCTGAGCT CCAAGCAGAT CAAGACCATC TCAGGAAftGA CTCCACAGCA GTATGAGAGA 1020 

GAGTACAACA ACAAGCGGTC AGGCGTGGGG CCCAGCAAGT TCCACGGGTA CGOCTACBAT 1080 

GGCATCTGGG TCATCGCCAA GACACTGCAG AGGGCCATGG AGACACTGCA TQCCAGCAGC 1140 

CSGCACCAGC GGATCCAGGA CTTCAACTAC AOGGACCACA CGCTGGGCAG GATCATCCTC 1200 

AATGCCATGA ACGAGACCAA CTTCTTCGGG GTCAOGGGTC AAGTTGTATT COGGAATGGG 1260 

GAGAGAATGG GGAOCATTAA ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAGGTGGGA 1320 

GAGTACAAOG CTGTGGCCGA CACACTGGAG ATCATCAATG ACACCATCAG GTTCCAAGGA 1380 

TCCGAACCAC CAAAAGACAA GACCATCATC CTGGAGCAGC TGCGGAAGAT CTCCCTACCT 1440 

CTCTACAGCA TCCTCTCTGC CCTCACCATC CTCGGGATGA TCATGGCCAG TGCTTTTCTC 1500 

TTCTTCAACA TCAAGAACCG GAATCAGAAG CTCATAAAGA TGTCGAGTCC ATACATGAAC 1560 

AACCTTATCA TCCTTGGAGG OATGCTCTCC TATGCTTCCA TATTTCTCTT TGGCCTTGAT 1620 

GGATCCTTTG TCTCTQAAAA GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTQGATTCTC 1680 

ACCGtGGGCT ACACGACCGC TTTTGGGGCC AXGTTTGCAA AGACCTGGAG AGTOCAC3GCC 1740 

ATCTTCAAAA ATGTGAAAAT GAAGAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC 1800

GTGGGGGGCA TGCTGCTGAT OGAOCTGTGT ATCCTGATCT GCTGGCAGGC TGTGGACCCC 1860 

CTGC5GAAGGA CAGTGGAGAA GTACAGCATG GAGC3GGGACC CAGCAGGAOG GGATATCTCC 1920 

ATCOGCCCTC TCCTGGAGCA CTGTGAGAAC ACCCATATGA CCATCTGGCT TGGCATCGTC 1980 

TATQCCTACA AGGGACTTCT CATGTTGTTC GGTTGTTTCT TAGCTTGGGA GACCCGCAAC 2040 

GTCAGCATCC CCGCACTCAA CGACAGCAAG TACATCGGGA TGAGTGTCTA CAACGTGGGG 2100 

ATCATGTGCA TCATCGGGGC CGCTGTCTCC TTCCTGACCC GGGACCAGCC CAATGTGCAG 2160 

TTCTGCATCG TGGCTCTGGT CATCATCTTC TGCAGCAOCA TCACCCTCTG CCTGGTATTC 2220 

GTGCCGAAGd TCATCACCCT GAGAACAAAC CCAGATGCAG CAACQCAGAA GAGGCGATTC 2280 

CAGTTCACrC AGAATCAGAA GAAAGAAGAT TCTAAAACGT CCAOCTCGGT CACCAGTGTG 2340 

AACCAAGCCA GCACRTCCCG OCTGGAGGGC CTACAGTCAG AAAACCATCG CCTGCGAATG 2400 

AAGATCAC&G AGCTGGATAA AGACTTGGAA GAGGTCACCA TGCAGCTGCA GGACACACCA 2460 

GAAAAGACCA CCTACATTAA ACAGAACCAC TACCAAGAGC TCAATGACAT OCTCAACCTG 2520 

GGAAACTTCA CTGAGAGCAC AGATGGAGGA AAGGCCATTT TAAAAAATCA CCTCGATCAA 2580

AATCCCCAGC TACAGTGGAA CACAACAGAG CCCTCTOGAA CATGCAAAGA TCCTATAGAA 2640 

GATATAAACT CTCCAGAACA CATCCAGOGT CX3GCTGTCCC TCCAGCTCCC CATCCTCCAC 2700 

CACQCCTACC TCCCATCCAT GGGAGGCGTG GAOGCCAGCT GTGTCAGCOC CTGCGT(2AGC 2760 

CCCACOGCCA GCCCCOGCCA CAGACATGTG CCACCCTCCT TCCGAGTCAT GSTCTCGGGC 2820 
CTQTAA 



A4 Protein sequence
Gene name: G protein-coupled receptor 51
Unigene number: Hs.198612
Probeset Accession #: AA452928
Protein Accession #: NP_005449.1
Signal sequence: 1-42
Pfam domains: 7tm_3 [481-754], ANF_receptor [130-204]
Transmembrane domains: 22-44, 477-499, 517-539, 552-574, 595-617, 653-675, 692-713, 722-744
Cellular Localization: plasma membrane



1 

I 

MASPRRSGQP 
LMPIiTKEVAK 
IimGPNHLMV 
AVNPAIliKLL 
KKUOGNDVRI 
RCLRKNLLAA 
GIWVIAKTUJ 
BRMGTIKFTQ 
IiYSILSAIfTI 
GSFVSKKTFE 
VGGKI^IDIiC 
YAYKGLLMLF 
FCrVALVIXF 
NQASTSHLEG 



HAYb P 6 1 GOV 



11 
I 

GRPPPPPPPP 
GSXGRGVLPA 
FGGVCP8VTS 
KRYQHKHVGT 
ILGQFDQNMA 
MEGYIGVDFB 
RAMETLHASS 
FQDSREVKVG 
IiGMIMASAFL 
TLCTVRTWIIj 
ILICWQAVDP 
GCFLAWETJW 
CSTITLCLVF 
LQSEHHRLRM 
KAlliKNHLDQ 
BASCVSPCVfi 



21 
\ 

ARLIiIiLTjTjLP 
VELAIEQIRN 
IXAS5LQGHN 
LYQDVQRFBB 
AKVFCCAXEE 
PLSSKQIKTI 
RHQH2QDFNY 
EYNAVAD TIjK 
FFNIKNRNQK 
TVGYTTAFGA 
LHRTVEKYSK 
V51PAU9DSK 
VPKLlTIiRTM 
KITBIJDKDLE 
NPQLQMNTTE 
PTA3PRH33HV 



31 
I 

XfLLPLAPGAH 
ESLLRPYFLD 
LVQL8EAATT 
VRKDLTGVLY 
tJMYGSKYQWI 
SGKTPQQYKR 
TDHTLGH1IL 
IIKDTIRFQG 
LIKMS9PYMN 
MFAKTWRVHA 
EPDPAGKDIS 
YIQMSVYJTVG 
PDAATCWRRF 
EVTMQLQDTP 
PSRTCKDPIB 



41 . 
I 

GWARGAPRPP 
LRLYDTECDN' 
PVLADKKKYP 
GEDIE1 SETS 
IPGKYBPSWK 



51 
I 

PSSPPliSIKG- 
AKGLKAFYDA 
YFFRTVPSBW 



HAMHETNFFG 
SEPPKDKTII 
KLIIIX3GML6 
IFKNVKMKKK 
IRPTiT.F.HCEN 
IMCIIGAAVS 
QFTQdQKKED 
EKTTY1RQNH 
DIKSPBHIQR 



EQVHTEAN35 
PSKFHGYAYD 
VTGQfVVPHNG 
ItEQLRKISIiP 
YASZFIiPGLD 
IIKDOKLiVI 
THHTlMIiGTV 
PLTRDQPHVQ 
SKTSTSVT5V 
YQELNDHjNL 
HLBLQUim 



60 
120
180 
240 
300 
360 
420 
480 
540 
600 
660
720 
780 
840 
900 






A5 DNA SEQUENCE
Gene name: ESTs
Unigene number: Hs.293616
Probeset Accession #: AW043782
Nucleic Acid Accession #: none found
Coding sequence: 38-1075 (underlined sequence corresponds to start and stop codon)






1 11 21 31 41 51

| | | | | |

AGCAACGACG COGGGCAGCG GGAGCGGCGG CCGCGCCATG TGQCTGCTGG GGCGGCTGTG 60 

CCTGCTGCTG AGCAGCGCCG CGGAGAGCCA GCTGCTCCCC GGGAACAACT TCACCAATGA 120

GTGCAACATA CCAGGCAACT TCATGTGCAG CAATGGACGG TGCATCCCGG GCGCCTGGCA 180

GTGTGACGGG CTGCCTOACT GCTTQ3ACAA QAGTGATGAG AAGGAOTGCC CCAAGGCTAA 240 

GTCGAAATGT GGCCCAACCT TCTTCCCCTG TGCCAGCGGC ATCCATTGCA TCATTGGTCG 300 

CTTCCGGTGC AATGGGTTTG AGGACTGTCC COATGGCAGC GATGAAGAGA ACTGCACAGC 360 

AAACCCTCTG CTTTGCTCCA CCGCCCGCTA CCACTGCAAG AACGGCCTCT GTATTGACAA 420 

GAGCTTCATC TGCGATGGAC AGAATAACTG TCAAQACAAC AGTGATC3AGG AAAGCTGTGA 480 

AAGTTCTCAA GAACCCGGCA GTGGGCAGGT GTTTGTGACT *rCAGAGAACC AACTTGTGTA 540 

TTACCCCAGC ATCACCTATG CCATCATCGG CAGCTCOGTC ATTTTTGTGC TGGTGGTGGC 600 

CCTGCTGGCA CTGGTCTTGC ACCACCAGCG GAAGCGGAAC AACCTCATGA CGCTGCCCGT 660 

GCACCGGCXG CAGCACCCTG TGCTQCTGTC CCGCCTGGTG OTCCTGGACC ACCCCCACCA 720 

CTGCAACGTC ACCTACAACG TCAATAATGG CATCCAGTAT GTGGCCAGCC AGGCGGAGCA 780

GAATGCGTCG GAAGTAGGCT CCCCACCCTC CTACTCOBAG GCCTTGCTGG ACCAGAGGCC 840 

TGCQTQOTAT QACCTTCCTC CACCGCCCTA CTCTTCTGAC ACGGAATCTC TGAACCAAGC 900 

CGACCTGCGC CCCTAOCGCT CCCGGTCCGG GAGTGCCAAC AGTGCCAGCT CCCAGGCAGC 960 

CAGCAGCCTC CTGAGCGTGG AAGACACCAG CCACAGCCCG GGGCAGCCTG GCCCCCAGGA 1020 

GGGCACTGCT GAGCCCAGGG ACTCTGAGCC CAGCCAGGGC ACTGAAGAAG TATAAGTCCC 1080

AGTTATTCCA AAQTCCATAT GGGTTAATCT GCTCTGACTT GTTGCCATTC TAACAATTTG 1140 

TGCTCATGGG AAGCTCTTTA AGCACCTGTA AGGATGTCTC AAGTTACAGT TTGGGATATT 1200 

AACTATCTCT GCATTCCCCT CCTCCCCCAG ACTTCAGAGA TGTTTTTCTG GCGTCTCAGT 1260 

TGACATGATC TGTTGTGCGT CTTTTCTGTC AGGTCACTCT TCCCTTGGGA CCCGAGATCA 1320 

CACCCTCATT TTTCACATTA TTCTGTTTCT GTTGGAGAQA CAGCATATAA AACAGTATTG 1380 

AAATAGGCTG GGAGAGAGCA ATGTTTCTGT GCTATATTGG ATGCTCAGAA GTGCAGGAGA 1440 

CGCTGGACCC AATTCTCTCT GCTGGGTAGT TACCTTATAG CATTTGGGGA TTTGGGTTAG 1500 

ATGATCTAAC CAGGAGGCCA TCACTGGATG GTCACCOCGC CAAAAAAATT CCATTTGAGG 1560

ATCAAAACCT GCTTTGCACA ATOCTATTTG ATOCCCCCAa TTCAGCAQAG TCAGTGGCCA 1620 

AAGAAAACTT TGGACGTGAG TAACACCCTT CAGCAGTCGC AACGTXATTT TGGTTTTGTG 1680 

AAGGACTCTG AAACCATCTA CCCTGTATAA ATTCTQGCTT TAGAAATTTG CCCAAGAATG 1740 

CTCATTCTGA GAGCTTTCCT GAGCAGCATA TATCATCAGC CTCATCCTAA AATABGCAOa 1800

GAGGCCCTCC CATGAGTTTA TCCAAGTTCT CAGCTCCTAA AATGCAGGCT GCCAAGACCC 1860

TACACCTGCC CTGGCTCTAC AGCCACTTAC CTGGTTTCTG GACTGTCACC CTCCCAGCTG 1920

ACCTGCOCGX AQCCAAGGAA TGAGGACCTA ACTTGAGTTG GCCCAAAGTC TGACCTGGCT 1980 

GTATGTCGCT GTGGCCCACA CCCAGOCTGT CTTGCTCATT CATGCAGCCT CAACACTGGC 2040 

CTCGAAAGTT CCCTTAACAC TTGCAAAGTC CXTTTTAGCT GTGCATTTGG ACTTGAGGAC 2100 

ACTGGTTTCT ATCACAGGTG AGAGCCATGT TCAATACCTC CAQCAAGCTC TCCTGGCTDC 2160 

CIGCACTGTG CACQCTCCTC TTCCCAAGGT CCCAATACCA GCACCTCTAG TTAGAGTTAG 2220 

GGTCAGGGTC AGGCCTCTCC CAACATCCCA GTAGTTTCTC CTCTGAGACA CATGGGGAAG 2280 

AGACAATTTG GAGTCAAGAT TTTCCATTTG GATCTATTTT AAATCTTTTA GAAATGCATT 2340

TGAAACAGTG TGTTTGTTTT TTCCCTTCTA GTIAAGGGAG TATTTATATG TGTATAGGAA 2400 

AGCTGTCTCT TTTTTTGTTT TTCCTTTAAC AAGGTCCAAA GAAAGATGCA AAAGGAGATC 2460 

ACACCCTTGC CCCGCTGAGC CCCGTGATAA CAAGTCACTC CAGACTAAOC TGTGTGCCAG 2520 

ACATTTGTGC ATTGTTGCAC TTTGAGGTTA TTATTTATCA AGTTCTTGAA GGAAGCAGAA 2580 

AGAGGGACTC CTCTCTCCCT CCGTGTATAG TCTCTATGTT TGTGCTAGTT TTTCTTTTTT 2640 

CCAGTGAGCC ACAGGGCCCG CCTCCCTOCA GGAATAAGGG GTAAAACGTT 2700 

TGGCAAGAAA CCACACTQAC TGATGAGGGG TAAAATGGAA CCAGGTAGAG 2760 

CCACTCCGGG CAGCTGTCAC CCATTCAGAA CTXCTTTCGG CAGCTGAAGA AATGTTCAGT 2820 

AACCTGTTTG ACGCTAATTA AAACAGAGCC TQCAGGAAGT GGGGCTAAAG TGGCATTCAG 2880 

TGATCCTGTT CTGTAGACTT TTCTTTCITT TTTTAACCAA ATCCAAAOGA TBTTACAGAA 2940 

AAGCTAGCCA CTGGTAT1TT GTTTTGTTTA AAAAAAAAAA QAAAGAAAGA AAGAAAGAAA 3000 

AACGGAAAOG AACCTAGCTG CCTGTATCTT TCATTTTTAA AATAGCAG1T GAGTTATTTT 3060 

CTGAGTAATC CAATAAAOAA CTTTTGATGA CAGCCAGAAT GTGTTAGAAC TCTGGCTGAA 3120 

CATTTCATCT CCTGTGAGTC AGAAGGGCTT TATTTCTCCC TTTGATGGGG CCCCTTCTTC 3180

TTTCTGGTGC TCTGGAAGTT GTTTAQAGGA AAOAATTCTA ATTTTAATTA ATTGCGCAGT 3240 

QAQTTAATCT CACTCGCTTT TCTGCTTCCA GGCATCTTAG GAAAAACAAA TGGTTTTAGT 3300 

AGATAAGGGA TGCCTACTAA TGCTTTTTTA AAACAAACAG GGACATTTTT ATTATAGATT 3360

TQATTTTTTT AATGAATGTT TTTAAAAATA TATAAATAGG ACACCAAAGC GGCAGGGTTT 3420 

TTTTTGGGGG GAGGGGGTTT OTTTTCCAAC TCAAGATGGC ACATTAGTGG CCAGCAATAT 3480 

TTTTTAACTC ATTCCAACCA GGAAGCTTTT TTATACATTG CCTAAATCTA CGCCAACCAG 3540 

AAAATAGTCT CATCTCTTTT TTTCTCAAAT GAOATCCGTG TTTTATTTTA GCATTAAATT 3600 

AGTTACACTG TGATGACTGG CCTATTACCT GACTCAGCTC CCTCTACCTT GAAATTGACA 3660 

TTTTTAAAAA ATGCAACTAA GTGGTTAATA GTGTGTGACG CTCAAAGTTA ATGTAAACTG 3720 

GAAAGGTTGT GTGTOGTTGC TTTTTGTGTT TTGGTTAGGC TTGGTTTTGT TTTTTAATTT 3780 

TTATACTTTC TAATAAATTT GCAGTTTCAT TCTTTCTGTT TGTGCAAAWG GWMCTAHARM 3840 

AAMMAAAAAC AWYKTTGGGG GGGCTTGGGC CTCGGAAAAA GTTTTTAACA CCACTTCGGG 3900 

TGGGGCGGCG GGGCCCACTT AGGXACGGCG ACCACGCGGG CCCAAACGGG ACCCCAGAAG 3960 

GAAACCCTGG CCAAGAAAAA GGTOGGGAGA ATTCTCCACA CCAGAAAAAA AOGCGCCGGG 4020 

GGAAACCGCA GAGTGTTGOG TAAACCACAC CCGAAGAGAG AACTCAGAAQ CACACAAGCG 4080 
GGACTCAACC AGGAGGACCC AAGGGAACCC GATAGAGTAC G 



A6 Protein sequence:
Gene name:






WO 03/042661 



PCT/US02/36810 



Unigene number: Hs.293616
Probeset Accession #: AH043782
Protein Accession #: none found
Signal sequence: 1-17
Transmembrane domains: 169-191
LDLa domains: 28-66, 70-108, 112-149
Cellular Localization: plasma membrane



HHIrL GVhCLttt LSSAAESQLL PGNWFTKECH IPGNFMCSNG RCIPOAWQCD GLPDCFDK9D 

EKECPXAKSK CGPTFFPCAS GIHCIIGRFR CBGFEDCPDG SDEEMCTANP LLCSTARYHC 

KNGLCIDKSF ICDGQNNCQD NS DEES CESS QEPGSGOVFV TSFJJQLVYYP SITYAIIGSS 

V1PVLWALL AZjVLHHQRKR NNT.-MTLPVHR LQHPVLI^RL WIjDHPHRCN VTYNVNNGIQ 

YVaSOAEQNA eEVGSPPSYS EALLDQRPAW YD&FPPPYSS DTESLNQADI* PPYRfiRSGSA 

NSASSQAASS DLSVEDTSHS PGQPGPQEGT ABPRDSEPSQ GTEEV 



60 
120 
180 
240 
300 



A7 DNA SEQUENCE

Gene name: Integrin, alpha 5 (fibronectin receptor, alpha polypeptide)
Unigene number: Hs.149609
Probeset Accession #: X06256
Nucleic Acid Accession #: NM_002205
Coding sequence: 1-3150 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC AGCTGCGCTG GGGCCCCCGG 
OGCCQACCCC CGCT8STGCC GCTGCTGTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 
GGCXTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 
GGATTCTCAG TGGAGTTTTA OCGGCCGGGA ACAGACGGGG TCAGTGTGCT GGTGGGAGCA 
CCCAAGGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTQ GTGCTGTCTA CCTCTGTCCT 
TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 
CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGCCTG TQGAGTACAA GTCCTTOCAG 

TGGTTCGGGQ caacagttcg agcccatggc tcctccatct tggcatgcgc tccactgtac 

AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA CCTCTCCACA 
GATAACTTCA CCCGAATTCT GGAGTATGCA CCCTGCCGCT CAGATTTCAG CTGGGCAGCA 
GGACAGGGTT AdGCCAAGG AGGCTTCAGT GCOGAGTTCA CCAAGACTGG CCGTGTGGTT 
TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGOCAGATCC TGTCTGCCAC TCAGGAGCAG 
ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 
CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCIGTGGC TGTTGGTGAA 
TTCAGTOGTG ATGACACAGA AGACTTTGTT GCTGGIGTGC CCAAAGGGAA CCTGACTTAC 
GGCTATGTCA CCATCCTTAA TGGCTCAGAC AITOGATCGC TCTACAACTT CTCAGGGGAA 
CAGATGQCCT CCTACTTTGG CEATGCASTG GCCGCCACAG ACGTCAATGG GGACGGGCTG 
OATGACTTGC TGGTOOGGGC ACCCCTGCTC ATGGATCGGA CCCCTGACGG GC6GOTCAG 
GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCCG GCATAGAGCC CACGCCCACC 
CTTACCCTCA CTGGCCATGA TGAGTTTGGC OGATTTGGCA GCTCCTTGAC CCCCCTGGGG 
GACCTGGACC AGGATGGCTA CAATGAXGTG GOCATCGG3G CTCCCTlTGG TGGGGAGACC 
CAGCAGGGAG TAGTGTTTGT AXTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC 
CAQGTTCTGC AGCCCCTGXG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 
CGAGGAGGCC GAGACCTGGA TGGCAATGGA HATCCTGATC TQATTGTGGG GTCCTTTGGT 
GTCGACAAGG CTGTGGTATA CAGGGaCCGC CCCATCGTGT CCGCTAGTGC CTCCCTCAOC 
ATCTTCCCCG CCATGTTCAA CCCAGAGGAG CGGAGCCGCA GCTTAGAGGG GAAGCCTGTG 
GCCTGCATCA ACCTTAQCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 
GGTTTCACAG TGGAACTTCA GCTGGACTGG CAGAAGCAGA AfiGGAGGGGT ACGGOGGGCA 
CTGTTCCTGG CCTCCAGGCA GGCAACCCTG ACOCAGACCC TGCPCATCCA GAATGGGGCT 
CGAGAGGATT GCAGAGAOAT GAAGATCIAC CTCAGGAACG AGTCAGAATT TCGAGACAAA 
CTCTCGCOGA TTCACATCGC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 
CACGGCCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 
ATCTTGCTGG ACTGTGGAGA AGACAACATC TGTGtfGCCTG ACCTGCAGCT GGAAGT GTTT 
GGGGAGCAGA ACCATGTGTA CCTGGGXGAC AAGAATGCCC TGAACCTCAC TTTCCATGCC 
CAGAATGTGG GTGAGGGTGG CGCCTATQAG GCTGAGCTTC GGGTCACCGC CCCTCCAGAG 
GCTGAGTACT CAGGACTCGT CAGACACCCA GGGAACTTCT CCAGCCTGAG CTGTGACTAC 
TTTGCCGTGA ACCAGAGOCG CCTGCTGGTG TGTGAOCTGG GCAACCCCAT GAAGGCAGGA 
GCCAGTCTGT GGGGTGGCCT TCGGTIMACA GTCCCTCATC TCCGGGACAC TAAGAAAACC 
ATCCAGTTSG ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCGCAAAG CGACGTGGTT 
TCCTTTCGGC TCTCCGTGGA GGCTCAGGCC CAGGTCACCC TGAACGGTGT CTCCAAGCCT 
GAGGCAGTGC TATTCCCAGT AAGOGACTGG CATCCCCGAG ACCAGCCTCA GAAGGAGGAG 
GACCTGGGAC CTGCTGTOCA CCATGTCTAT GAGCTCA^CA ACCAAGGCCC CAGCTCCATT 
AGCCAGGGTG TGCTGGAACT CAGCTGTCCC CAGGCTCTGG AAGGTCAGCA GCTCCTATAT 
GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 
GAGTO3GATC CCGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCAGC 
TCTGCTTCCT CGGGACCTCA GATCCTGAAA TGCCCGGAGG CTGAGTGTTT CAGGCTGCGC 
TGTGAGCTCG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 
TGGQCCAAGA CTTTCTTGCA GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 
TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCGT 
CAGGTGGOCA CAGCTGTGCA ATOGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 
ATCATCATCC TAGCCATCCT GTTTGGCC1C CTGCTOCTAG GTCTACTCAT CTACATCCTC 
TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA CCGCCATGGA AAAAGCTCAG 
CTCAAGCCTC CAGCCACCTC TGATBQCTGA 



60 
120 
180
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060
3120 






A8 Protein sequence:

Gene name: Integrin, alpha 5 (fibronectin receptor, alpha polypeptide)

Unigene number: Hs.149609

Probeset Accession #: X06256

Protein Accession #: NP_002196

Signal sequence: 1-42

Transmembrane domains: 998-1020

Integrin alpha domains: 56-115, 260-318, 322-384, 388-444, 452-503, 1022-1036

Cellular Localization: plasma membrane



1 11 21 31 41 51
| | | | | |

KGSRTPESPL HAVQLRWGPR RRPPliLPLLL LLLPPPPRVG GFKLDAEAPA VLSGFFGSFF 60
GPSVEFYRPO TDOVSVIiVGA P KANT 9QPGV tQGGAVYLCP WGA8PTQCTP IEPDSKGSRL 120
LESSLSSSEG EEPVEYKSLQ WFGATVRAHJ3 SSIIACAPIaY SWRTBKEPLS DPVGTCYIiST 180
DNFTHILEYA PCRSDFSWAA GQGYOQGGFS AEPTKTGRW LGGPGSYFWQ GQILSATQEQ 240
IAESYYPBYL INI*VQGQLQT RQASSIYDDS YIiGYSVAVGE FSGDDTEDFV AGVPKGNIiTY 300
GYVTIUTOSD IRSI»YHFSGE QMASYFGYAV AATDVNGDGb DDLLVGAPLL MDRTPDGRPQ 360
EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG KFGSSLTFIiG DLDQPGYUDV AIGAPFGGET 420
QQGWPVFPG GPGGliGSKPS QVLQPI^WAAS HTPDFFGSAIi RGGRDIJX3NG YPDI»XVGSFG 480
VDKAWYRGR PIVSASASLT IPPAMPNPEE RSCSLEGNPV ACINLSPCLN ASGKHVADSI 540
GFTVKLQLDH QKQKGGVHHA LFIiASRQATL TQTlrLXQNGA REDCREMKIY IiRNBSEFRDK 600
L3PIHIWjNF SLDPQAPVD3 EGIiRPAXiHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660
GEQNHVYLGD KNALHLTFHA QNVGEGGAYE AELRVTAPPB AEYSGI»VRHP GNFS9LSCDY 720
FAVMQSRLLV CDIjGNPMKAG A3LWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDW 780
SFRL6VEAQA QVTI2JJGV6KP EAVLFPVSDW HPRDQPQKBE DL3PAVEHVY ELIWQGPSSI 840
SQOVI^LSCP QAIiEGQQLdkY VTRVTGLNCT TNEPINPKGL ELDPEGSLEH QQKREAPSRS 900
BA6SGPQILK CPEAECFRLR CEU3PLHQQE S0SLQLHFHV WAKTFLQKEH QPFSLQCEAV 960
YKALKMPYRI LPRQIiPQKER QVATAVQWTX AEGSYGVPLW IIILAILFGI* UUW3LLIYIL 1020
Y1CLGFFKRSL PYGTAMEKAQ LKPPAT&DA



A9 DNA SEQUENCE

Gene name: Integrin, beta 1 (fibronectin receptor, beta, antigen CD29 includes MDF2, MSK12)

Unigene number: Hs.287797

Probeset Accession #: X07979
Nucleic Acid Accession #: NM_002211.1

Coding sequence: 1-2397 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

ATGAATTTAC AACCAATTTT CTGGATTGGA CTOATCAGTT CAOTTTGCTO TGTGTTTGCT 60 

CAAAC3W3ATG AAAATABATG TTTAAAAGCA AATGCCAAAT CATGTGGAGA ATGTATACAA 120 

GCAGGGCCAA ATTGTGGGTG GTGCACAAAT TCAACATTTT TACAQGAAGG AATGCCTACr 180 

TCTGCACGAT GTGATGATTT AGAAGOCTTA AAAAAGAAGG GTTGCGCTOC AGATGACATA 240 

GAAAATCCCA GAGGCTCCAA AGATATAAAG AAAAATAAAA ATGTAACCAA OCGTAGCAAA 300 

GGAACAGCAG AGAAGCTCAA GCCAGAGGAT ATTACTCAGA TCCAACCACA GCAGTTGGTT 360 

TTGCGATTAA GATCAGGGGA GCCACAGACA TTTACATTAA AATTCAAGA0 AGCTGAAGAC 420 

TATCCCATTG ACClCTACTA CCTTATGGAC CTGTCTTACT CAATGAAAGA CGATTTGGAG 480 

AATGTAAAAA GTCTTGGAAC AGATCTGAXG AATGAAATGA GGAGGATTAC TTOGGACTTC 540 

AGAATTGGAT TTGGCTCATT TGTGGAAAAG ACTGTGATGC CTTACATTAG CACAACACCA 600 

GCTAAGCTCA GGAACOCTTG CACAAGTGAA CAGAACTGCA CCAGCCCATT XAGCTACAAA 660 

AATGTGCTCA GTCTTACTAA TAAAGGAGAA GTATTTAATG AACXTGTTGG AAAACAGOGC 720 

ATATCTGGAA ATTTGGATTC TCCAGAAGGT GGTTTCGATG CCATCATGCA AGTTGCAGTT 780 

TGTGGATCAC TGATTGGCTG GAGGAATGTT ACAEGGCTGC TQGTGTTTTC CACAOATGCC 840 

GOGTTTCACr TTGCTGOAOA TOGQAAACTT GGTGGCATTG TTTTACCAAA TGATGGACAA 900 

TGTCACCTG9 AAAATAATAT GTACACAATG AGCCATTATT ATGATTATCC TTCTATTGCT 960 

CACC1TGTCC AOAAACTGAG TGAAAATAAT ATTCAGACAA TTTTTGCAGT TACTGAAGAA 1020 

TTTCAGCCTG TTTACAAGGA GCIGAAAAAC TTGATCCCTA AGTCAGCAGT AGGAACATTA 1080 

TCtGCAAATT CTAGCAATGT AATTCAGTTQ ATCATTGATG CATACAATTC CCTTTOCTCA 1140 

OAAGTCATTT TGGAAAACGG CAAATTCTCA GAAGGAGTAA CAATAAGTXA CAAATCTTAC 1200 

TGCAAGAAOG GGGTGAATGG AACAGGGGAA AATGGAAGAA AATGTTCCAA TATTTCCATT 1260 

OGAGATGAGG TTCAATTTGA AAtTTAGCATA ACTTCAAATA AGTGTCCAAA AAAGGATTCT 1320 

GACAGCTTTA AAATTAGGCC TCTGGGCTTT ACGGAGGAAG TAGAGGTTAT TCTTCAGTAC 1380 

ATCTGTGAAT GTGAATGCCA AAGOGAAGGC ATCCClGAAA GTCCCAAGTG TCATGAAQGA 1440 

AATGGGACAT TTGAGTGTGG OGCGTGCAGG TGCAATGAAG GGCGTGTTGG TAGACATTGT 1500 

GAATGCAGCA CAGATGAAGT TAACAGTGAA GACATGQATQ CTTACTGOVG GAAAGAAAAC 1560 

AGTTCAGAAA TCTGCAGTAA CAATGGAGAG TGCGTCTGCG GACAGTGTGT TTGTAGGAAG 1620 

AGGGATAATA CAAATGAAAT TTATTCTGGC AAATTCTGCG AGTGTGATAA TTTCAACTGT 1680 

GATAGATCCA ATGGCTTAAT TTGTGGAGGA AATGGTGTTT GCAAGTGTCG TG0JGK5TGAG 1740 

TGCAAOCCCA ACTACACTGG CAGTGCATGT GACTGTTCTT TGGATACTAG TACTTGTGAA 1800 

GCCAGCAACQ GACAGATCTG CAATGGCGGG GGCATCTGCG AGXGTGGTGT CXGrAAGTGT 1860 

ACAGATCCJGA AGTTTCAAGG GCAAACGTGT GAGATGTGTC AGACCTGCCT TGGTGTCTGT 1920 

GCTGAGCATA AAGAATGTGT TCAGTGCAGA GCCTTCAATA AAGGAGAAAA GAAAGACACA 1980 

TGCACACAGG AATGTTCCTA TTTXAACATT ACCAAGGTAfl AAAGTCGGGA CAAATTACCC 2040 

CAGCC3GGT0C AACCTGATCC TOTGTCCCAT TGTAAGGAGA AGGATGT7GA CGACTGTTGG 2100 

TTCTATTTTA OGTATTCAGT GAATGGGAAC AACGAGGrCA TGGTTCATGT TGTGGAGAAT 2160 

CCAGAGTGTC CCACTGGTCC AGACZATCATT OCAATTGTAQ CCGGTGTGGT TGCTGGAATT 2220 






GTTCTTATTG □CCTTGCATT ACTGCTOATA TGGAAGCTTT TAATGATAAT TCATGACAGA 2280 
AGOGAGTTTG CTAAATTTGA AAAGGAGAAA ATGAATGCCA AATGGGACAC OGOTGAAAAT 2340 
CCTATTTATA AGAGTGCCGT AACAACTGTG GTCAATCCGA AGTATGAGGG AAAATGA 
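
The "Coding sequence" fields in these records give 1-based, inclusive nucleotide ranges, and the sequence rows are printed in blocks of ten with a running position counter at the end of each row. A minimal Python sketch of how such a record could be checked is given here. The function names and the toy rows are hypothetical illustrations, not part of the listing, and the sketch assumes the rows have already been cleaned of scanning errors.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_listing(rows):
    """Join listing rows into one sequence string.

    Blocks are separated by spaces; purely numeric tokens are treated as
    position counters (60, 120, ...) and dropped. A counter damaged by
    scanning (for example "1B0") is not numeric and would not be dropped.
    """
    blocks = []
    for row in rows:
        for token in row.split():
            if token.isdigit():
                continue
            blocks.append(token.upper())
    return "".join(blocks)


def extract_cds(seq, start, end):
    """Return the coding region using 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coding range lies outside the sequence")
    return seq[start - 1:end]


def check_cds(cds):
    """Report basic consistency checks for a coding sequence."""
    return {
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
        "length_multiple_of_3": len(cds) % 3 == 0,
    }


# Toy rows (invented for illustration), laid out like the listing.
toy_rows = [
    "GGCATGAAAC CCGGGTTTTG 20",
    "ACC 23",
]
toy_seq = clean_listing(toy_rows)
toy_cds = extract_cds(toy_seq, 4, 21)  # "Coding sequence: 4-21"
toy_report = check_cds(toy_cds)
```

For a record such as the one above, the same check would be run with the stated range (here 1-2397) once the sequence text has been verified against the cited accession.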






A10 Protein sequence:
Gene name: Integrin, beta 1 (fibronectin receptor, beta, antigen CD29 includes MDF2, MSK12)
Unigene number: Hs.287797
Probeset Accession #: X07979
Protein Accession #: NP_002202.2
Signal sequence: 1-21
Transmembrane domains: 732-754
IN a domain: 34-464
PSI domain: 26-76
Cellular Localization: plasma membrane






1 11 21 31 41 51
| | | | | |

MNIjQPIFWIG LISSVCCVFA QTDEttRCLKA NAKSCGECIQ AGPNOGHCTN STFLQEGNPT 60
SARCDDLEAL KKKGCPPDDI ENPRGSKDIK KNKHVTNRSK GTAEKLKPED ITQIQPQQLV 120
LRLRSGEPQT FTIjKFKRAED YPIDLYYIMD LSYStlKDPLiE NVKSLGTDLM NEMRRITfiDF 180
RIGFGSFVEK TVMPYISTTP AKLRHFCT3E 0NCT6PFSYK HVLSLTNKGE VFNELVGKQR 240
ISGNLDSPBG GFDAIMQVAV CGSLIGWRNV TRLT.WSTDA GFHFAGDGKI, GGIVLPNDGQ 300
CHLENNMYTM 9HYYDYPSIA HLVQKLSENN IQTIFAVTEE FQPVYKBLKN LIPKSAVQTL 360
SAUSSNVIQL IIDAYNSLSS EVILKNGXLS EGVTISYKSY CKNGVNGTGK NGRKCSNISI 420
GDEVQFEISI TSNKCFKKDS DSFK1HPLGP TBBVEVXLQY ICECECQSEG IPBSPKCHBG 480
KGTFECGACR CBEGRVGKHC BCSTDEVNSE DMDAYCRKBN SSEICSNNGE CTCGQCVCSK 540
RTJNTNSrYSG KFCECDKFNC DRSNGLICGG KGVCKCRVCE CNPNYTGSAC ECSLDTETCE 600
ASNQQICNGR GICECGVCKC TDPKFQGQTC EHCQTCLQVC AEHKECVQCR AFNRGEKKDT 660
CTQECSYFNT TKVE3RDKLP QPVQPDPVBH CKEKDVDDCW FYFTYSVBGN NSVMVHWKN 720
PECPTGPD I I PIVAGWAGI VIiTGIiAIiltLl WKLLHIIHDR REFAKFEKEK MNAKWDTGEN 780
PIYKSAVTTV VKPKYEOK



A11 DNA SEQUENCE
Gene name: Selectin E (endothelial adhesion molecule 1)
Unigene number: Hs.89546
Probeset Accession #: M24736
Nucleic Acid Accession #: NM_000450
Coding sequence: 1-1833 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

ATQATTGCTT CACAGTTTCT CTCAGCTCTC ACrTTGOTaC TTCTCATTAA AGAGAGTGGA 60 

GCCTGGTCTT ACAACACCTC CACGGAAGCT ATGACTTATG ATGAGGCCAG TOCTTATTGT 120 

CABCAAAGGX* ACACACACCT GGTTGCAATT CAAAACAAAO AAGAGATTGA GTACGTAAAC 180 

TOCATAT TGA. GCTATTCACC AAGTTATTAC 1GGATTGGAA TCAGAAAAGT CAACAATGTG 240 

TGGGTCTGGG TAGGAACCCA GAAACCTCTG ACAGAAGAAG CCAAGAACTG GGCTCCAGGT 300 

GAACCCAACA ATAGGCAAAA AGATGAGGAC TGCGTGGAGA TCTACATCAA GAGAGAAAAA 360 

GATGTGGGCA TGTGGAATGA TGAGAGOTGC AGCAAGAAGA AGCTTGCCCT ATGCTACACA 420 

GCTGCCTGTA CCAATACATC CTGCAGTGGC CACGGTGAAT GTOTAOAGAC CAT CAATAAT 480 

TACACTTGCA AGTGTGACCC TGGCTTCAGT GGACTCAAGT GTGAGCAAAT TGTGAACTGT 540 

ACAOCCCTQG AATCCCCTGA GCATGGAAGC CTGGTTTGCA GTCACCCACT GG8AAACTTC 600 

AGCTACAATT CTTCCTGCTC TATCAGCTGT GATAGGGQTT ACCTGCCAAG CAGCATGGAG 660 

ACCATGCAGT GTATGTCCTC TGGAGAATGG AGTGCTCCTA TTCCAGCCTG CAATGTGGTT 720 

GAGTGTGATG CTGTGACAAA TCCAGCCAAT GGGTTCGTGG AATGTTTCCA AAACCCTGGA 780 

AGCTTCCCAT GGAACACAAC CTGTACATTT GACTGTGAAG AAGGATTTQA ACTAAtGGGA 840 

GCCCAGAGCC TTCAGTGTAC CTCATCTflGG AATTGGGACA ACGAGAAGCC AACQTBTAAA 900 

GCTGTGACAT GCAGGGCCGT CCGCCAGCCT CAQAATGGCT CTGTGAGGTG CAGCCATTCC 960 

CCTGCTOGAQ AGTTCACCTT CAAATCATCC TGCAACTTCA CCTGTGAGGA AGQCTTCATQ 1020 

TTGCAGGGAC CAGCCCAGGT TGAATGCACC ACTCAAGGGC AGTGGACACA GCAAATCCCA 1080 

GTTTGTGAAO CTTTCCAGTG CACAGCCTTG TCCAACCC03 AGCGAOSCTA CATGAAtTTG^r 1140 

CTTCCTAGTG CTTCTGGCAG TTTCCGTTAT GGGTCCAGCT GTGAGTTCTC CTGTGAGCAG 1200 

GGTTTTOTGT TGAAGGOATC CAAAAGGCTC CAATGTGGCC CCACAGGGGA GTOGGACAAC 1260 

GAGAAGCCCA CATGTGAAGC TQTGAGATGC GATQCTGTCC ACCAGCCCCC GAAGGGTTTG 1320 

GTOAQGTGTG CTCATTCCCC TATTGGAGAA TTCACCTACA AGTCCTCTTQ TGCCTTCAGC 1380 

TGTGAGGAGG GATTTGAATT ATATGGATCA ACTCAACTTG AGTGCACATC TCAGGGACAA 1440 

TGOACAGAAG AGGTTCCTTC CTGCCAAGTG GTAAAATGTT CAAGCCTGGC AGTTCCGGGA 1500 

AAGATCAACA TGAGCTGCAG TGOGOAGCCC GTGTrTGGCA CTGTGTGCAA GTTCQCCTGT 1560 

CCTGAAGGAOr GGACGCTCAA TGGCTCTGCA GCTCGGACAT GTGGAGCCAC AGGACACTGG 1620 

TCTGGOCTGC TACCTACCTG TQAAGCTCCC ACTGAGTCCA ACATTCCCTT GGTAGCTGGA 1680 

CTTTCTGCTG CTGGACTCTC CCTCCTGACA TTAGCACCAT TTCTCCTCTG GCTTCGGAAA 1740 

TGCTTAOGGA AAQCAAAGAA ATTTGTTCCT GCCAGCAQCT GCCAAAGCCT TGAATCAGAC 1800 
GGAAGCTACC AAAAGCCTTC ttacatcctt taa 



A12 Protein sequence:
Gene name: Selectin E (endothelial adhesion molecule 1)
Unigene number: Hs.89546
Probeset Accession #: M24736
Protein Accession #: NP_000441
Signal sequence: 1-22
Transmembrane domains: 555-573
C-lectin domain: 23-139
Cellular Localization: plasma membrane






1 
I 

WTASQFLSAL 
SILSYSPSYX 
DVGMWNDERC 
TAIiESPEHGS 
ECDAVTNPAN 
AVTCRAVRQP 
VCEAFQCTAL 
EKPTCEAVRC 
WTBEVP6CQV 
SGLIiPTCEAP 
GSYQKPSYIL 



11 
1 

TLVLIjIKESG 
WIGIRKVNNV 
SKfCKDAI/TYT 
LVCSHPL6HF 
GFVECFQHPG 
QNGfiVRCSHS 



DAVHQPPKGL 
VKCSSLAVPG 
TESMIPLVAG 



21 

I 

AWSYHTSTEA 
WVWVGTQKPL 
AACTNTSC9G 
SYWSSCSISC 
SFPWNTTCTF 
PAGEFTFK68 
LPSASG8FRY 
VRCAHSPIGE 
KIMMSCSGEP 
I>SAAGDE1iIjT 



31 
I 

MTYDEA8AYC 
TEEAKNWAPG 
EOECVETTNN 
DRGYLPSSMB 
DCEEGFELMG 



FTYK8SCAFS 
VFQTVCKFAC 
LAPFLLWUUC 



41 
I 

QQKYTHLVAI 
EPNWRQKDED 
YTCKCDPGPS 
TMQCMSSGEW 
AQ3LQCTSSG 
LQGPAQVECT 
GFVLKGSKRIi 
CEEGFELYGS 
PEGWTUSIGaA 
CLRKAKKFVP 



51 
I 

QNKEElEYIiN 
CVEIYIKREK 
GLKCEQXVNC 
SAPIPACNW 
NWDNEKPTCK 
TQGQWTQQIP 
QCGPTG2WDN 
TQLECTSQGQ 
ARTCGATGHW 
AS9CQSLESD 



60 
120 
180
240
300 
360
420 
480 
540 
600 



A13 DNA SEQUENCE
Gene name: G protein-coupled receptor 39
Unigene number: Hs.85339
Probeset Accession #: AA349893
Nucleic Acid Accession #: NM_001508
Coding sequence: 1-1362 (underlined sequences correspond to start and stop codons)



1 11 21 

I I I 

ATGGCTTCAC OCAGCCTCCC GGGCAGTGAC 
CCCGAGTTTG AGGTGGCCAC CTGGATCAAA 
TTCGTGATGG GCCTTCTGGG GAACAGCGCC 
AAAGGATACT TGCAGAAGGA GGTGACAGAC 
TTGQTQTTCC TCATCGGCAT GCGCATGGAG 
ACGTCCAGCT ACACCCTGTC CTGCAAGCTG 
GCTACGCTGC TGCACGTGCT GACACTCAGC 
TTCAGGTACA AGGCTGTGTC GGGACCTTGC 
GTCACCTCCG CCCTGGTGGC ACIGCCCTTG 
GTGAA0GTOC CCAGCCACCG GGGTCTCACT 
CAGCCCGAGA CCTCCAA7JAT GTCCATCTGT 
CAGTCCAGCA TCTTCGGCGC CTTCGTGCTC 
ATGTGCTGGA ACATGATGCA GGTGCTCATG 
ACGCGGCCTC CGCAGCTGAG GAAGTCCGAG 
ACCATCATCT TCCTGAGGCT GATTGTH3TG 
ATTCGGAGGA TCATGGCTGC GGCCAAACCC 
GCGTACATGA TCCTCCTCCC CTTCTCGGAG 
CCGCTOCTGT ACRGGGTGTC CTCGCAGCAG 
TGCCGCCTGT CGCTGCAGCA GGOCAACCAC 
ACCACCGACA GCGCCOGCTT TGTGCAGCGC 
TCTGCAAGGA GAACTGAGAA GATTTTCTTA 
TCTAAGTCCC AGTCATTGAG TCTCGAGTCA 
AATTCTGCTG CAGAGAATGG TTTTCAGGAG 



A14 Protein sequence:
Gene name:
Unigene number:
Probeset Accession #:
Protein Accession #:
Signal sequence:
Pfam domains:
Transmembrane domains:
Cellular Localization:



31 
I 

TGCTCCCAAA 
ATCACCCTTA 
ACCATTCGGG 
CACATGGTGA 



CACACTTTCC 
TTTGAGCGCT 
CAGGTGAACC 
CTGTTTGCCA 
TGCAACCGCT 
ACCAACCTCT 
TACCTCGTGG 
AAAAGCCAGA 
AGCGAAGAGA 



AAGCACGACT 
ACGTTTTTCT 
TTTCGGCGGG 
GAGAAGCGCC 
CCQTTGCTCT 
AGCACTTTTC 
CTAGAGCCCA 
CATGAAQTTT 



41 
I 

TCATTGATCA 
TTCTGGTGTA 
TCACCCAGOT 
GTTTGGCTTG 
TCATCTGGAA 
TCTTCGAGGC 
ACATCGCCAT 
TGCTGATTGG 
TGGGTACTGA 
CCAGCACCCG 
CCAGCCGCTG 
TCCTGCTCTC 
AGGGCTCGCT 
GCAGGACCGC 
TATGCTGGAT 
GGACGAGGIC 
ACCXCAGCTC 
TGTTCGTGCA 
TGCGOGTACA 
TCGCOTCCCG 
AGAGCGAGGC 
ACTCAGQCGC 
GA 



51 
\ 

CAGTCATGTC 
CCTGATCATC 
GCTGCAGAAG 
CTCGGACATC 
TCCCCTGACC 
CTGCAGCTAC 
CTGTCACCCC 
CTTCGTCTGG 
GTACCCCCTG 
CCACCACGAG 
GACCGTGTTC 
CGTAGCCTTC 
GGCGGGGGGC 
CAGGAGGCAG 
GCCCAACCAG 
CTACTTCCGG 
GGTCATCAAC 
GGTGCTGTGC 
TGCGCACTCC 
GCGCCAGTCC 
CGAGCCCCAG 
GAAACCAGCC 



G protein-coupled receptor 39

Hs.85339

AA349893

NM_001508, NP_001409
none found

7tm_1 [72-172, 224-344]

32-54, 68-90, 111-133, 151-173,

plasma membrane



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780
840 
900 
960 
1020
1080 
1140 
1200 
1260 
1320 



221-243, 280-301, 320-342 



1 11 21 31 41 51
| | | | | |

WA8PSI»PGSD CSQIIDHSHV PEFEVATWIK ITLILVYLII FVWGDLGHSA TIRVTQVIiQK 60
KGYLQKEVTD HMVSLACSDi: 1.VFLIGMPME FYSIIWNPI/T TSSYTLSCKL HTFIiFEACSY 120
ATIjLHVLTLS FERYIAICHP PRYKAVSGPC QVKLLIGFVW VTSAIiVAIiPIi LFAKGTEYPI* 180
VMVPSHRGLT CSIRSSTRKHE OPETSNMSIC TNIiSSRWTVF QSSIFGAFW YliWLLBVAF 240
MCKHMMQVLM KfiQKGSLAGG TRPFOLRXBE SBESH.TARRQ TIIPLHIitW TLAVCWMPNQ 300
IRRIMAAAKP KHDWTKfiYPE AYMILTjPFSB TFFYLSSVIN PLLYTVSBQQ PRKVFVQVLC 360
C&IjSDQHANH EKRLRVHAHS TTDSARPVQR PLLFASRRQ8 SARRTBKIFL STPQBEAEFQ 420
SK8QSIiSr.ES LEPKSGAKPA NSAAENGFQE HEV






AJ Prostate 

A15 DNA sequence
Gene name: CEGP1
Unigene number: Hs.222399
Probeset Accession #: AA256485
Nucleic Acid Accession #: AJ400877
Coding sequence: 81-3080 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GGCGTCCGCG CACACCTCCC CQCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 

CCGCAACCGC TGAGCCATCC ATGO GGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 120 

CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 180 

CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TQAGTGTGCC CAAGGGCTAG 240 

ATGACTGCCA TGCOGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 

AOCCTGGCTA CCAAGGGGAA GGCAGGCAGT QTGAGGACAT CGATGAATGT GGAAATGAGC 360 

TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 

TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 

AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 

GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600 

GCCTtJAGCTQ CATGAATAAQ GATCACGGCT GTAGTCACAT CTQCAAGGAG GCCCCAAGQG 660 

GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 

TCTTGACCTG TAACCATOGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 

GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 

AGCGAGAGGA CACTGTCCTG QAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 

ATAAACGGGT GAAACGGCGG CTGCTCATGG AAAOGTGTGC TGTCAACAAT GGAGGCTGTG 960 

ACCOCACCTG TAAGGATACT TCGACAGGTQ TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 

TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080 

GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTFTTGACTG CGGCTGCAAG AAAGGATTTA 1140 

AATTATXAAG AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 

GTOACCACAG CTGCATCAAC CACCCTGGCA CATTTQCTTG TGCTTGCAAC CGAGGGTACA 1260 

OCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 

GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 

AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 

CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 

GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT QTAACCTTTA 1560 

AGCTAAATGA AGGCAAGTGT AGXTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 

CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 

GCAGCTCTGG CAAGCAAGTC OCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 

TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTMTGACC 1800 

TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1860 

AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 

AAAAGCCTCC CAGAACATCT GAAC3CCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 

CAGAAAAOCA ATGTGTCAGT TOCAGGGCTG QGACCTATTA TGATGGAGCA CGAGAACGCT 2040 

GGATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAAOCAT 2100 

GCCCRAGACC AGOAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 

GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 

GTGCCCTGGG CACGTTCCAG CCTGAAQCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 

GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 

GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 

CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 

ACTTTGATGG CTCCACAAAC ATAACCCAOT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 

GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 

AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAQA 2640 

TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGOGGAAA ACCTCTTCAT 2700 

CCAATTCTGT GACAACATAT GAAACCTGCC AGAOGTACGA ACGCCCCATC GCCTTCACCT 2760 

CCAGGXCAAA GAAGCTGTGG ASTCftGTTCA AGTCCAATGA AGGGAAGAGC GCTAGAGGGT 2820 

TCCAGGTCCC ATAOGTGACA TATGAXGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 

GAGATGGCAG GCTCTATGCA TGTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 

TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAQTAC ACAGCCCAGG 3000 

AGTCCCGAGA GATGTTTCCA AGATGGTTCA TCGGATTGCT AGGTTOCAAA GTGTCCAGGT 3060 

TTTTGAGACC TTAgAA ATGA CTCAOCCCAC GTGCCACTCA ATACRAATOT TClGOTATAG 3120 

GGTTGGTGGG ACAGAGCTGT CTTOCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGOCTC 3180 

CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TAjCAGATATT TTGGTAAATT 3240 

GAACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300 

GAGCTTCTCA CTGGTGTGGa CGGATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 

TTGGTCAGGC TAGGTGAGAC TCRCCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGA0TC 3420 

TGTAGTGGAA AGGAGGCCAC AGAA2AAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 

CCGGCCCTCT CTAAGGGAGC CCTCTGCACT OGTGTGCAGG CTCTGACCAQ GCAGAACAGG 3540 

CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAG CCACCTTGAG ACCTGGGAGG 3600 

ACTCAQTTTC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTG 3660 

AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 



A16 Protein sequence
Gene name: CEGP1
Unigene number: Hs.222399
Probeset Accession #: AA256485
Protein Accession #: CAB92285
Signal sequence: 1-31
Transmembrane domains: none
PFAM domains: EGF-like_domains [49-84, 132-167, 177-213, 286-321, 407-442]
CUB_domain [809-918]
Cellular Localization: may be secreted

1 11 21 31 41 51
| | | | | |

MGVAGKNRPG AAWAVLLLLL LLPKLLUiAG AVPFGRGRAA QPQEDVDECA QGLDDGHAOA 60 

LGQBTTTTSYK C9CKPGYQGE GRQCEDIDEC GNHLNGGCVH DCLN1PGKKR CTCFDGFMLA 120 

HDGHNPTiDVD KCLEBMGGC3Q HTCVNVMGSY ECCCKBGFPL SDNQHTCIHR SEEGLSCMHK 180 

DEGCSHTCKE APRG3VACBC RPGFELAKNQ RDCILTCNBG NGGCQHSCQD TADGPECSCH 240 

PQYKMHTDGR SCLEREDTVL BVTESHTISV VDGDKRVKBR X.LMBTCAVKK GGCDRTCKDT 300 




STGVHCSCPV GFTLQLDGKT CXDIDECQTR NGGCDHFCKN IVGSFDCGCK XGFKLLTDEK 360 

S CQDVDE CSLi DETCDHSCIN HPGTFACACN RGYTkYGFTH CODTNECSIN NGGCQQVCVN 420 

TVG6YECQCH PGYKLHNNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 480 

SSDVTTIRTS VTFXIrfffEGKC SLKNAKLFpE GTiRPALPEKH 9SVXE9FRYV NLTCSSGKOfV 540 

PGAPGRPSTP KEMFITVEPE LETNQKEVTA ECDLSCIVKR TEKRLRKAIR TIiRKAVHREQ; 600 

FRLQLSGMNIj EVAKKPPRTS ERQAESCGVG QGHAENQCVg CRAGTYYDGA RERCILCPNG 660 

TFQNEEGQMT CEPCPRPGNS GAIjKTPEAWN MSBCGGLCQP GEYSADGFAF CQLCALGTFQ 720 

PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSEGHFY NTTTHRCIEC PVGTYQPEFG 780 

KNNCV9CPGW TTTDFDSSTN ITQCKNRRCG QEIjQDFTGYI ESPKYPGWYP ANTECTWTIH 840 

PPPKRRI1.IV VPEIFLPIED DCGUYLVMRK TSSSNSVTTY BTCQTYBRPI AFTSRSKKLW 900 

IQFKSMEGNS ARGFQVPYVT YOEDYQELIE OIVRDGRkYA SENHQEILKD KKDIKALFDV 960 
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK 



A17 DNA sequence 

Gene name: ESTs

Unigene number: Hs.293102

Probeset Accession #: AI951118, AW170035, AL044891, AI908272

Nucleic Acid Accession #: none found

Coding sequence: 1-489 (entire sequence is an open reading frame)

1 11 21 31 41 51 

I 1 I I 1 I 

CAAAftAGAAA TAGATAAAAT AAATGGAAAA TTAGAAGftGT CTCCTGATAA TGATGGTTTT 60 

CTGAAGGCTC CCTGCAGAAT GAAAGTTTCT ATTCCAACTA ASGCCTTAGA ATTGATGGAC 120 

ATGCAAACTT TCAAAGCAGA GCCTCCCGAG AAGCCATCTG CCTTCGAGCC TGCCATTGAA 180 

ATGCAAAAGT CTGTTCCAAA TAAAGCCTTG GAATTGAAGA ATGAACAAAC ATTGAGAGCA 240 

OATCAGATGT TCCCTTCAGA ATCAAAACAA AAQAASGTTG AAGAAAATTC TTGGGATTCT 300 

GAGAGTCTCC GTGAGACTGT TTCACAGAAG GATGTGTGTG TACCCAAGGC TACACATCAA 360 

AAAGAAATGG ATAAAATAAG TGGAAAATTA GAAQATTCAA CTAGCCTATC AAAAATCTTG 420 

GATACAGTTC ATTCTTGTGA AAGAACAAGG GRACTTCAAA AAGACCCCT3 TGACCCACGT 480 
TCAGGAAAA 



A18 Protein sequence
Gene name:
Unigene number:
Probeset Accession #:
Protein Accession #:
Signal sequence:
Transmembrane domains:
Cellular localization:

ESTs
Hs.293102
AI951118, AW170035, AL044891, AI908272
none found
none
nuclear

1 11 21 31 41 51
| | | | | |

QKEIDKINGK IjEESPBNBGF UCAPCEMKVS IPTKALELMD MQTFXABPPE KP SAFE PA IE 60 
MQKSVP HKAL BUCNEQTLRA DQHFPSEBKQ KXVEENSWDS E8LRBTV6QK DVCVPKATHQ 120 
KEMDKISGKL EDETSItSKJXt DTVH&CERTR EUQKDPCDPH SGK 



A19 DNA SEQUENCE
Gene name: TMPRSS3a
Unigene number: Hs.298241
Probeset Accession #: AI538613
Nucleic Acid Accession #: AB038157
Coding sequence: 202-1566 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

ACCGGGCACC GGACGGCTCG GGTACTTTCQ TTCTTAATTA GGTCATGCQC GTGTGAGCCA 60 

GGAAAGGGCT GTGTrTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CGATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCOGC CTGCTGTTGA AGCCCCCTTC 240 

tcattcosat cgctttttgg ccttgatgat ttgaaaataa gtcctgttgc aocagatgca 300 

gatgctgttg ctgcacagat cctbtcactg ctgcca.ttga agttttttcc aatcatcgtc 360 

attgggatca ttgcattgat attagcactq gccattggtc tgggcatcca cttcgactgc 420 

tcagggaagt acagatgtcq ctcrtccttt aagtgtatcg agctgatagc tcgatgtgac 480 

GGAGTCTCGG ATTGCAARGA CGGGGAGGAC GAGTACCGCT GTQTCCGGGT GGGTGGTGAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTrGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AQTGGGCCTG GGAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GSCTCXGTCA TCACGOCCCT GTGGATCATC 960 

ACTGCTGCAC ACTOTGTTTA TGACTTOTAC CTCCCCAAGX CATGGACCAT CCAGGTGGGT 1020 

CIAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCRAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TOCCTGCOCA ACTGTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTBCTGQAC GTCAGGAIGG GGGGCCACAG AGGATGGAGC AGGTGACGCC 1260 

TCCOCTGTCC TGAACCACGC GGCCGTCCCT TTGATTTCCA ACAAGATCTG CAACCACAOG 1320 

GAOOTGTACG GTOGCATCAT CTCCCCCTCC ATGCTCTGCG CGGGCTACCT OACGGGTGGC 1380 






OTGGACAGCT GCCAGGGGGA CAGCGGGGGG CCCCTGGTGT GTCAAGAGAG GAGGCTGTGG 1440 

AAGTTAGTGG GAGCGACCAG CTTTGGCATC GGCTGCGCAG AGGTOAACAA GCCTGGGGTG 1500 

TACACCCGTG TCACCTCCTT CCTOQACTGG ATCCACGAGC AGATGGAGAG AGACCTAAAA 1560 

ACCTGAAGAG GAAGGGGACA AGTAGCCACC TGAGTTCCTG AGGTOATGAA GACAGCCDGA 1620 

TCCTCCCCTG GACTCCCGTG TAGGAACCTG CACACGAGCA GACACCCTTG GAGCTCTGAG 1680 

TTCCGGCACC AGTAGCAGGC CCGAAAGAGG CACCCTTCCA TCTGATTCCA GCACAACCTT 1740 

CAAGCTGCTT TTTGTTTTTT GTTTTTTTGA GGTGGAGTCT CGCTCTGTTG CCCAGGCTGG 1800 

AGTGCAGTGG CGAAATCCCT GCTCACTGCA GCCTCCGCTT CCCTGGTTCA AGOGATTCTC 1860 

TTGCCTCAGC TTCCCCAGTA GCTGGGACCA CAGGTGCCCG CCACCACACC CAACTAATTT 1920 

TTQTATTTTT AGTAGAGACA OGGTTTCRCC ATGTTGGCCA GGCTGCTCTC AAACCCCTGA 1980 

CCTCAAATGA TGTGCCTGCT TCAGCCTCCC ACAGTGCTGG GATTACAGQC ATGGGCCACC 2040 

ACGCCTAGCC TCACQCTCCT TTCTGATCTT CACTAAGAAC AAAAGAAGCA GCAACTTGCA 2100 

AOGGCGGCCT TTCCCACTGG TCCATCTGGT TTTCTCTCCA GGGGTCTTGC AAAATTCCTG 2160 

ACGAGATAAG CAGTTATGTG ACCTCACOTG CAAAGCCACC AACAGCCACT CAGAAAAGAC 2220 

GCACCAGCCC AGAAGTGCAG AACTGCAGTC ACTGCACGTT TTCATCTCTA GGGACCAGAA 2280 

CCAAACCCAC CCTTTCTACT TCCAAGACTT ATTTTCACAT GTGGGGAGGT TAATCTAGGA 2340 

ATGACTCGTT TAAGGCCTAT TTTCATGATT TCTTTGTAGC ATTTGGTGCT TGACGTATTA 2400 

TTGTCCTTTG ATTCCAAATA ATATGTTTCC TTCCCTCAAA AAAAAAAAAA AAAAAAAAAA 2460 

AAAAAAAA 

A20 Protein sequence:

Gene name: TMPRSS3a

Unigene number: Hs.298241

Probeset Accession #: AI539613

Protein Accession #: BAB20077

Signal sequence: none found

Transmembrane domains: 43-65, 239-261

Tryp_SPc domain: 216-444

Cellular Localization: not determined






i 
I 

HGBHDPPAVE 
IALAXQLGIR 
TAASWK.TMCS 
VTALHHSVYV 
LOGGSVITTL 
LGHDIXLMKIi 
AVPLIBtlKIC 
FGIGCAEVNK 



11 
I 

APFSFKSIiFG 
FECfiGKYfiCR 
PDWKGHYANV 
REGCASGHW 
WIITAAHCVy 
AGPkTFNStt 
NHRDVYGGII 



21 
I 

U3DIJCI3PVA 
SSFKCIELIA 
ACAQUGPPSY 
TLQCTACGHR 
DI.YT>PK9WTr 
QPVCLPWSEB 
SPSMLCAGYIi 



31 
1 

PDADAVAAQI 
RCDGVSDCKD 
VSSDNLRVS5 
RGYSSRIVGG 
QVGLVS LLDN 
NFPDGKVCWT 
TGGVDSCQGD 
DIiKT 



41 
1 

liSLliPLKFFP 
GKDBYRCVRV 
LEGQFREEFV 
NMSLLSDWPW 
PAPSHLVEKI 
SGWGATEDGA 
SGGPLVCQER 



51
I 

IIVIGIIALI 
GGQNAVLQVF 
SlDHIiLPDDK 
OASLQFQGYH 
VYHSKYKPKR 
GDA9FVUJHA 
RLWKLVGATS 



60 
120 
180 
240 
300 
360 
420 



A21 DNA SEQUENCE
Gene name: ESTs; opposite strand to TRPS1
Unigene number:
Probeset Accession #: AA428090
Nucleic Acid Accession #: AA428090
Coding sequence: 1-558 (underlined sequences correspond to start and stop codons)



1 
I 

ATGA ASCCCA 
CAAGTTGTTG 
TTTTCAQACA 
GATTCTGTTG 
GQAAAAACAA 
CTATTGTTTA 
AAGTGTTTTG 
AAGGGCATTG 
GAAAAAGGGA 
CCCAAAAATG 
CAAGCAAATC 
TTTOCTTCTG 
GTTCTGCCTC 
TTTAACCAAA 
GGAATTGCAT 



CTGAATGTTT 
CTTTTTACAC 
ACAAGTTGAG 
GATTTCACCA 
TATATATATA 
AAAGTGAATC 
TATTCAAGTT 
CACCAAGAAC 
ACGAAAAAAT 
ATCAGGACTA 
ATTTTCTGCT 
OCCCTCATGC 



11 
I 

GTTTGAAACA 
GAGAAGAAAT 
CAGATGATCT 
GGCTTCCTGG 
CTAACCTCAG 
AGCTCCAGGG 
TTCAGCTTGC 
CCATTATGAA 
AAAAAGAAAA 
AGAAATGAAG 
TTCXGAAGCA 
QTTAAAAACA 
ATTTCTCTTT 
CAAATAAGAT 
CATGTTCACA 
TDGTAACATA 
TTGAAATAAA 
AAAATATATC 
AGAGTCCAGG 
TTTCAATTCA 
TATATATATA 
CATATAAACC 
TAGATGTAAC 
ATGTTTGTGA 
TTCATGTGTT 
CAAATTGAAT 
GOTGAATATT 
CTCCCCCAAA 



21 
I 

CTGGGCTGAT 
TCATCTACAA 
TATGTCCATA 
TAGAGCTGGA 
CCATTTGCCC 
GTCATACTAT 
AGTATTAAAA 
A0COCCTCAA 
AAAAATAATA 
ATTGTATAAT 
ATCAATTATT 



31 



41 
I 



CAAAGTCATC 
ATTTGATATA 
TGAGTATACC 
ATCATTACCC 
CACATTTTTA 
AGTGAGAGAG 
CTTTATCAAT 
GCAGAGCCTG 
TATATTACAA 
ATACCAACGC 
AGACATCTTT 
GTAGAAATGA 
GTAAGAACAQ 
TCTTTTTCTT 
GCTGTAAGTT 
GTTTTCCATG 



ATAAAAATGT TTTCTGAAAT 
GTAGTATCTG TATCTTATCT 
GTGGAGG AAT CAGACAGCTG 
GACAGACTGC AACCCAAGAC 
AATTTCCACC TTGCTAGTTA 
GGCAATCTAC TGGTGGAATT 
CAAAAAAAAC ATTGCCTCCA 
AGACTCTCTG CTATTTTCAA 
ATTAGAAGGA TTTGTXCCTT 
GAGGGGAGAT ACTAATTATT 
TATATACTTT ATGT TCTGTC 
TCTTGCCAAC GTCCTTTCCT 
TTTCAGGGAA CTTGCCCTGA 
TTAATTTAAA CTTTTTGAGA 
GAATTCAAAG TTAAACTTCA 
AATACTCGAC ACTCAAXATT 
TGTTATCTCT CTGGAGAAAG 
TGTTTGTTTA AGAAAAAAAA 
ATAAGTAATA ATTTTTTAGA 
TATATATATA TATATATATA 
TGATCTGTAT TTCCTATTGC 
CGTTATGTGT AACTGGTBGT 
GCTGCCTGAA GATTGTTTGC 
ACATGCACTA TGAAAACAAA 
AACTATTATA GCCAACATTC 
AGCAACATGA AATCATTCCA 
AAATTTTAJCA TTGGCATTTT 
TGGTTGTCAA ATAGTCCGC 



51
I 

AGACCAAAGG 
TGTAOAAAAC 
CTACAACCGT 
AAAACCCAGA 
TCAGAGTTCA 
TATTCTGAGC 
GCTGCAAAGC 
AACATGGAAA 
AATTTGGGCT 
TTAAACTCTC 

ATCTGGATCT 
TTAATTTGAT 
TGATTGATTA 
TAAGCAGGAG 
TGATACTCAA 
TAGTATATAT 
TCAAAGCACA 
ATGGTGATTT 
TATATATATA 
TAGAAGGATG 
AAAACTTTAT 
ATAAGAAATA 
ATAAAATAAA 
TAGTATTCAA 
TATGAAAGAC 
GAGATGTTCC 



60 
120 
180
240
300
360
420
480
540
600
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



A22 Protein sequence:



WO 03/042661 PCT/US02/36810 



Gene name: opposite strand to TRPS1
Unigene number: none
Probeset Accession #: AA428090
Protein Accession #: none found
Signal sequence: none found
Transmembrane domains: 113-129
Cellular Localization: not determined






1 11 21 31 41 51

I I ! I I I 

MKPSUCHWAD IKMPSEIDQR QWGEEIHLQ WSVSYLVEK FSDTDDLMSI VEESDSCYBR 60 

DSVGLPGRAS DRLQPKTKPR GftTTNI*SHLP NFHLASYQ3S IiLFKLQGSIY GNLLVEPILS 120 

KCPVQLAVI,K QKKHCLQLQS KGIAIKKAPQ HLSAIPKTHK BKGSKKHKKII IHMC8DIWA 180 






A23 DNA SEQUENCE

Gene name: ESTs
Unigene number: Hs.29383
Probeset Accession #: AW207206
Nucleic Acid Accession #: AL133619
Coding sequence: 1-2070 (underlined sequences correspond to start and stop codons)



1 
I 

ATGAGCGGTG 
CGQCQCCGOC 
CT CAGGCAGA 
CAGCAGCAGC 
GAAAACAAGa 
ACACTGCCGC 
GGGGGAACAC 
CCTOTATGCC 
AGCCGTOQCT 
GGGCCTGAGG 
CCAAGTAGAG 
CCTCAGATTG 
ATGCTGGGGG 
GCAGCAACCA 
GCACTTCCGC 
GCTCACTTCC 
TGQJVflCCAGC 
GAjCATGGAGA 
CTGTTCTOGG 
GACAGGACAC 
CCCTGOTGCT 

AGGCTGAAGG 
BGCQGTAGCG 
TCTGTCAAGT 
TCCTTCAACA 
CCCCTACTTC 
GAOAAAGCAG 
AGGCAGATGG 
ACCaCACTTA 
ACCCAAGAGC 
CCQOAQGAAG 
AAGAGCCTCT 
CTGAAGCAGA 
AAACGGCGCC 



11 

I 

CGGGGGTGGC 
GCCAGCGCCC 
GCQACCCGCA 
ACTCGGAGAT 
GTGAGCCGGC 
TCCCGCAQCA 
AGGACGGGGA 
AACCCAQTGG 
GGACGATGTT 
TCATTGCAGG 
CTQAAATGGG 
CTGCTGTGGC 
CCCAGGOGAT 
TGGGGACAAA 
ATCCTGACAG 
CATTATCTTT 
CTGGGAACAT 
AGGGGGTTGA 
CAAAGTGTGG 
GGGAAGAGGC 
XTCCAGATGS 
GGGTCTGCAT 
AGGGCTCCTC 
CCGACACTOT 
CCATCTCTAA 
AGCAAGATTC 
ACAACAGCAA 
AGGCCTCTAA 
GGGCGGGGGC 
GGCAGTGCGA 
TGCG GCftC CT 
CTAGCTTTCC 
CCAAGAAATG 
CCCCGAAGAA 
TGCATCGCTC 



21 
1 

GGCTGGGACG 
CTCTGTGGGC 
GAAACGGAAC 
GCTOGCCAAG 
GCGGGGCCCT 
CAQAAACACA 
GCOCCTCCAG 
QTACAGGTTC 
ATGCAGCCAA 
GCGGCAGGTG 
AAGGAACCCC 
CAGGCCCAGG 
ATGGACACAC 

gggaggaagc 
cggcccccac 
ggggctgggg 
cgcagctggg 
gggagggccc 
cccaagtcgg 

CATGCTTTCC 
CCCCTCAGGA 
CAACGGAGTG 
ACGGACACRC 
GCGCTCTGCr 

ttcagccaac 

AAAAGCTGAC 
GCTGGACAAA 
TGCAGGAGCT 
ACACCCCCCA 
AGTGCTCATC 
CRAGTCCCTC 
CAGGGACCAA 
CCTGAGCCCA 
CAACTTTGCC 
AGTGCTTTGA 



A24 Protein sequence:
Gene name:
Unigene number:
Probeset Accession #:
Protein Accession #:
Signal sequence:
Transmembrane domains:
Cellular Localization:



31 
I 

CGGCCCCCCA 
GTCCAGTCCT 
CTGGACCTGG 
CTCCATGAGG 
AGGOOGGCCC 
GCCATCAACT 
AC TOT OCT TG 
TGGGGGftCCT 
GCACAGCACG 
GCCACAGGGT 
TGGQACAGCC 
ATTTCCAGCC 
TCCATCCAGG 
AGAGTCCXQT 
CCAGCCCAGG 
CTGACATCAG 
GCAGTGCCTA 
TTCCCTAGCC 
CAGCCCCAGC 
CTGGQGACCT 
AACCACCTTT 
TQGGTAGAGC 
AGGCCAGGAG 
GCAGACAGCC 
TCTCAAGGCA 
GTCTCCCAGA 
GTTCCTGGGG 
GCCTGTATGG 
ATGATCCTGC 
CGCGAGCTGT 
CTGGAAGGGA 
GAAGCCACGC 

GAGAGGCAGA 



ESTs 
Hs.29383
AW207206 
T43457 



41 
I 

GCTCGCCGAC 
TGAGGCCGCA 
AGAAAAGCCT 
AGATCGAGCA 
TGCCTCCCCA 
CCAGCACACG 
CCCACCTGGC 
GGACAGATGC 
TGCTGCTCTC 
□CTCCCCAGA 
CCTGCCCTGC 



GATCCCTTCC 
TTCCTTGCCA 
ATCCTGGGCT 
GAGGACATCT 
GGGCTCTCCC 
GCTGTGGCAA 
CCTGCAGTGC 
GCTGTTCCAT 
CCAGGGCCTC 
CGGGAGGACC 
QCAAOCGTGG 
TCTCCATGTC 
AGGCCAGGCC 
AGGGGGACCT 



51 
1 

CCCGGQCTCT 
GAGCCCGCAG 
GCAGTTCCTQ 
TCTGAAGCGG 
GGCACACTCA 
CCTGGGCTCA 
TGCACTGGCC 
CGCTACCTCT 
GGGAAGCCCA 
CCTCCCTCCT 
TAGATCTTTG 
GAGTCCTCAC 
TGCCATCTGG 
CTTGTCCAfia 
GTGGTCTCAA 
QACTGGTGGA 
TTCCCAGGGA 
CTCCAGTGAG 
TGGGGACGCT 
GTGTCCCAAG 
TGCTCCCTTG 
CAGCCCTGCC 
GCGTCTTGCG 
AAGCTTCCAG 
CCAGOCC6GC 
GGAAGAGGAG 



GGAACAGCCA 
OCCTTCCCCT 
GGAATACCAA 
GCCAOAGGCC 
ATFTCCCCAA 
AGCGTGCCAT 
AGAGGCTGCA 



GCACCAGGGC 
GCGAAAGCCC 
(XTCCTGCAQ 
CCAGGCAGCC 
GGTCTCCACC 
CCTGCCCGCA 
GGCAATGCAG 



60
120
180
240
300
360
420
480
540
600
660
720
780
840
900
960
1020
1080
1140
1200
1260
1320
1380
1440
1500
1560
1620
1680
1740
1800
1860
1920
1980
2040



303-322 

not determined 






1 
I 

MSGAGVAAGT 
QQQHSEMIAK 
GGTQDGEPLQ 
GFEVIAGRQV 
MLGAQGINTH 
AHFFL3LGLG 



GAHWVCINGV 
SVKSISNSAK 



11 
I 

RPPSSPTPGS 
LHEEIEHLKR 
TVUffllAALA 
ATGCSPDLPP 
SIQGSLPAIW 
I.T8GGHLTGG 
QPQPCSAGOA 
HVKPGGPSPA 
SQGKARPQPG 



21 
I 

KRHRQRPSVG 
HJKGBPARGP 
PVCQPSGYRF 
PSRAEMGRNP 
AATMGTKGGS 
HSQPGNIAAG 
DRTSEEAMIiS 



31 
I 

VQSLRJPQSPQ 
RPALPPQAHS 
WGTWTDAATS 
WDSPCPARSL 
BVLPPCHIaGK 
AVPJKALPSQG 
LGTCCSMCPK 
HPGGKRGRLA 
VSQKADLEEE 



41 

I 

KRQ9DPQKKN 
TLPLPQHHNT 
SRGKTMLCSO 
PQIAAVAHPR 
ALPHPDSGPH 
DMEKGVEGGP 

pscFPDGpga 

GGSADTVRSP 
PLLSNSKUDK 



51 
1 

LDIiEKSLQFL 
AIHSSTRLG9 
AQHVLLSGSP 
ISSPMALSPH 
PAQDPGLWSQ 



NHLSRASAPL 
ADfiLSMSSFQ 
VPGVQGQAHK 



60 
120 
180 
240 
300 
360 
420 
480 
540 






BKAEASNAGA ACMGN*5QBQG RQHOAGAHPP MILPLPLRKP TTLRftCEVLI EELWNTKLLQ 600 
TQELRHLKSL LEGSQRPQAA PEEASPPRDQ EATKFPKVST KSLSKKCLSP PVBEHAILPA 660
LKQTPKHNPA ERQKRLQAMQ KKRUiRBVIi 






A25 DNA SEQUENCE
Gene name: ESTs; prolactin receptor
Unigene number: Hs.25252
Probeset Accession #: AA057193
Nucleic Acid Accession #: NM_000949
Coding sequence: 285-2153 (underlined sequences correspond to start and stop codons)



i 

GOAGGCTGAA 
CTCCCTCTTT 
ATGTCCTCGT 
TAAAGAACTC 
GTGAACTTCT 
CATCTGCAAC 
TACCTCCTGG 
GCTGGTGGAG 
QGGAAGGAGA 
GCCACTTTGG 
CTAACCAOAT 
AGCCAGACCC 
ACCTGTGQAT 
TCCTOTATGA 
GGCAGCAAAC 
TTCC3CTGCAA 
TACCTAGTGA 
dTGTCATCTG 
GCATCTTTCC 
AGGGCAAGTC 
ACTATGAGGA 
TOTCAQTCCA 
ACACTGACTC 
AACCCCAGGC 
CTGAAACAAC 
TTCATGCTGG 
CCAGATCCTC 
CACX33GCCAC 
AGTCTAQAGA 
CTGACCAGGA 
OGTTGGATTA 
AACAGAGAQA 



CTAAAAACGT 
ATCAAGCTGA 
TGGGTGGTTT 
ACTAATGGAA 
AATGCPCAAG 
TTTTTAACCA 
AACTGYGATT 
AAAAGCACAC 
GCTTTCATGA 
AAOMGACAA 
AACTGCATAA 
AAAGAAAGAA 
TTGCTGATAT 



11 
I 

ATCCCCAGAC 
CTGGATTTTA 
GCAGGAAGTA 
TCCTATTCAT 
GATACATTTC 
OGTTTTCACT 
AAAACCTGAG 
□CCTGGGACA 
GACACTCATG 
CAAGCAGTAC 
GGGAAGCAGT 
TCCTTTGGAQ 
TAAATGGTCT 
AATTCGATTA 
AGAGTTTAAG 
ACCAGACCAT 
CTTCACCATG 
TTTGATTATT 
GCCAGTrCCT 
TGAAGAACTA 
CTTGCTGGTG 
TTCAAAAGAA 
AGGCCGGGGG 
CAATCOCTCC 
CCACACCTGG 
TGGATCCAAA 
TTACCACAAT 
TCTGTTGAAT 
AGAGGGAAAG 
TACGCCCTGG 
TGTGGAGATT 
GAACAGCGGC 
GTCOGGOGTC 
GGCTTGCTTT 
GAAAGCCCTG 
GQATTACCTG 
TGATTGGTTA 
AATGTAGTCA 
CTTGCCXCTT 
TGTAGATTTA 
TGCTTAGTAT 
TTTGGCATGG 
AAQAAAATTT 
CCTTTACACT 
tTGCATTCCAG 
GCAAGTAAGA 



21 

I 

GCCQGTTTTC 
CCGACOGTTC 
CTCATCOACT 
GGAGGCGAAC 
CTGCAGCAAG 
CTGCTACTTT 



GATGGAGGAC 
CATGAATQTC 
ACCTCCATGT 
TTCTCGGATG 
CTGGCTGTGG 
CCACCTACCC 
AAftOCCGAGA 
ATTCTCAGCC 
GGATACTGGA 
AATGATACAA 
GTCTGGGCAa 
GGGCCAAAAA 
CTGAGTGCCT 
GAGTATTTAG 
CACCC3UVGTG 
AGCTGTGACA 
ACArrCTATO 
GACCCCCAGT 
TGTTCAACAT 
ATTACTGATG 
GAAGCAGGTA 
GCAACCCAGC 
CTGCTGOCCC 
CACAAGGTCA 
AAQCCCAAGA 
ATGGATAACA 
GAAGAATCAG 
GCCAACTTCA 
GATCCCGCAT 
AAATGTGATT 
GACTGACACT 
TCTCCAGCAG 
CTTTTTGCTG 
TGTTGAGGGA 
GACAGACGGA 
TCCACATAGG 
CCTCGTCCAT 
AACAAATTGT 
AAT 



31 

i 

TGGGCTGGGC 
GCGAAACAGC 
GATGTGGCAG 
ACTGAGGATG 
AGAAGGCAGC 
TTCTCAACAC 
GTCGTTCTCC. 
11CCTACCAA 
CAGACTACAT 
GGAGGACATA 
AACTTTATGT 
AAGTAAAACA 
TGATTGACTT 
AAQCZAGCTGA 
TACATCCAGG 
BTGCATGGAG 
CCGTGTGGAT 
TGGCTTTGAA 
TAAAAGGATT 
TGGQATGCCA 
AAGTAGATGA 
AAQGTATGAA 

ATCCTGAGGT 
GCATAAGCAT 
GGCCCTTACC 
TGTGTGAGCT 
AAGATGCTTT 
AGAGGGAGGT 
AGGAGAAAAC 
ACAAAGATGG 



41 
I 

TT T"CTG ClTA 
TTTCCACACA 
ACTTTGCTCC 
CTTTCCACAT 
CAACATGAAG 
CTGCCTTCTG 
CAATAAGGAA 
TTATTCACTG 
AACOGGTGGC 
CATCATQATG 
GGACGTGACT 
GCCAGAAGAC 
AAAAACTGGT 
GTGGGAGATC 
ACAGAAATAC 
TCCAGCGACC 
CTCTGTGGCT 
GGGCTATAGC 
TGATG CT CAT 
AGACTTTCCT 
TAGTG^GGAC 
ACCCACATAC 
TTTGTCTGAA 
CATTGAGAAG 
GGAAGGCAAA 
ACAGCCCAGC 



AAAATGCTCT 
AGAAAGCTTC 
CCCCTTTGGC 
TGCATTATCA 



ACATCCTGGT 
CCAAAGAGGC 
CTGCAACATC 
GTT1TACACA 
TTTCTTCAGG 
ACTAAAGCTC 
CTGATTCCAG 
TTAGITATAA 
CAATGCCAAT 
AATGAAATTG 
AAAATGCCAT 
TTTATTAGGA 
TTACATAAGT 



GTTGGTGCCA 
CCCACCATCA 
AAGCAAGTGC 
CTCCTTTCAC 
TAACACTACA 
CCAGCTCCTT 
AACAAATCAT 
AACTATQTGT 
AGGTATATCC 
TCAAAATTGT 
OAAAATTGCT 
TTACCCAAAT 
TCCTATACCT 



51
I 

CTCACICCTT 
ATGGAGCTTC 
CTGACAAAAC 
OAACCCTGAA 
GAAAATGTGG 
AATGGACAGT 
ACATTCACCT 
ACTTACCACA 
CCCAACTCCT 
GTCAATGCCA 
TACATAGTTC 
AGAAAACOCT 
TGGTTCAOGC 
CATTTTGCTG 
CTTGTCCAGQ 
TTCATTCAGA 
QTOCTTTCTG 
ATGGTGACCT 
CTGTTGGAGA 
CCCACTTCTG 
CAGCATCTAA 
CTGGATCCTG 
AAGTGTGAGG 
CCAGAGAATC 
ATCCCCTATT 
CAGCACAACG 
CCTGCAGGTG 
CAAACCATTA 
CATTCTGAGA 
TCCGCTAAAC 
TTGCTACCAA 
AATAAGGAGT 
BATCCACATG 
CTTGAACAGA 
AGGCTCCAGC 
TGATAGCTTG 



TCA TGCT CCA 
TATGTTTCCT 
TCAATGAAAT 
TCTGGAAAAG 
TTACCATAGA 
TTTGAAAAAC 
ATAACCATTx 
TACTGACACA 



A26 Protein sequence:
Gene name: ESTs; prolactin receptor
Unigene number: Hs.25252
Probeset Accession #: AA057193
Protein Accession #: NP_000940
Signal sequence: 1-23
Transmembrane domains: 237-253
FN3 domains: 28-112, 127-215
Cellular Localization: plasma membrane



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 






i 
1 

MKHNVABMV 



11 
I 

FTLIiIjFLJITC 



21 



31 

I 



■VTY3VQPDPP 
EJHFAQQQTE 
VAVLEAVICL 
PPPTSDYHDL 



I»EIiAVKVKQP 
FKIXjSLHPGQ 
IIWAVAliKG 
LVEYTjEVDDS 
PSTPYDPEVI 



IjUIGQLPPGK PEIFKCRSPW 
GGEHBCHFGK QYTSMWRXYI 
EDRKPYLWIK KSPPTlilDLK 
KYLVQVRCKP DHGYMSAWSP 
YSMVTCIFPP VPGPKIKGFD 
EDQHLMSVHS KEKPSQGMKP 
EKPENPBTTH TWDPQCISME 



41 51
I I 

KHTPTCWHRP GTDGGItPTiJl 60 

MMVNATBQNG SSFSDELYVD 120 

TGHPTLLYEX RLKFBKAAEW 180 

ATFIQIPSDF TMNDTTVWIS 240 

AHLLEK3XSB ELLSALGCQD 300 

TYIjDPDTDSG RGSCDSPSLli 360 

GKIPYFHAGG SKCSTHPLPQ 420 
PSQHNPRSSY HHITnVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQREVE 480

SFHSETDQDT PWLLPQEKTP FGSAKPLDYV EIEKVNKDGA LSXiPKQREN SGKPKKPGTP 540 

J2NMKEYAKVS OVMDNHIIiVL. VPDFHAKNVA CFEESAKJEAP PSLBQNQAEK ALANFTATSS 600 
XCHLQLGGLD YLDPACFTHS FH 



A27 DNA SEQUENCE

Gene name: Human neuropeptide Y receptor Y1 (NPYY1)

Unigene number: Hs.169266

Probeset Accession #: L07615

Nucleic Acid Accession #: NM_000909.1

Coding sequence: 209-1363 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

I 1 I I I I 

CATTCCCACC CTTCCTTCTT TAATAAGCAG GAGOGAAAAA GACAAATTCC AAAGAGGATT 60 

GTTCAGTTCA AGGGAATGAA GAATTCAGAA TAATTTTGGT AAATGGATTC CAATATCGGG 120 

AATAAGAATA AGCTGAACAG TTGACCTGCT TTGAAGAAAC ATACTGTCCA TTTGTCTAAA 180 

ATAATCTATA ACAACCAAAC CAATCAA AAT G AATTCAACA TTATTTTCCC AGGTTGAAAA 240 

TCATTCAGTC CACTCTAATT TCTCAGAGAA GAATGCCCAG CTTCTGQCTT TTGAAAATGA 300 

TGATTGTCAT CTGCCCTTGG C CAT OAT ATT TACCTTAGCT CTTGCTTATG GAGCTGTGAT 360 

CATTCTTGGT GTCTCTOGAA ACCTGGCCTT GATCATAATC ATCTTGAAAC AAAAQGAGAT 420 

GAGAAATGTT ACCAACATCC TGATTGTGAA CCTTTCCTTC TCAGACTTGC TTGTTGOCAT 480 

CATGTGTCTC CCCTTTACAT TTGTCTACAC ATTAATGGAC CACTGGGTCT TTGGTGAGGC 540 

GATGTGTAAG TTGAATCCTT TTGTGCAATG TGTTTCAATC ACTGTGTCCA TTTTCTCTCT 600 

GGTTCTCATT GCTGTGGAAG GACATCAGCT GATAATCAAC CCTCGAQGGT GGAGACCAAA 660 

TAATAGACAT GCTTATGTAG GTATTGCTGT GATTTGGGTC CTTGCTOTQa CTTCTTCTTT 720 

GCCTTTCCTG ATCTACCAAG TAATGACTGA TGAGCCGTTC CAAAATGTAA CACTTGATGC 780

GTACAAAGAC AAATACGTGT GCTTTGATCA ATTTCCATCG OACTCTCATA GGTTGTCTTA 840 

TACCACTCTC CTCTTGGTGC TGCAGTATTT TGGTCCACTT TGTTTTATAT TTATTTGCTA 900 

CTTCAAGATA TATATAOGCC TAAAAAGGAG AAACAACATG ATGGACAAGA TOA3AGACAA 960 

TAAGTACAGG TCCAGTGAAA CCAAAAGAAT CAATATCATG CTGCTCTCCA TTGTGGTAGC 1020 

ATTTGCAGTC TGCTGGCTCC CT CTT ACCAT CTTTAACACT GTGTTTGATT GQAATCRTCA 1080 

GATCATTGCT ACCTGCAACC ACAATCTGTT A3TCCTGCTC TGCCACCTCA CAGCAATGAT 1140 

ATCCACTTGT GTCAACCCCA TAT TTT ATGG GTTCCTGAAC AAAAACTTCC AGAGAOACTT 1200 

GCAOTTCTTC TTCAACTTTT GTGATTTCGG GTCTCGGGAT GATGATTATG AAACAATAGC 1260 

CAIGTCCACG ATGCACACAG ATGTTTCCAA AACTTCTTTG AAGCAAGCAA GCCCAGTCGC 1320 

ATTTAAAAAA ATCAACAACA ATGATGATAA TOAAAAAATC TGAA ACTACT TATAGCCTAT 1380

GGTOCCGGAT GACATCTGTT TAAAAACAAG CACAACCTGC AACATACTTT GATTACCTGT 1440 

TCTCCCAAGG AATGGGGTTG AAATCATTTG AAAATGACTA AGATTTTCTT GTCTTGCTTT 1500 

TTACTGCTTT TGTTGTAGTT GTCATAATTA CATTTGGAAC AAAAQGTGTG GGCTTTGGGG 1560 

TCTTCTGGAA ATAGTTTTGA CXMACATCT TTGAAGTGCT TTTTGTGAAT TTATGCATAT 1620 

AATATAAAGA CTTTTATACT GTACTTATTG GAATGAAATT TCTTTAAAOT ATTACGATGC 1680 

QCTGACTTCA GAAGTACCTG CCATCCAATA CGGTCATTAG AXTGGGTCAT CTTGATTAGA 1740 

TTAGATTAGA TTAGATTGTC AACAGATTGG GCCATCCTTA C T TTATQAT A GGCATCATTT 1800 

TAGTGTGTTA CAATAGTAAC AGTATGCAAA AGCAGCATTC AGGAGCCGAA AGATAGTCTT 1860

GAAGTCATTC AGAAGTGGTT TGAGGTTTCT GTTTTTtGGT GGTTTTTGTT TCJTTTTTTTT 1920 

TTTTTTCACC TTAAGGGAGG CTTTCATTTC CTCCGGACTG ATTGTCAGTT AAATCAAAAT 1980 

TTAAAAATGA ATAAAAAGAC ATACTTCTCA GCTGCAAATA TTATGGAGAA TTOOGCACCC 2040 

ACAGGAATGA AGAGAGAAAG CAGCTCCCCA ACTTCAAAAC CATTTTGGTA CCTGACAACA 2100 

AGAGCATTTT AGAGTAAT TA ATTTAATAAA GTAAATTAGT ATTGCTOCAA ATAJ3CTAAAT 2160 

TATATTTATT TGAATTGATG GTCAAGAOAT TTTCCATTTT TTTTACAGAC TGTTCASTGT 2220 

TTGTCAAGCT TCTGGTCTAA TATGTACTCG AAAGACTTTC CGCTTACAAT TTGTAGAAAC 2280 

ACAAATATOG TTTTCCATAC AGCAGTGCCT ATATAGTOAC TGATTTTAAC TTTCAATGTC 2340 

CATCTTTCAA AGGAAGTAAC ACCAAGGTAC AATGTTAAAG GAATATTCAC TTTAOCIAGC 2400 

AGGGAAAAAT ACACAAAAAC TGCAGATACT TCATATAGCC CATTTTAACT TGTATAAACT 2460 

GTGTGACTTG TGGCGTCTTA TAAATAATGC ACTGTAAAGA TTACTGAAEA GTTGTGTCAT 2520 

GTTAATGTGC CTAATT T CAT GTATCTTGTA ATCATGATTG AGCCTCAQAA TCATTTGGAG 2580 

AAACTATATT TTAAAGAACA AGACATACTT CAATGTATTA TACAGATAAA GTATTACATG 2640 

TGTTTGATTT TAAAAGGGCG GACATTTTAT TAAAATCAAT ATTGTTTTTG CTTTTTCTGA 2700 
GGAOTCTCTT TCAGTTTCAT TTTTTCTCAT CCCATGACTT CCCTCCGATG GT 



A28 Protein sequence

Gene name: Human neuropeptide Y receptor Y1 (NPYY1) mRNA, exon 2-3 and complete cds
Unigene number: Hs.169266
Probeset Accession #: L07615
Protein Accession #: NP_000900.1
Signal sequence: none found
Pfam domains: 7tm_1 [57- 91]
Transmembrane domains: 39-61, 77-99, 118-139, 157-179, 212-234, 264-286, 300-322
Cellular Localization: plasma membrane



1 11 21 31 41 51

I I I I I i 

MNSTLFSQVE MHSVB9NFSE KNAQEJiAFEBT DDCBLPIiAMI FTIiAUVYGAV IIl^VSOKLA 60 

LIIIlIiKQKB MENVTN1LIV WL8FSDBLVA IMCLPPTPVY TIMDHWVFGB AMCKLNPFVQ 120 

CVSITVSIFS LVLIAVKRHQ LIIUPHGWRP N5TRHAYVG1A VIWVLAVA3S IiPFLIYQVMT 180 

DEPFQKJVTLD AYKDKYVCFD QFPSDSHRiS YTTT.Ti7.VLQY PQPLCFIFIC YFKIYIRLXR 240 

RNNMMDKMRD NKYRSSETKR INTMLLSrW APAVCWLPLT IFHTVFDWHH QUATCNHNL 300 

LFLLCS1»TAM ISTCVNPIFY GFLWKKFQRD LQPFFNFCDF RSRDDDYETI AMSTMHTDVS 360 




KTSLKQASPV AFKKINNNDD NEKl 



A29 DNA SEQUENCE

Gene name: Homo sapiens G protein-coupled receptor (HOT7T175)
Unigene number: Hs.208229

Probeset Accession #: AI819198
Nucleic Acid Accession #: NM_032551.1

Coding sequence: 1-1197 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 

I I I I I I 

ATGCACACCG TGGCTACGTC CGGACCCAAC GCGTCCTGGG GGGCACCGGC CAACGCCTCC 60 

GGCracCCGG GCTGTGGCGC CAACGOCTCG GAjCGGCOCAG TCCCTTCQCC GCGGGCCGTG 120 

GACGCCTGGC TCGTGCCGCT CTTCTTCGCG GCGCTGATGC TGCTGGGCCT GGTGGGGAAC 180 

rCGCTGGTCA TCTACGTCAT CTGCCGCCAC AAGCCGATGC GGACCGTGAC CAACTTCTAC 240 

ATCGCCAACC TGGCGGCCAC GGACGTGACC TTCCTCCTGT GCTGCGTCCC CTTC&CQGCC 300 

CTGCTGTACC CGCTGCCCGG CTGQGTGCTG GGCGACTTCA TGTGCAAGTT CGTCAACTAC 360 

ATOCAGCAGG TCTCGGTGCA GGCCACGTGT GCCRCTCTGA COGCCATGAG TGTGGACCGC 420 

TGQTAOGTGA CGGTGTTCCC GTTGGGCGCC CTGCAOCGCC GCACGCCCCO CCTGGOGCTG 480 

GCTGTCAGCC TCAGCATCTG GGTAGGCTCT GCGGCGGTGT CTGCGOCGGT GCTCGCCCTG 540

CACCGCCTGT CACCCGGGCC GCGCGCCTAC TGCAGTGAGG CCTTCCCCAG CCGCGCCCTG 600 

GAG CGCGCCT TCGCACTGTA CAACCTQCTG GCGCTGTACC TG C TGCCGCT GCTCGOCACC 660 

TGCGCCTGCT ATGCOGCCAT GCTGOGCCAC CTGGGCCGGG TCGCCGTGCG CCCCGCGCCC 720 

GCCGATAGCG CCCTGCAGGG GCAGGTGCTG GCAGAGCGCO CAGGCGCCGT GCGGGCCAAG 780 

TGGTGGCGGC CGTGGTCCTG CTCTTOGCGG CCTGCTGGGG CCCCATCCAG 840

TGCTGCAGGC GCIGGGCOCC GCGGGCTCCT OOCACCCACG CRGCTACGCC 900 

GCCTACQOGC TTAAGACCTG GGCTCACTGC ATGTCCTACA GCAACTCCGC GCTGAACCCG 960 

CTGCTCTACG CCTTCCTGGG CTCGCACTTC C5GACAGGGCT TCCGCCGCGT CIGCCCCTGC 1020 

GCGCCGCGCC GCCCCCGCCG CCCCCGCCGG CCCGGACCCT CGGACCCCGC AGCCCCACAC 1080

GCOGAGCTGC ACCGCCTGGG GTCCCACCCG GCCCCCGCCA BGGCGCAGAA GCCAGGGAGC 1140 
AGTGGGCTGG OCGCGCGCGG GC7GTGCGTC CTGGGGGAGG ACAACGCCCC TCTCTGA 

A30 Protein sequence

Gene name: Homo sapiens G protein-coupled receptor (HOT7T175), mRNA

Unigene number: Hs.208229

Protein Accession #: AI819198

Signal sequence: none found

Pfam domains: 7tm_1 [59-323]

Transmembrane domains: 43-65, 86-108, 122-144, 159-181, 203-225, 260-282

Cellular Localization: plasma membrane



1 11 21 31 41 51 

I I I I I I 

MHTVATSGPN ASWGAPANAS GCPGCGANAS DGPVPSPHAV EAHLVPLPEA A05LLOLVGN 60 

SDV3YVICHH KPKHTVTNFY IANIAATDVT FUiCCVPFTA LLYPLPGWVL GDFMCKFVNY 120 

IQOVSVQATC ATLTAHSVDR WYVTVFPLRA LERRTPRliAL AVfiLSlWVGS AAVSAPVLAL 180 

BRI*SPGPRAY CSEAFPSRAL EHAPALYMLL AIiYI«IiPI»&AT CACYAAMLBH LGRVAVRPAP 240 

ADSALQGQVL ABHAGAVKAK V3RLVAAVYL LFAACKGPIQ LFLVLQALGP AGSHHPRSYA 300 

AYAMTHAHC MSYSNSAIiNP 1ULYAFLGSHF RQAFRRVCPC APBRPKRPRR PGPSDPAAPH 360 
AKLHRLGSHF APAHAQKPGS SGLAARGLCV LGBENAPI, 

A31 DNA SEQUENCE

Gene name: CH22_FGENES.706; cadherin, EGF LAG seven-pass G-type receptor 1, flamingo (Drosophila) homolog (CELSR1)

Unigene number: Hs.252387

Probeset Accession #: NM_014246

Nucleic Acid Accession #: NM_014246

Coding sequence: 1-9045 (underlined sequences correspond to start and stop codons)



1 H 21 31 41 51 

I 1 I I I i 

ATGGCGCCGC CGCCGCCGCC CGTGCTGCCC GTGCTGCTGC TCCTGQCCGC CGCCGCCGGC 60 

CTGCOGGCOA TGGGGCTGCG AGCGGCCGCC TGGGAGCCGC GCGTACCCGG CGGGACCCGC 120 

GCCTTCGOCC TCCGGCCCGG CTGTACCTAC GCGGTQGGCa CCGCTTGCAC GCCCCGGGOG 180 

CCGCSGGAGC TGCTGGACGT GGGCCGCGAT GGGCGGCTGG CRGGACGTCG GCGCGTCTOG 240 

GGCGCGGGGC GCCCGCTGCC GCTGCAAGTC OGCTTGGTGG CCCGCAGTGC CCCGAOGGCG 300 

CTGAGCCGCC GCCTGCGGGC GCGCACGCAG CTTCCOGGCT GCGGAGCCCG TGCCCGGCTC 360 

TGCGQAACCG GTaCCCGGCT CTGCGGGGCG CTCTGCTTCC CCGTCCCOGG CQGCTGCGCG 420 

GCCGCGCAGC ATTCGGCGCT CGCAGCTCOG ACCACCTTAC CCGCCTGCCG CTGCOCGCCG 480 

CaCCCCAGGC CCCGCTGTCC OGQCOGTCCC ATCTGCCTGC CGCCGGGOGG CTCGGTCCGC 540 

CTGCGTCTGC TGTGCGCCCT GCGGCGCGCG GCTGGCGCOG TCCGGGTGGG ACTGGCGCTG 600 

GAGGCCGCCA CCGCGGGGAC GCCCTCCGCG TCGCCATCCC CATCGCCGCC CCTGCCGCCG 660 

AACTTGCCCa AAGCCCGGGC GGGGCCGGCQ CaAOaOGCCC GGCGGOaCAC GAGCGGCAGA 720 

GGGAGCCTGA AGTTTCCGAT GCCCAACTAC CAGGTGGCGT TGTTTGAGAA OGAACCGGCG 780 

GGCACOCTCA TCCTCCAGCX GCACGCGCAC TACACCATCG AGGGCGAGGA GGAGCGCGTG 840 

AGCIATTACA TGGAGGGGCT GTTCGACGAG CGCTCCCGGG GCTACTTCCB AATOTACTCT 900 

GCCACGGGCG CCGTGAGCAC GGACAGCGTA CTGGACCGCG AGACCAAGGA GACGCAGGTC 960 

CTCAGGGTGA AAGCCGTGGA CTACAGTACG OCGCCGOGCT OGGCCACCAC CTACATCACT 1020 

GTCTTGGTCA AAGACACCAA CGACCACAGC CCGGTCTTOG AGCAQTCQGA GTACCGCGAG 1080

CGCGTGCGGG AGAACCXGGA GGTGGGCTAC GAGGTGCTGA CCATCCGCGC CAGCGACCGC 1140

GACTCGOCCA TCAACGCCAA CTTGCGTTAJC COCXJTQTTGG GGGGCGCGTG GGACGTCTTC 1200

CAGCTCAACG AGAGCTCTGG CGTGGTGAGC ACACGGGCGG TGCTGGACCG GGAGGAGGCG 1260

GCCGAGTACC AGCTCCTGGT GGAGGCCAAC GACCAGGGGC GCAATCCGGG CCCGCTCAGT 1320 

GCCACGGCCA CCGTGTACAT CGAGGTQOAG GACGAGAACG ACAACTACCC CCAGTTCAGC 1380 

GAGCAGAACT ACGTGGTCCA GGTGCCCGAG GACGTGGGGC TCAACACGGC TGTGCTGCGA 1440

GTGCAGGCCA CGGACOGGOA CCAGGGCCAG AACGCGGCCA TTCACTACAG CATCCTCAGC 1500

GGGAACGTGG COGGCCAGTT CTACCTGCAC TCGCTGAGCG GGATCCXGGA TGTGATCAAC 1560

CCCTTGGATT TCGAGGATGT CCAGAAATAC TCGCTGA3CA TTAAGGCCCA GGATGGGGGC 1620

CGGCCCCCGC TCATCAATTC TTCAGQGGTG GTGTCTGTGC AGGTGCrGGA TGTCAACGAC 1680

AACGAGCCTA TCTTTGTGAG CAGCCCCTTC CAGGCCACGG TGCTGGAGAA TGTQCCCCTG 1740

GGCTAOCCGG TGGTGCACAT TCAGGCGGTG GACGOGGACT CTGGABAGAA CGCCCGGCTG 1800

CACTATCGCC TGGTGGACAC GGCCTCCACC TTTCTGGGGG GCGGCAGCGC TGGGCCTAAG 1860

AATCCTGCCC OCACCCCTGA CTTCCCCTTC CAGATCCACA ACAGCrCCGG TTGGATCACA 1920

GTGTGTGCCG AGCTGGACCG CGAG3AGGTG GAGCACTACA GCTTCGGGGT QGAGGCGGTG 1980

GACCACGGCT CGCCCCCCAT GAGCTCCTCC ACCAGCGTGT CCATCAOOGT GCTGGACGTG 2040

AATGACAACG ACCCGGTGTT CACGCAGCCC ACCTACGAGC TTCGTCTGAA TGAGGATGOG 2100 

GCCGTGGGGA GCAGCGTGCT GACCCTGCAG GCCOGCGACC GTGACGCCAA CAGTGTGATT 2160 

ACCTACCAGC TCACAGGOGG CAAGACCOGG AACCGCTTTG CACTCAGCAG CCAGAGAGGG 2220 

GGCGGCCTCA TCACCCTGGC GCTACCTCTG GACTACAAGC AGGAGCAGCA GTACGTGCTQ 2280 

GCGGTGACAG CATCCGACGG CACACGGTCG CACACTGCGC ATGTCCTAAT CAAOGTCACT 2340

GATGCCAACA CCCACAGGCC TGTCTTTCAQ AGCTOCCATT ACACAGTGAG TGTCAGTGAG 2400 

GACAGGCCTG TGGGCACCTC CATTGCTACC CTCAGTQCCA ACGATGAGGA CACAGGAGAG 2460 

AATGOCCGCA TCACCTACGT GATTCAGGAC CCCGTGCCGC AGTTCCGCAT TGACCCCGAC 2520 

AGTQGCAOCA TGTACACCAT GATGGAGCTG GACTATGAGA ACCAGGTOGC CTACACGCTG 2580

ACCATCATGG CCCAGGACAA CGGCATCCCG CAGAAATCAG ACACCACCAC CCTAGAGATC 2640

CTCftTGCT CG ATGCCAATOA CAATGCACCC CAGTTCCTGT GGGATTTCTA CCAGGSTTCC 2700 

ATCTTTGAGG ATGCTCCACC CTCGACCAGC ATCCTCCAGG TCTCTGCCAC GGACOGGGAC 2760 

TCAGGTCOCA ATGGGCGTCT GCTGTACACC TTCCAGGGTG GGGAOTACGG CGATG6GGAC 2820 

TTCTACATCG AGCCCACGTC CGGTQTGATT CGCADCCAGC GCCGGCTGGA COGGGAGAAT 2880 

GTGGOCGTGT ACAACCTTTG GGCTCTGGCT GTGGATCGGG QCAGTCCCAC TCCCCTTAGC 2940

GCCTCGGTAG AAATCCAGGT GAOCATCTTG GACATTAATG ACAATGCCOC CATGTTTGAG 3000 

AAGGACGAAC TGGAGCTGTT TGTTGAGGAG AACAACCCAG TGGGGTCGGT GGTGGCAAAG 3060 

ATTCOTGCTA ACGACCCTGA TGAAGGCCCT AATGCCCAGA TCATGTATCA GATTGTGGAA 3120 

GGGGACATGC GGCATTTCTT CCAGCTGGAC CTGCTCAACG GGGACCTGCG TGCCATGGlG 3180 

GAGCTGGACT TTGAGOTCCG GCGGGAGTAT GTGCTGGTGG TGCAGGCCAC GTGGGCTCOG 3240

CTGGTGAGCC GAGCCACGGT QCACATCCTT CTCGTGGACC AGAATGACAA CCCGCCTGTG 3300 

CTGOCCGACT 'TCCAGATCCT CTTCAACAAC TATGTCACCA ACAAGTCCAA CAGlTTCCCC 3360 

ACCGGCGTGA TCGGCTGCM CCQG6COCAT GACCCCGACG TGTCAGACAG CCTCAACTAC 3420 

ACCTTOGTGC AGGGCAACGA GCTGCGOCTG TTGCTGCTGG ACCCCGCCAC GGGOGAACTG 3480

CAGCICAGCC GCGACCTGOA CAACAACCGG CCGCTGGAGG CGCTCATGGA GGTGTCTGTG 3540 

TCTGATGGCA TCCACAGCGT CACGOCCTTC TGCACCCTGC GTGTCACCAT CATCACGGAC 3600 

GACATGCTOA CCAACAGCAT CACTGTCCGC CTGGAGAACA TGTCCCAGGA GAAGTTCCTG 3660 

TCCCCGCTGC TGGCCCTCTT CGTGQAGGGG GTGGCCGCCG TGCTGTCCAC CACCAAGGAC 3720 

GAOGTCTTCQ TCTTCAACGT OCAGAACGAC AOCGACGTCA GCTCCAACAT CCTGAACGTG 3780

ACCTTCTCGS CGCTGCTGCC TGGCGGCGTC OGCGGOCAQT TCTTCCCGTC OGAGGACCTG 3840

CftGGAGCAGA TCTACCTGAA TCGGAGGCTG CTOACCACCA TCTCCACGCA GCGCG1GCTQ 3900 

CCCTTOGACG ACAACATCTG CCTQCGCQAG CCCTGCGAGA ACTACATGAA OTGCGTGTCC 3960 

GTTCTGCSAT TOGAGAGCTC CGCGCCCTTC CTCAGCTCCA CCACCGTGCT CTTCCGGCCC 4020 

ATOCACCCCA TCAACGGCCT GCGCTGCCGC TGOCOGCCCQ GCTTCACCGG CGACTACTGC 4080 

GAGACGOAGA TOGACCTCTG CTACTOCGAC CCGTGCGGCG CCAACGGOGG CTGCCGCAGC 4140

CGCGAGGGCG GCTACACCTG CGAGTGCTTG GAGGACTTCA CTGGAGAGCA CTGTGAGGTG 4200 

GATGCCCGCT CAGGCCGCTG TGCCAACGGG GTGTGCAAGA ACGGGGGCAC CTGCGTGAAC 4260 

CTGCTC3VTCG GCGGCTTOCA CTGCGTGTGT CCTCCTGGCG AGTATGAGAG GCCCTACTGT 4320 

QAGGTGACCA CCAGGAGCTT CCC3GCCCCAG TCCTTOGTCA CCTTCCGQGG OCTGAGACAG 4380 

CGCTTCCACT TCACCATCTC CCTCACGTTT GCCACTCAGG AAAGGAACGG CTTGCTTCTC 4440 

TACAACGGCC GCTTCAATGA GAAGCACGAC TTCATOGCCC TGGAGATCGT GGAC5GAGCAG 4500 

GTGCAGCTCA CCTTCTCTGC AGGCGAGACA ACAACGACCG IGGCACCGAA GGTTOCCA0T 4560 

GGTGTGAGTG ACGGGCGGTG GCACTCTGTG CAGGTGCAGT ACIACAACAA GCCCAATATT 4620 

GGCCACCTGC5 GCCTGCCCCA TGGGCCGTCC GGGGAAAAGA TOGCCGTGGT GACAGTGGAT 4680

GATTGTGACA CAACCATGGC TCTGCGCTTT GGAAAGGACA TCGGGAACTA CAGCTGCX3CT 4740

GCCCAGGGCA CTCAGACCGG CTCCAAGAAG TCCCTGGATC TGACOSGCOC TCTACTCCTG 4800

GGGGGTGTOC CCAACCTGCC AGAAGACTTC CCAGTGCACA ACCGGCAGTT OGTGGGCTGC 4860 

ATGCGGAACC TGTCAGTCGA CX3GCAAAAAT GTGGACATGG GCGGATTCAT CBCCAACAAT 4920 

GGCACCCGGG AAGGCXGCGC TGCTCGGAGG AACTTCTGCG ATGGGAGGOG GTGTCftfiAAT 4980 

GGAGGCACCT GTGTCAACAG GTQGAATATG TATCTGTGTG AGTGTCCACP GCGATTOBGC 5040

GGGAAGAACT GTGAGCAAGC CATGCCTCAC CCCCAGCTCT TCAGCGGTGA GAGCGTCGTG 5100 

TDCTGGAGTG ACCTGAACAT CATCATCTCT GTGCCCTGGT ACCTGGGGCT CATGTTCCGG 5160 

ACCCX3GAAGG AGGACAGCGT TCTGATGGAG GCCAOCAGTG GTGGGOCCAC CAGCTTTOGC 5220

CTCCAGATCC TGAACAACTA CCTCCAGTTT GAGGTGTCCC AOGGDCOCTC CGAXGTGGAG 5280

TOCGTGATGC TGTCCGGGTT GOGGGTGAOC GACGGGGAGT GGCACCACCT GCTGATOGAG 5340

CTGAAGAATG TTAAGGAGGA CRGTGAGATG AAGCACCTGG TCACCATQAC CTTGGACTAT 5400 

GGGATGGACC AGAACAAGGC AGATATDGGG GGCATGCTTC CCGGGCTGAC QGTAAGGAGC 5460 

GTGGTGGTCG GAGGCGCCTC TGAAGACAAG GTCTCQSTQC GCCGTGGATT CCGAGGCTGC 5520

ATGCAGGOAG TGAGGATGGG GGGGACGCCC ACX^ACGTCG CCACCCTGAA CATGAACAAC 5580

GCACTCAAGG TCAGGGTGAA GOACGGCTGT GATGTGGAOG ACCCCTGTAC CTCGAGC3CCC 5640

TGTCCCCCCA ATAGCCGCTQ CCACGACGCC TGGGAGGACT ACAGCTGCGT C1GTGACAAA 5700 

GGGTACCTTG GAATAAACTG TGTGGATGCC TGTCADCTGA ACCXTCTGOGA OAACATGGGS 5760 

GCCTGCGTGC GCTCCCKCGG CTCCCOGCAG GGCTACGTGT GCGAGTGTGG GCCCAGTCAC 5820 

TAGGGGCCGT ACTGTGAGAA CAAACICGAC CTTCCGTGCC CCAGAGGCTG GXGGGGGAAC 5880

OCGGTCTGTG GACCCTGCCA CTGTGCCGTC AGCAAAGGCT TTGATCCCGA CTGTAATAAG 5940

ACCAAOGGCC AGTGCCAATG CAAGGAGAAT TACTACAAGC TCCTAGCCCA GGACACCTGT 6000

CTGCCCTGCG ACTGCTTOCC CCATGGCTCC CACAGCCGCA CTTGCGACM GGCCACOGGG 6060

CAGTGTGCCT GCAAGOCGGG CGTCA1CGGC CGCCAGTGCA ACCGCTGOGA CAAOOOGTTT 6120

GCCGAGGTCA CCACGCTCGG CTGTGAAGTG ATCTACAATG GCTGTCCCAA AGCATlTGAG 6180

GCCGGCATCT GGTGGCCACA GACCAAGTTC GGGCAGCCGG CXGCGGXGDC ATGCCCTAAG 6240
GGAICOGTTG GAAATGCGGT CCGACACTGC AGCGGGGAGA AGGGCTGGCX GCCCCCAGAG 6300
CTCTIXAACT GTACCACCRl 1 CTCCTTCGTG GACCTCAGGG CCATGAATGA GAAGCTGAGC 6360
CGCAAXGAGA CGCAGGTGGA CGGCGCCAGG GOCCTGCAGC TGGXGAGGGC GCTGCGCAGT 6420
GCTACACAGC ACACGGGCAC GCXCXTTGGC AATGACGTGC GCACGGCCTA CCAGCTGCTG 6480 
GGCCACGICC TTCAGCACGA GAGCXGGCAG CAGGGCTTCG ACCXGGCAGC CACGCAGGAC 6540 
GCCGACXIXC ACGAGGACGT CATCCACTCG GGCAGCGCCC TCCTGGCCCC AGCCACCAGG 6600 
GCGaCGTGGG AGCAGATCCA GCGGAGCGAG GGCGGCACGG CACAGCTGCT CCGGCGCCTC 6660 
GAGQGCTACT TCAGCAACGT GGCACGCAAC GTGCGGCGGA CGTACCTGCG GCCCTTCGTC 6720 
ATCGTCAGCG CCAACAIGAX TCTTGCTGTC GACATCTTTG ACAAGXXCAA CTTTACGGGA 6780 
GCCAGGGTCC CGCGATXCGA CACCATCCAT GAAGAGTXOC CCAGGGAGCl GGAGTCCXCC 6840
GTCTCCTTOC CAGCCGACTX CTTCAGACCA CCTGAAGAAA AAGAAGGCCC CCTGCTGAGG 6900 
CCGGCXGGCC GGAGGACCAC CCCGCAGAOC ACGOGCCCGG GGCCTGGCAC CGAGAGGGAG 6960 
GCCCCGATCA GCAGGCGGAG GCGACACCCT BATQACGCTG GCCAGTTCGC CGTCGCTCTG 7020 
GTCATCATTT ACGGCACCCT GGGGCAGCTC CTGCCCGAGC GCTACGACCC CGACCGTCGC 7080
AGCCTCGGGT TGCCTCACCG GCCCATCATT AATACCCCGA TGGTGAGCAC GCTGGTGTAC 7140 
AGCGAGGGGG CICCGCTCCC GAGACCCCTG GAGAGGCCCG XCCXGGTGGA GTTCGCCCTG 7200 
CTGGAGGTGG AGGAGOGAAC CAAGCCTGTC TGCGXGTTCT GGAACCACTC CCTGGCCGTT 7260 
GGTGGGACGG GAGGGTGGTC TGCCCGGGGC TGCGAGCTCC TGTOCAGGAA COGGACACAT 7320
GTCGCCTGCC AGTGCAGCCA CACAGCCAGC TTTGCQGTGC TCATGGATAT CTCCAGGOGT 7380 
GAGAAOGGGG AGGXCCTGCC XCTGAAGAXT GTCACCTAXG CCGCXGXGXC CXXGTCACTG 7440 
GCAGCCCTGC TGGTGGCCTT CGTOCXCCXG AGCCTGGTCC GCATGCTGCG CTCCAACCTG 7500 
CACAGGATTC ACAAGCACCT CGCCGTGGOG CTCTTCCTCT CTCAGCTGGT GTXCGTGATT 7560 
GGGATCAACC AGACGGAAAA CCCGTTTCTG TGCACAGXGG TTGCCATCCT CCTCCACTAC 7620 
ATCTACATGA GCACCTTTGC CTGGACCCTC GTGGAGAGCC TGCATGTCTA CCGCATGCTG 7680 
ACCGAGGTGC GCAACATGGA CACGGGGCCC ATGCGGTTCT ACTACGTCGT GGGCTGGGGC 7740 
AT CCC BQCCA TTGTCACAGG ACTGGCGGTC GGOCTGQACC CCC&GGGCTA CGGGAACCCC 7800
GACTTCTGCT GGCTGTCGCT TCAA3ACACC CTGATTTGGA GCTTTGDGQG GCCCATCGGA 7860 
BCTGTTATAA TCATCAACAC AGTCACTTCT GTCCTATCTG CAAAGGTTTC CTGCCAAAGA 7920
AAGCACCATT ATTATGGGAA AAAAGGGATC GTCTCCCTGC TGAGGAGCGC ATTOCTCCTG 7980
CTGCTGCTCA TCAGOGCCAC CTGGCXGCIG GGGCTGCTOG CTGTGAACOG CGATGCACTG 8040 
AGCTTTCACX ACCTCTTCGC CATCTTCAGC GGCTTACAGG GCCCCTTCGT CCXCCXXIXC 8100 
CACTGOGTGC TCAACCAGGA GGTCCGGAAG CACCTGAAGG GOGTGCTCGG CGGGAGGAAG 8160 
CTGCACCTGG AGGACTCCGC CACCACCAGG GCCACCCTGC TGACGCGCTC CCTCAACTGC 8220 
AACACCACCT TCGGTGAOGG GCCTGACATG CTGCGCACAG ACTTGGGCGA GTCCACCGCC 8280
TOGCTGGACA GCATCGTCAG GGATGAAGGG ATCCAGAAGC TCGGCGTGTC CTCTGGGCTG 8340 
GTGAGGGGCA GCCACGGAGA GCCAGACGCG TCCCTCATGG CCAGGAGCTG CAAGGATCCC 8400 
CCTGGCCACG ATTCCGACTG AGATAGCGAG CTGTCCCTGG ATGAGCAGAG CAGCTCTTAC 8460 
GCCTCCTCAC ACXCGTCAGA CAGCGAGGAC GATGGGGTGQ GAGCTGAGQA AAAATGGGAC 8520 
CCGGCCAGGG GCGCCGTCCA CAQCACCCCC AAAGGGGAOG CTGTGGCCAA CCACGIXCCG 8580 
GCCGGCTGGC CCGACCAGAG CCTGGCTGAG AGXGACAGXQ AGGACCDCM3 CGGCAAGCCC 8640 
CGCCTGAAGG TGGAGACCAA GGTCAGOGTG GAGCTGCACC GCGAGGAGCA GGGCAGTCAC 8700 
CGTGGAGAGT ACCCCCCGGA CCAQGAGAGC GGGGGCGCAG CCAGGCTTGC TAGCAGCCAG 8760 
CCCCCAGAGC AGAGGAAAGG CATCTTGAAA AATAAAGTCA CCTACCCGCC GO0GCTGACG 8820 
CTGACGGAGC AGACGCTGAA GGGGGGGCTC CGGGAGAAGC TGGCOGACTG TGAGCAQAGC 8880 
CCCACATCCT CGCGCACGTC TTCCCTBGGC TCTOGCGGCC CCGACTGCGC CATCACAGTC 8940 
AAGAGCCCTG GQAGGGAGCC GGGGCGTGAC CACCXCAACG GGGTGGCCAT GAATOTGCGC 9000 
ACTGGGAGCG CCCAGGCCGA TGGCTCCGAC TCTGAGAAAC O GTGAG GCAA GCOOGTCAOC 9060 
CCACACAGGC TGCBGCATCA CCCTCAGACC TTGGAGCCCA AGGGGCCACT GCCCTTGAAG 9120 
TGGAGTGGGC CCAGAGTGTa GC0GTCCCCA TGGTGGCAGC CCCCCGACTG AICATCCAGA 9180
CACAAAGGTC TTGGTTCTCC CAGGAGCTCA GGGCCIGTCA GACCTGGTGA CAAGTGCCAA 9240
AOGCCACAGG CATGAGGGAG GCGTGGACCA CTGGOCCAGC ACCGCTGAGT CCTAAGACTG 9300 
CAGTCAAAGC CAGAACTGAG AGGGGACCCC AGACTGGGCC CAGAGGCTGG OCAGAGTTCA 9360 
GGAACGCCGG GCACAGACCA AAGACCGCGG TCCAQCCCOG CCCAGGCGGG CATCTCATGG 9420 
CAGTGCGGAC CCGTGGCTGG CAGCCCGGGC AGTCCTTTGC AAAGGCACCC CTXGXCXTAA 9480 
AATCACTTCQ CTATGTGGGA AAGGTGGAGA TACTTTTATA TATTTGTATG GGACTCTGAG 9540 
GAGGTGCAAC CXGTATATAT ATTGCATTCG TGCTGACTTT GTTATCO0GA GAGATCCATG 9600 
CAATGATCTC TTGCTGTCXX CTCTGTCAAG AITGCACAGT TGTACTTGAA TCTGGCATGT 9660 
GTTGACGAAA CTGGTGCCCC AGCAGAXCAA AGGTOGGAAA TAOGXCAGCA GTGGGGCTAA 9720
AACCAAGGGG CTAGAAGCCC TACAGCTGCC TTCGGCCAGG AAGTGASGAT GGTGTGGGCC 9780 
CTCJOCOGCCG GCCCCCTGGG TCCCCAGTGT TCQCTGTGTG TGCGTTTGTC CTCTGCTGCC 9840 
ATCTGCCCC3G GCTGTGTGAA TTCAAGACAG GGCAGTGCAG CACTAGGCAG GTGTGAGGAG 9900 
CCCTGCTGAQ GTCACTGTGG GGCACGGTTG OCACACGGCT GTCATTTTTC ACCTGGTCAT 9960 
TCTGTGACCA CCACCCCCTC CCCTCACCGC CTCCCAGGTG GCCCGGGAGC TGCAGGTGGG 10020 
GATGGCTTTG TCCTTTGCTC CTGCTCCCCG TGGGACCTGG GACCTTAAAG CGTTGCAGGT 10080 
TCCTGATTTG GACAGAGGTG XGGGQCCTTC C3U3QCCGTTA CATACCTCCT GCCAATTCTC 10140 
TAACTCTCTG AGACTGCQAG GATCTCCAGG CAGGGTTCTC CCCTCTGQA0 TCTGACCAAT 10200 
TACTTCATTT TGCTTCAAAT GGCCAATTGT GCAGAGGGAC AAAGGCACAG CCACACTCTT 10260 
CAACGGTTAC CAAACtGTTT TTGGAAATTC ACACGAAGGT CGGGCOCACT GCAGGCAGCT 10320 
GGCACAGCGT BGCCCGAGGG GCTGTGGAAC GGGXCCCGGA ACTGTCAGAC ATGTTTGATT 10380 
TTAGCGTTTC CTTTGTTCTT CAAATCAGGT GCCCAAATAA GTGATCAGCA CAGCTGCTTC 10440 
CAAATAGQAG AAACCATAAA ATAGQATGAA AATCAAGTAA AATGCAAAGA TGTCCACACT 10500 
GTTTTAAACT TGACCCTGAT GAAAATGTGA GCACTGTTAG CAGATGOCTA TGGGAGAGGA 10560 
AAAGCGTATC TGAAAATGGT CCAGOACAGG AGGATGAAAT GAGATCCCAG AGTCCTCACA 10620 
CCTGAA TGAA TTATACATGT GCCTTACCAG GTGAGTGGTC TTTCGAAGAT AAAAAACTCT 10680 
AGTCCCTTTA AACGTTTGCC CCTGGCGTXr CCXAAGTACG AAAAGGTTXT lAAGXClTOG 10740 
AACAGXCTCC TTTCAXGACT XTAACAGGAT XCTGCCCCCX OA0GTGTAAT TTTTIXGTTC 10800 
XATTXTTTTC CACGXACTCC ACAGCCAACA XCACGAGGTG XAATTTTTAA TTXGATCAGA 10860 
ACTGTTACCA AAAAACAACT GTCAGTTTTA TTGAGATGGG AAAAAIGTAA ACCTATTTTT 10920 
ATTACXXAAG ACTTTATGGG AGAQATTAGA CACXGGAGGT TTXTAACAGA ACGTGTATTT 10980 
ATXAAIGTIC AAAACACXGG AATTACAAAT GAGAAGAGTC XACAATAAAT TAAGATTTTT 11040 
GAATTTGTAC TTCTGDGGTG CIGGTXXTTC TCCACAAACA OCCCCGCCCC XCCCCAXGCC 11100 
CAGGGTGGCC GTGGAAGGOA CGGXTTACGG AOGTGCAGCT GAGCXGXCCG TGTCCCATGC 11160 
TCCCICAGCC AGXGGAACGT GCGGGAACTT TXXGTCCATT CCCTAGTAGG CCXGCCACAG 11220 






WO 03/042661 
CCTAQATQGO CAGTTTTTGT CTTTCACCAA ATTTGAGGAC TTTTTTTTTT TGCCATTATT 11280
TCTTCAGTTT TCTTTTCTTG CACTGATCTT TCTCCTCTCC TTCTGTGACT CCAGTGACTC 11340 
AGACGTTAGA CCTCTTOATG TTTICCCACT GGTCCCTGAG GCTCTGTTC 
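
The number printed at the right of each full sequence line is the position of the last residue on that line, so consecutive full lines should differ by 60. The following is a minimal sketch of how a scanned run of these indices could be checked. The function name and the sample values are hypothetical and do not come from the listing, and the first index in the run is assumed to be correct.

```python
def misnumbered_lines(indices, step=60):
    """Return (line_number, found, expected) for each running index that
    does not continue the series begun by the first index.

    Assumption: the first index is trusted and every line is a full line
    of `step` residues.
    """
    problems = []
    expected = indices[0]
    for line_no, found in enumerate(indices, start=1):
        if found != expected:
            problems.append((line_no, found, expected))
        expected += step
    return problems


# Hypothetical scan of five full lines; the fourth index was misread.
print(misnumbered_lines([60, 120, 180, 450, 300]))
# -> [(4, 450, 240)]
```

A flagged index only says that the printed number is out of series; it does not show whether residues on that line were also lost in scanning.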



A32 Protein Sequence
Gene name: CH22_FGENES.70fi; cadherin, EGF LAG seven-pass G-type receptor 1, flamingo (Drosophila) homolog (CELSR1)
Unigene number: Hs.252387
Protein Accession #: NP_055061
Signal sequence: 1-20
Cadherin domains: 250-344, 358-449, 464-556, 570-678, 692-780, 754-883, 897-990, 1004-1092, 1110-1199
Pfam domains: Laminin_EGF [2003-2048], 7tm_2 [2465-2706]
Latrophilin/GPS domains: 2407-2460
Transmembrane domains: 1219-1238, 2473-2492, 2507-2522, 2529-2547, 2616-2634, 2659-2675, 2687-2704
Cellular Localization: plasma membrane






i 
I 

MAPPPPPVLP 
PREItLDVGRD 
CGTGARLCOA 
LRIiCftLRRA 
G3LKFPMPNY 
ATGAVSTDSV 
RVRE20LEVGY 
AEYQLLVEMf 
VQATDRDQGQ 
RPPLINSSGV 



11 
I 

VLLLCLAAAAA 
GRLAGRRRVC 
liCFPVPGGCA 
AGAVKVGIiAL 
QVALFENEPA 
LDRETKETHV 
EVLTIRASDR 
DQGRUPGPLS 
HAAIHYSILS 
VSVQVLBVND 



TYQ1.TGGNTR 
DANTHRPVFQ 
SGTMYTMMEL 
IFBDAPPSTS 
VAVYNLWALA 
TRANDPETEGP 
XiVSRATVHIIj 
TFVOjGHnEIiEI* 
UMI/TNSITVR 
TFSAT1T1PGGV 
VLRFDBSAPF 
REGGYTCECF 
BVTrRSFPPQ 
VQLTFSAQET 
ECDTTWAVRF 
MBNLSVDdKN 
GKKCEQAMPH 
LQILNNYLQF 
GMDQNKADIG 
ALKVRVKDGC 
ACVRSPGSPQ 
TNGQGQCKEN 
ABVTTJjGCEV 
LFHCTTISFV 
GHVLQHSSWQ 
EGYFSNVARN 
VSFPADFFRP 
VIIYKTLGQL 
LEVEERTKPV 
ENGEV LPXiK I 
GZNQTEHPFIi 
IPArVTOIAV 

khhxygkkgi 

HCVLNQEVRK 
SIiDSIVRDEG 



TSVBITVLDV 
NR P AIi SS ORG 
SSHYTVSVSE 
DYENQVAYTL 
ILQVSATDRD 
VDRGSPTP&S 
HAQIMYQIVE 
LVDQNDNPPV 
LLLD P ATGEL 
LEKJMSQBKFL 
RGQFFPSRDL 
LSSTTVLFRP 
EDFTGEHCEV 
SFVTFRGLRQ 
TTTVAPKVPS 
GKDIGUYSCA 
VDMAGF1AKN 
PQLF3GESW 
EVSHGPBDVE 
GMLPGLTVR8 
DVBDPCTSSF 
GYVCECGPSH 
YYKLLAQtrTC 
EYNGCPKAFE 
DLRAfWJEKLS 
QGFDLAATQD 
VRRTYLRPFV 
PEBKEGPLLR 
IiPERYDPDRR 
CVFW2SKSLAV 
VTYAAVfiLSIi 
CTWAII/LHY 
GLDPQGYGDTP 
V3LLRTAFLL 
HLKGVLGGRK 
IQKLGVSSGL 



21 
I 

LPAMGLRAAA 

6aqsflplqv 
aaqhsalaap 
eaatagtpsa 

GTLILQLHAH 
LRVKAVDYST 
DSPINANliRY 
ATATVYIBVE 
GNVAGQFYltH 
HEPIFVSfiPF 

npaptpcfpf 
ndndpvftqp 
gglitijul.pl 

DRPVGTSIAT 
TXMAQDNGZP 
SGPNGRLLYT 
ASVElQVTJL 
GDMRHFFQLD 
LPDFQIIiFNH 
QLSRDLDNNR 
SPLLALFVBG 
QEQIYLNRTL 
IHPIKGLRCR 
DARSGRCANG 
RFHFTI8LTF 
GVSDGHHHSV 
AQGTQTG3KK 
GTREGCAARR 
6WSDLKIIIS 
SVMLSGLRVT 
WVGGASEDK 
CPPNSRCHDA 
YGFYCEKXLD 
liPCPCFPHGS 
AGIWMPQTKF 
RNETQVDGAR 



rVTKNMILAV 
PAGRRTTPQT 
SLRLPHRPII 
GGTGGWSARG 
AALLVAFVLL 
lYHSTFAWTL 
DFCHLSLQDT 
LLLI SATWI/L 



31 

I 

WEPRVPGGTR 
RLVARSAFTA 
TTLPACRCPP 
SPSPSPPLPP 
YTIEGEEEKV 
PPRSATTYIT 
RVLGOAHDVF 
DEHDNYPQFS 
SLSGILDVIN 
Q ATVLRNV PL. 
Q I UN S SOW XT 
TYELRLNEBA 
DYKQEQQYVIi 
L9ANDEDTGE 
QKSDTTTLEI 
FQGG3JDGDGD 
DLNDHAPMFE 
LLNGDLRAMV 
YVTWKSKSFP 
PLEALHEVSV 
VAAVLSTTKD 
LTTISTQRVti 
CPPGFTGDYC 
VCKNGGTCVN 
A < TOERNGLXili 
QVQYYNKPNI 
SLDLTGPLLli 
HFCDGRRCQN 
VPHYLGLHFR 
33GBWHKLLIE 
VSVRRGFRGG 
WEDYSCVCDK 
LPCPRGWW0NT 
KSRTCDMATG 
GQPAAVPCPK 
ALOLVRALRS 
GSALLAPATR 
DIFDKFNPTG 
TRPGPGTERE 
NTPMVSTLVY 



41 
1 

AFALRPGCTY 
LSRKURARTH 
RPRPRCFGRP 
NLPEARAGPA 
SYVMEGLFDE 
VLVSDTNDH& 
QLNESSGWS 
EQNYWQVPE 
PLDFEDVQKY 
GYPWH1QAV 
VCASLDREEV 
AVG9SVLTL0 
AVTASDGTHS 
KAHITYVIQD 
LILDANDMAP 
FYIBPTSGVI 
KDBLELFVEE 
ELDFEVRREY 
TOVIGCIPAH 
fiDGlHHVTJVF 
DVFVFT3VQND 
PFDDHICLRE 
ETEIDLCYSD 
LLIGGFHCVC 



51 
I 

AVGAACTPRA 
LPGCGARARL 
ICLPPGGSVR 



RSRGYFRIDS 
PVFEQSEYRE 
TRAVLDREEA 
DVGLKTAVliR 
SLSIKAQDGG 
DADS GEN ARI* 
EHYSFGVEAV 
ARDRDAHSVI 
ETAHVTiIfJVT 
PVPQFR1DPD 
QFLWDFYQGS 
RTQRRLDRfiS 
NNFVQSWAK 
VLWQATSAP 



RL.KVETKVSV 
LTEQTLKGRI, 
TGSAOADGSD 



ELHRBBQGSH 
REKLADCEQS 



PARGAVH STP 
RGEYPPDQES 
PT5SRTSSLG 



SLVRMLRSNL 
VESLHVYRML 
LIWSFAGPLG 
GLrLAVNRDAL 
ATLLTRSLUC 
SLMPHSCKDP 
KGDAVANHVP 
GGAARLAS9Q 
SGGPDCAITV 



GHLGLPHGPS 
GGVPHliPEDF 
GGTCVHRHNH 
TRKEDBVLME 
LKNVKEDSEH 
MQGVRMGGTP 
GYLGHJCVDA 
FVCGPCHCAV 
QCACKPGVIG 
GSVGNAVRHC 
AT03TGTLPG 
AAWEQJQRSE 
ARVPRFDTIH 
AP1SRRRRHP 
SEGAPLPRPI, 
VACQCSSTAS 
HSZHKHLAVA 
TEVRKIE*TGP 
AVIIIMTVTS 
SFHYLFAIPS 
NTTFGCK3PDM 



CTLRVTIITD 
TDVSSKILNV 
PCEKfYMKCVS 
PCOAWGRCRS 
PPGSYERPYC 
FIALEIVDEQ 
GEKKaWTVD 
PVHNRQFVGC 
YLCBCPLRFG 
ATSGGPTSFR 
KELVTMTLDY 
T20VATLKMNN 
CBLNPCEHMG 



RQCNRCDNPF 
SGEKGWLPPE 
KDVRTAYQLL 
GGTAOLtLRRIi 



DDAGQFAVAL 
KRPVLVEFAL 
FAVLMDISRR 
LFLSOLVFVI 
MRFYYWGWG 
VLSAKVaCQR 
GLQGPFVUjF 
LRTDLGE3TA 
LSLDEQSSSY 



AGHPDQSLAE 
PPBQRKGILK 



NKVTYPPPL>T 
HLNGVAMNVR 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780
840
900 
960 
1020 
1080
1140 
1200 
1260
1320
1380
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



A33 DNA SEQUENCE

Gene name: CXCR3 (G protein-coupled receptor 9), chemokine (C-X-C) receptor 3
Unigene number: Hs.198252
Probeset Accession #: X95876
Nucleic Acid Accession #: X95876
Coding sequence: 69-1175 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

CCAACCACAA GCACCAAAGC AGAGGGGCAG GCAGCACACC ACCCAGCAGC CAGAGCACCA 60
GCCCAGCCAT GGTCCTTGAG GTGAGTGACC ACCAAGTGCT AAATGACGCC GAGGTTGCCG 120

PCT/US02/36810

CCCTCCTGGA GAACTTCAGC TCTTCCTATG ACTATGGAGA AAACGAGAGT GACTCGTGCT 180 

OTACCTCCCC GCCCTGCCCA CftGGACTTCA GCCTGAACTT CQACOGGGCC TTCCTGCCAG 240 

CCCTCTACAG CCTCCTCTTT CTGCTGGGGC TGCTGGGCAA CGGCGCGGTG GCAGCCGTGC 300

TGCTGAGCCG GCGGACAGCC CTGAGCAGCA CCGACACCTT CCTGCTCCAC CTAGCTGTAG 360 

CAGACACGCT GCTGGTGCTG ACACTGCCGC TCTGGGCAGT GGACGCTGCC GTCCAGTGGG 420

TCTTTGGCTC TGGCCTCTGC AAAGTGGCAG GTGCCCTCTT CAACRTCRAC TTCTACGCAG 480 

GAGCCCTCCT GCTGGCCTOC ATCAGCTTTG ACCGCTACCT GAACATAGTT CATGCCACCC 540 

AGCTCTACCG CCGGGGGCCC CCGGCCCGCG TOACCCTCAC CTGCCTGGCT GTCTGGGGGC 600 

TCTGCCTGCT TTTOGCCCTC CCAGACTTCA TCTTCCCGTC OGCCCRCCAC GACGAGCGCC 660 

TCAACGCCAC CCACTGCCAA TACAACTTCC CACAGGTGGG CCGCACGGCl CTGCGGGTGC 720 

TGCAGCTGGT GGCTGGCTTT CTOCTGCCCC TGCTGGTCAT GGCCTACTGC TATGCCCACA 780 

TCCTGGCCGT GCTGCTGGTT TCCAQGGGCC AGCGGCGCCT GCGGGOCATG OGGCTGGTGG 840 

TGGTHOTOGT GGTGGCCTTT GCCCTCTGCT GGACCCCCTA TCACCTGGTG GTGCTGGTGG 900 

ACATCCTCAT GGACCTGGGC GCTTTGGCCC GCAACTGTGG CCGAGAAAGC AGGGTAGACG 960 

TGGCCAAGTC GGTCACCTCA GGCCTQGGCT ACATGCACTG CTGCCTCAAC CGQCTGCTCT 1020 

ATGCCTTIGFT AGGQOTCAAG TTCCGGGAGC GGATGTGGAT GCTGCTCTTG CGCCTGGGCT 1080 

GCCCCAACCA GAGAGGGCTC CAGAGGCAGC CATCGTCTTC CCGCCGGGAT TCATCCTGGT 1140 

CTGAGACCTC AOAGGCCTOC TACTOGGGCT TGTGAGGCCG GAATGOGGGC TCCCCTTTCG 1200 

CCCACAGXCT GACTTCCeCQ CATTCCAGGC TCCTCCCTCC CTCTGOCGGC TCTGGCTCTC 1260 

CCCAATATCC TCGCTCCCGG GACTCACTGG CAGCCCGAGG ACCACCAGGT CTCCCGGGAA 1320 

GCCACCCTCC CAGCTCTGAG GACTGCACCA TTGCTGCTCC TTAGCTGCCA AGCCOCATCC 1380

TGCOGOOCGA GGTGGCTGCC TGGAGCCCCA CTGCCCTTCT CATTTGGAAA CTAAAACTTC 1440 

ATCTTCCCCA AGTGOGGGGA GTACAAGGCA TGGOGTAGAG GQTQCTOCCC CATGAAGCCA 1500 

CAGCCCAGGC CTCCAGCTCA GCAGTGACTG TGGCCATGGT CCCCAAGACC TCTATATTTG 1560 

CTCTTTTATT TTTATGTCTA AAATOCTGCT TAAAACTTTT CRATAAACRA GATOGTCAGG 1620 

ACCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA BKCLITERAT 1680 

UREBHCBLAS TFBHCORTHO L0GOESHOUS ECXCRMYLEV SERQVLDASD FAPELEHSTS 1740 

PYDYQEKESD FSDSPPCPQD FfiUTFDRTFL PALYSLLFLIi GIZLGNGAVAA VLLSQRTAL6 1800

STDTFLIiHIiA VADVLE.TCTL PLWAVHAAVQ WVFGPGLCKV AGALFHIHFY AQAFMACIS 1860

FDRYTjSJVHA TQIYRRDPRV RVALTCIVVW GLCLLFALPD FIYLSANYDQ RLNATHCQYN 1920 

FPQVGRTALR VLQIiVAGFJjIj PLLVMAYCYA HILAVI^VSR GQHRFRAMRIi VWWAAFAV 1980 

CWTFYHItWIj VDILMDVGVI* A2NCQRKSHV DVAKSVTSGK OYMHCCUTPL ItYAFVGVKFR 2040 
EQMWMLFTRL GRSDQRGPQR QPS8SRRESS W9ETTEASYL GL 



A34 Protein sequence
Gene name: CXCR3 (G protein-coupled receptor 9), chemokine (C-X-C) receptor 3
Unigene number: Hs.198252
Protein Accession #: P49682
Signal sequence: none found
Pfam domains: 7tm_1 [70-318]
Transmembrane domains: 57-78, 92-113, 129-147, 169-190, 222-243, 257-275, 307-323
DRY box: 148-149
Cellular Localization: plasma membrane

1 11 21 31 41 51

| | | | | |

MVLBV8J3HQV LHDAEtfAALI* ENFSSSYDYG ENESDSCCTS PPGPQDFSLN PDRAFliPALY 60 

SLLFLIjGUjG NGAVAAVLLS RRTAltSSTDT FLLHLAVADT LLVLTLFLWR TOAAVQWVFG 120 

SGLCKVAGAIi FWINFYAGAL LLACISFDRY LNIVHATQLY KRGFFARVTI* TGLAVWGLCL 180 

It F ALPDF I FL SAHHDBRLNA THCQYKFPQV GRTALRVLQli VAGET&P&bV WAYCYAEILA 240 

VLIiVSRGQRR LRAMRLVVW WAFAIjCKTP YHLWIiVDIIi MDLGALARNC GHESRVDVAK 300 

SVTGGLGYMH CCLMPLXiYAF VGVKFRKRMH MLLLRLGCPN QRGLQRQPSS SRRCSSWSET 360 
SEASYSQL 
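
A coding-sequence range in a DNA entry can be checked against the length of the paired protein entry. The sketch below assumes the range is 1-based, inclusive, and includes the stop codon, as the "start and stop codons" note in the DNA entries suggests. The function name is hypothetical and is not part of the listing. For A33, the range 69-1175 spans 1107 nucleotides, that is 369 codons, or 368 residues plus the stop codon; this agrees with the A34 listing, whose last running index is 360 and whose final line holds 8 residues.

```python
def expected_protein_length(cds_start: int, cds_end: int) -> int:
    """Residues encoded by a CDS given as 1-based inclusive coordinates.

    Assumption: the range ends with a stop codon, which is not counted
    as a residue.
    """
    n_nt = cds_end - cds_start + 1
    if n_nt < 6 or n_nt % 3 != 0:
        raise ValueError(f"range {cds_start}-{cds_end} is not a whole number of codons")
    return n_nt // 3 - 1


print(expected_protein_length(69, 1175))  # A33 coding sequence -> 368
```

A mismatch between this number and the residue count of the protein entry points to a misread coordinate or to residues lost in scanning, not to a particular position.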



A35 DNA SEQUENCE
Gene name: Differentially expressed CO16 gene (clone MGC:5257)
Unigene number: Hs.69517
Probeset Accession #: AA447522
Nucleic Acid Accession #: BC001291
Coding sequence: 44-541 (start and stop codons are underlined)

1 11 21 31 41 51

| | | | | |

GGOGGCGCCG CGCGCTOACC CTCCCTGGGC ACCGCTGGGG ACGATGGCGC TGCrfCGCCTT 60 

GCTGCTGGTC GTGGCCCTAC CG0GGGTGTG GACAGACGCC AACCTGACTG CGAGACAACG 120 

AGATCCAGAG GACTCCOU3C GAACGGACGA GGGTGACAAT AGAGTG1GQT QTCATGTTTG 180 

TGAGAGAGAA AACACTTTCG AGTGCCAGAA CCCAAGGAGG TGCAAATGGA CAGAGGCATA 240 

CTGOGTTATA GCGGCCGTGA AAATATTTCC ACGTTTTTTC ATGGTTGCGA AGCAGTGGTC 300 

CGCTGGTTGT GCAGCGATGG AGAGACCCAA GGCAGAGGAG AAGCGGTTTC TCCTGGAAGA 360 

GCCCRTGCCC TTCTTTTACC TCAAGTGTTG TAAAATTCGC TACTGCAATT TAGAGOGGCC 420 

ACCTATCAAC TCATCAGTGT TCAAAGAATA TGCTGGGAGC ATGGGTGAGA GCTGTGGTGG 480

GCTGTGGCTG GCCATCCTCC TGCTGCTGGC CTCCATTGCA GCCGGCCTCA GCCTGTCTTG 540 

AGGCACGOGA CTGCCACAGA CTGAGCCTTC CGGAGCATGG ACTCGCTOCA GACCGTXGTC 600 

ACCTGTTGCA TTAAACTTGT TTrCTGTTGA TTACCTCTTG GTTTGACTTC CCAGGGTCTT 660 

GGGATGGGAQ AGTGGGGATC AGGTGCAGTT GGCTCTTAAC CCTCAAGGGT TC3MTAACTC 720 

ACATTCAGAG GAAGTCCRGA TCTCCTGAGT AGTOATTTTG GTGACAAGTT TTTCTCTTTG 780 

AAATCAAACC TTGTAACTCA TTTATTGCTG ATGGCCACTC TTTTCCTTGA CTCCCCTCXG 840 

CCTCTGAGGG CTTCAGTCATT GATGGGGAGG GAGGCCCRAG TACCACTCAT GGAGAGTATQ 900 

TGCTGAGATG CTTCOGACCT 7TCRGGTGAC GCAGOARCAC TGGGGGAGTC TGRATGATTG 960 

GGGTGAAGAC ATCCCTGGAG TGAAGGACTC CTCRGCATGG GOGGCAGTGG GGCACACGTT 1020

AGOGCTQCCC CCATTCCAGT GGTGGAGGCG CTGTGGATGG CTGCTTTTCC TCRACCTTTC 1080



80 



CTACCAGATT CCAGGAGGCA GAAGATAACT AATTGTGTTO AAGAAACTTA GACTTCACCC 1140 

ACCAGCTGGC ACAGGTGCAC AGATTCATAA ATTCCCACAC GTGTGTGTTC AACATCTGAA 1200 

ACTTAGGCCA AGTAQAGAGC ATCAGGGTAA ATGGCGTTCA TTTCTCTGTT AAGATGCAGC 1260 

CATCCATGGG GAGCTGAGAA ATCAGACTCA AAGTTCCACC AAAAACAAAT ACAAGGGGAC 1320 
TTCAAAAGTr CACGAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 



A36 Protein sequence
Gene name: Differentially expressed CO16 gene (clone MGC:5257)
Unigene number: Hs.69517
Probeset Accession #: AA447522
Protein Accession #: AAH01291
Signal sequence: 1-17 (first underlined sequence)
Transmembrane domain: 146-162
Cellular localization: plasma membrane

1 11 21 31 41 51

| | | | | |

MALLALLLVV ALPRVWTDAN LTARQRDPED SQRTDEGDNR VWCHVCEREN TFECQNPRRC 60
KWTEAYCVIA AVKIFPRFFM VAKQWSAGCA AMERPKAEEK RFLLEEPMPF FYLKCCKIRY 120
CNLEGPPINS SVFKEYAGSM GESCGGLWLA ILLLLASIAA GLSLS



A37 DNA SEQUENCE
Gene name:
Unigene number: Hs.293616
Probeset Accession #: AW043782
Nucleic Acid Accession #: none found
Coding sequence: 38-1075 (underlined sequences correspond to start and stop codons)



1 

! 

AGCAACGACG 
CCTGCTGCTG 
GTGCAACATA 
G1X51X3ACGC3G 
GTCGAAATGT 
CTTCCGGTQC 
AAACCCTCTG 
GAGCTTCATC 
AAGTTCTCAA 
TTACCCCAGC 
CCTGCTGGCA 
GCACCGGCTG 
CTGCAAOGTC 
GAATGCGTCG 
TGCGTGGTAT 
CGACCTGCCC 
CAOCAGCCTC 
GGGCACTGCT 
AGTFATTCCA 
TGCTCATGGG 
AACTATCXCT 
TGACATGATC 
CACCCTCATT 
AAATAGGCTG 
CGCTGGAOCC 
ATQATCTAAC 
ATCAAAACCT 
AAGAAftACTT 

aaggactctg 
CTCATTCtGA 
GAGCCCCTCC 
TACACCTGCC 
ACCTSCCCGT 
GTATGTCCCT 
CTCCAAAGTT 
ACTGGTTTCT 
CXGCACTGTG 
GGTCAOGGTC 
AQACAATTTG 
TGAAACAGTG 
AGCTGTCTCT 
ACACCCTTGC 
ACATTTGTGC 
AGAGGGACTC 
TTCTCTGTGT 
AGGTGTTGTT 
CCACTCCGGG 
AACCTGTTTG 
TGATCCTGTT 
AAGCIAGCCA 



11 
I 

CCGGQCAaca 
AGCAGCGGCG 
CCAGGCAACT 
CTGCCTGACT 
GGCCCAACCT 
AATGGGTTTG 
CtTTGCTCCA 
TGOGATGGAC 
GAACOCOGCA 
ATCACCTATG 
CJX3GTCTTGC 
CAGCACCCTG 
ACCTACAACG 
GAAGTAGGCT 
GACCTTCCTC 
CCCTAOCGCT 
CTGAGCGTGG 
GAGCCCAGGG 
AAGTCCATAT 
AAGCTCTTTA 
GCATTCCCCT 
TQTTGTGCGT 
TTTCACATTA 
G6AOAGAGCA 
AATTCTCTCT 
CAGGAGGCCA 
GCTTTGCACA 
TGGACGTGAG 
AAAOCATCTA 
GAGCTTTCCr 
CATGAGTTTA 
CTGGCTCTAC 
AGCCAAGGAA 
GTGGCCCACA 
CCCTTAACAC 
ATCACAGGTG 

CACacrccTc 

AGGCCTCTCC 
GAGTCAAGAT 
TGTTTGTTTT 
TTTTTTGTTT 
CCCGCTGAGC 
ATTGTTGCAC 
CTCTCTCCCT 
CCAGTCAGCC 
TGGCAAGAAA 
CAGCTGTCAC 
ACGCTAATTA 
CTGTAGACTT 
CTGGTATTIT 



21 
I 

GGAGCGGCGG 
CGGAGAGCCA 
TCATGTGCAG 
GCTTCGACAA 
TCTTCCCCTG 
AGGACTGTCC 
CCGCCCGCTA 
AGAATAACTG 
GTGGGCAGGT 
CCATCATCGG 
ACCACCAGCG 
TGCTGCTOTC 
TCAATAATGG 
CCCCACCCTC 
CACCGCCCTA 
CCCGGTCCGG 
AAGACACCAG 
ACTCTGAGCC 
GGGTTAATCT 
AGCACCXGTA 
CCTCCCCCAG 
CTTTTCTGTC 
TTCTGTTTCT 
ATGTTTCTGT 
GCTGGOTAGT 
TCACTGGATG 
AT C CTATTTG 
TAACACCCTT 
CCCTGTATAA 
CAGCAGCATA 
TCCAAGTTCT 
AGCCACTTAC 
TGAGGACCTA 
CCCAQCCTGT 
TTGCAAAGTC 
AflAGCCATGT 
TTCCCAAGGT 
CAACATCCCA 
TTTCCATTTG 
TTCCCTTTCTA 
TTCCTTTAAC 
CCCGTGATAA 
TTTGAGGTTA 
CCGTGTATAG 
ACAGGGCCOQ 
CCACACTGAC 
CCATICAGAA 
AAACAGAGCC 
TTCTrrCTTT 
GTTTTGTTTA. 



31 
I 

CCQCBCCATG 
GCTGCTCCCC 
CAATGOACGG 
GAGTGATGAG 
TGCCAGCGGC 
CGATGGCAGC 
CCACTGCAAG 
TCAAGACAAC 
GTTTQTGACT 
CAGCTCCGTC 
GAAGOGGAAC 

CATCCAGTAT 
CTACTCCGAG 
CTCTTCTGAC 
GAGTGCCAAC 
CCACAGCCCG 
CAGCCAGGGC 
GCTCTGACTT 
AQQATGTCTC 
ACtrtCAOAGA 
AGGTCACTCT 



41 
1 

TGGCTGCTGG 
GGGAACAACT 
TGCATCCCGG 
AAGGAGTGCC 
ATCCATTGCA 
GATGAAGAGA 




GCTATATTGG 
TACCT TATAG 
GTCACCCCCC 
ATGCCCCCAG 
CAGCAGTCGC 
AT7CTGGCTT 
TATCAXCAGC 
CAGCTCCTAA 
CTGGTTTCTG 
ACTTGAGTTG 
CTTGCTCATT 
CTTTTTACCT 
TCAATAGCTC 
CCCAATACCA 
GTAGfTTCTC 
GATCTATTTT 
GTTAAQGGAC 
AAGGTCCAAA 
CAAGTCACTC 
TTATTTATCA 
TCTCTATQTT 
CCTCCCTGCA 
TGATGAGGGG 



AGTGATGAGG 
TCAGAGAACC 
ATTTTTGTGC 
AACCTCATGA 
GTCCTGQACC 
GTGGCCAGCC 
GOCTTOCTGG 
ACGGAATCTC 

agtgocagct 

GGGCAGCCTG 
ACTGAAGAAG 
GTTGCCATTC 
AAGTTACAGT 
TGTTTTTCTG 
TCCCTTGGGA 
CAGCATATAA 
ATGCTCAGAA 
GATTTGGGGA 
CAAAAAAATT 



TGCAGGAAGT 
TTTTAACCAA 
AAAAAAAAAA. 



AAOGTTATTT 
TAGAAATTTG 
CTCATCCXAA 
AATGCAGGCT 
GACTGTCACC 
GCCC&AAGTC 
CATGCAGCCT 
GTGCATTTGG 

cagcaagctc 
gcacctctag 
ctctgagaca 

AAATCTTTTA 
TATTTATATG 
GAAAGATGCA 
CAGACTAACC 
AGTTCTTGAA 
TGTGCTAGTT 
GGAATAAGGG 
TAAAATGGAA 
CAGCTGAAGA 
GGGGCTAAAG 
ATCCAAAGGA 
GAAAGAAAGA 



CCAAGGCTAA 
TCAXTGGTCG 
ACTGCACAGC 
GTATTGACAA 
AAAGCTGTGA 
AACTTGTGTA 
TGGTGGTGGC 
CGCTGCCOGT 
ACCCCCAGCA 
AGGCGGAQCA 
ACCAGAGGCC 
XGAACCAAGC 
CCCAGGCAGC 
GCCCCCAGGA 
TATAAGTCCC 
TAACAATl'TG 
TTBQGATATr 
GCGTCTCAGT 
CCCGAGATCA 
AACAGTATTG 
GTGCAOGAGA 
TTTGGGTTAG 
CCATTTGAGC 
TCAGTGGCCA 
TGGTTTTGTG 
CCCAAGAATG 
AATAGGCAGG 
GCCAAGACCC 
CTCCCAGCTG 
TQACCTGGCT 
CAACACTGGC 
ACPTGAGGAC 
TCCTGGCTCC 
TTAGAGTTAG 
CATGGGCAAG 
GAAATGCATT 
TGTATAGGAA 
AAAGQAGATC 
TGTGTGCCAG 
GGAAGCAGAA 
TTTCTTTTTT 
GTAAAACOTT 
CCAGGTAGAG 



TGGCATTCAG 
TGTTACAGAA 
AAGAAAGAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260
1320 
1380 
1440 
1500 
1560 
1620 
1680
1740 
1800 
1860
1920 
1980 
2040 
2100 
2160 
2220 
2280
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



AACGGAAAGG
CTGAQrAATC 
CATTTCATCT 
TTTCTGOTGC 
GAGTTAATCT 
AGATAAGGGA 

tgattttttt 
tttttggggg 
ttxttaactc 
aaaatagtct 
agttacack3 
tttttaaaaa 

GAAAGGTTGT 
TTATACTTTC 
AAMMAAAAAC 



GAAACCCTGG 
GGAAACOGCA 
GGACTCAACC 



AACCTAGCTG 
CAATAAAGAA 
CCTGTGAGTC 
TCTGGAAGTT 
CACTCGCITT 
TGCCXACTAA 
AATGAATGTT 
GAGGGGGTTT 
ATTCCAACCA 
CATCTCTTTT 
TGATGACTGG 
ATGCAACTAA 
GTGTCGTTGC 
TAATAAATTT 
AWYWTTGGGG 
GGGCCCACGT 
CCAAGAAAAA 
GAGTGTTGCG 
AGGAGGACCC 



CCTGTATCTT 
CTTTTGATQA 
AGAAGGQCTT 
GTTTAGAGGA 
TCTGCTTCCA 
TGCTTTTTTA 
TTTAAAAATA 
OTTTTCCAAC 
GGAAGCTTTT 
TTTCTCAAAT 
CCTATTAOCT 
GTGGTTAATA 
TTTTTGTGTT 
GCAGTTTCAT 
GGGCTTGGGC 
AGGTACGGCO 
GGTGGCGAGA 
TAAACCACAC 
AAGGGAACCC 



TCATTTTTAA 
CAGCCAGAAT 
TATTTCTCCC 
AAGAATTCTA 
GGCATCTTAG 
AAACAAACAG 
TATAAATAGG 
TCAAGATGGC 
TTATACATTG 
GAGATCCGTG 
GACTCAGCTC 
GTGTGTGACG 
TTGGTTAGGC 
TCTTTCTGTT 
CTCGGAAAAA 
ACCACGQ3GG 
ATTCTCCACA 
CCGAAGAGAG 
GATAGAGTAC 



AATABCACTT 
GTGTTAGAAC 
TTTGATGGGG 
ATTTTAATTA 
GAAAAACAAA 
QQACATTTTT 
ACACCAAAGC 
ACATTAGTGG 
CCTAAATCTA 
TTTTATTTTA 
CCTCTACCTT 
CTCAAAGTTA 
TTGGTTTTGT 
TGTGCAAAWG 
GTTTTTAACA 
CCCAAACGGG 
CCAGAAAAAA 
AACTCAGAAG 
G 



GAGTTATTTT 
TCTGGCTGAA 
CCCCTTCTTC 
ATTGCGCAQT 
TGGTTTTAGT 
ATTATAGATT 
GGO^GGGTTT 
CCftGCAATAT 
CGCCAACCAG 
GCATTAAATT 
GAAATTGACA 
ATGTAAACTG 
TTTTTAATTT 
GWMCTAMARM 
CCACTTCGGG 
ACCCCAGAAG 
ACGCGCCGGG 
CACACAAGCG 



3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780
3840
3900 
3960 
4020 
4080 



A38 Protein sequence
Gene name: ESTs
Unigene number: Hs.293616
Probeset Accession #: AW043762
Protein Accession #: none found
Signal sequence: 1-17
Transmembrane domains: 169-131
LDLa domains: 28-66, 70-108, 112-149
Cellular Localization: plasma membrane



| | | | | |

MWLLGPLCIJi LSSAABSQLL PGNNFTKECW IPGNFMC3NG RCIPGAWQCD GliPDCFDKSD 60 

EKECPKAKSK CGPTFFPCAS GIHCIIGRFR CNGFEDCPDG SDEENCTANP XiL CS T AR YTIC 120 

KNGLCrDKSF 1CDGQNNCQD BTSDEESCSSS CEPGSGQVFV TSENQDVYYP SITYAIIGSS 180

VIPVLWALL AIjVLHHQRKR KHLMTIiPVHR LQHPVLItSHIi WltDEPHHOKT VTYNVNNGIQ 240 

YVASQAHQHA SBVGSPPSYS EAIiLDQRPAW YDLPPPPYSS DTSSLNQADL PPYRSRSGSA 300 
NBA3S0AASS LLSVEDTSHS PGQPGPQEGT AEPHDSEPSQ QTEEV 



A39 DNA sequence
Gene name: ESTs
Unigene number: Hs.128899
Probeset Accession #: AA983251
Nucleic Acid Accession #: AA983251
Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons)

1 11 21 31

| | | |

ATGC TGTCTG GCTTCTTGAT QAGTCCCAGT ACCCAGCACA 
GGAAAGAAAC TTOOGTGGGA GGCTTCCATC GGTQCGCACA 
GAOCGGGAGA GGGAOAGCCG GCOGGAGGCT GCCGGOCTCC 
GGGGAGGCGG AGAAGGGGAA CCGGGQCGAG COGCCOGCCT 
CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGACttGCGG 
CGCCTGCGTC CXGGAOGTTC CCGGGGGAGG GTCCGGTTGC 
TCOSGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA 
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG 
GCTOCTGGAC CTAGGGOCCG GCGTCGTCGC CTCCTGGGCG 
CCGCGCGGAA AGCGOCGCGG GACAGTCAGT GACGAGGCCC 
CTTCTCGGAG ACCGTOCTGC GCTCTCTGGA GACGCGCTGT 
TGTGGGGCGC TCGCCGCTCG TOCGTCTCCT CATCCTGGAA 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA 
TGGCTGGACG CGCAGGOCGT CTGGOGCATC GGCTTCCAGT 
GGOGACGCCA CCATCTGCTG CBGCAGCTGC GCGTTGCGCT 
GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC 
CGGGOGGACA AAGAGGGGCC CCGACGGCTC GGCAGGGCTT 
GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC 
TCCCGGAAAG GAAGGCAGCT CCTCAGGGCT TTGCOOGGGC 
GGCGGA3T0C CATCTTCTCC ACGOGGCGQC CCCTCTCCOC 
ATCTACGTGC CGTTCCTCAT TGTTGGCTC3C GTGTTTGTCG 
GTGG1GGCAG CCTGTTGCTG CAGATGTCTC GGGCCTAAGC 
GCCCCAGGGG GTAACCGCTT gatggagaoc ATCCCCATGA 
CGGGGGTCGT CCTCACGCCA GTCCAQCACA GCTGCCAGTT 
GGGGCCCGGG CQCCOCCAAC AAGGTCACAG ACCAACTGTT 
AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC 
CAGATUGTGC CACATCAAGG GCAGTATCTG CAtTCCCCCAT 
CAOGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG 
AQGCAGATTC AGTCCCCCTT CCCTCAGATTC AACAGTGAAC 
ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA 
GATTCTCGAG GTGGAAGTCC GCACATGXCG GTGGTATTTA 
GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT 
CRTCCAQGGT ATCATTOATT TATGATGGAA AACCGGCCTC 
TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC 
AAGAATTTTA TAAACTGATA AATTAAGGAT TTTlATTATG 



sequences 
41 
I 

GAGCACAGTA 
CCTCCCGAGG 
TGTGGGAOOG 
GQATCCGCGC 
CTGGGGGCGC 
CAGTGAAACC 
GATTTCCATC 
AGGACGGGGA 
TCGCGGCAGA 
GGGGGTCGOC 
CCGCGCCCAG 
CGCOGCTTCG 
GCGGCGAGTA 
GTCGCGAQCG 
ACTGCTGCTC 
AGGGCGCTQG 
CATGTCTTAG 
AGCGGTGCTC 
TGCTGCCGGG 
TGCAGCGGCC 
CCTTTRTCAT 
AGGATCCCCA 
TCCCCAGTGC 
CCAGCTCCAG 
GCTTGCCGGA 
TGAACTGTCA 
ACGTGGGGTA 
ACGGCCTGCA 
AGAAGATGTA 
GGGAGAGGAA 
TGGCAGGATT 
TAGCATTTCT 
AGCTGGAGAT 
CCTTGAGATA 
TTGXXATTAT 



correspond 

51 

] 

CACTCCCGGA 
GCGAGGCAGC 
CGCTGCAGCC 
CCAGCAGCAG 
GCAGGACCCT 
TCCAGAGGCT 
AGCGAGTGCA 
TGGCTTAGGA 
GGGGAGTGGC 
GGGGCCACGA 
GGTGGTGCCA 
CTGCTGCAGC 
CTGCCACGGC 
CTTCGACGGC 
CAGCGCCGAG 
CGAGCCTGGC 
GGGTACCCAA 
CCCTGAAGGC 



to start and stop codons) 



CGCCTTGCCC 
CTTGGGGTCC 
GCAGAGCCGA 
CAGCACCTCC 
CGCCAACTCC 
AGGGACCATG 
GCAGGCCACC 
CACGGTGCAG 
GCCTGGCTAC 
CCCAGOGGXG 
GGCAGGGGTG 
CCTTTGGATG 
GGATATGTTT 
GACXGTQATG 
TGGTTGACGA 

tatttctttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040
2100 






TTGXTGTTGA 

tttttttttt 

TGGTGCGATC 
AGCCTCCCAC 
TGTAGAGATG 
TCTGCCTGTC 
TQAGCCTTTT 
ATTCTAAAAG 
GGCACACCTT 
GGGCCTATTT 
TTAAAAAGAG 
TGGGATGATT 
TTGAT TGT AT 
TATATTGAAA 
TQGTTATGGT 
GAGTGCAAGT 
AAAAATTTTC 
TAGCTCCTCA 
GCAGATCATA 
GATTGTTAAG 
ATTTCCTCTC 
CAAATTCAAG 
ATGTGCAAGT 
GTATGGTATT3 
TGTTTCATGA 
TTTTTTAATA 
TGAAAAAAAA 



CTGCACAGGA 
TTTTTTTTAA 
TCGGCTCACT 
GTGGCTGGGA 
GGGTTTCACC 
TCAGCCTCCC 
TTTTTTTCTA 
GAAACCTGTT 
AATTrCATTG 
TGTGCPTTTT 
ACTGAAATAA 
TTAATCTTGA 
CTTAATTTTC 
TCATAAACTA 
TTGGCGTTTC 
AATTACTATA 
TTGTCTTAGT 
GCCATAACCT 
AGGCTTTTTA 
AGAAAAGCTT 
TCTTTCAQGC 
TGAATTTATT 
ACATTTCCAA 
GTCTTCTACA 
CTGAATAATG 
ACCAAAGGCA 
AAAAAAAAAA 



TCAAAATQCC 
TCAGACAGGG 
GCRACTTCAQ 
TTACAGGTGC 
ATGTTGGCTG 
AAAGTGCTGG 
ATGCATCCAA 
TGAACTCTGT 
TAAAAAGATA 
TACCTTATGT 
ATTGTATAGT 
AGTAGTAGGT 
TGGCTTTAAG 
TCACCCGCTG 
CTTCTGTTTG 
CTTGTAAATG 
TATAAAAATT 
GAGACTTGGG 
TACTCTTGTT 
TTCAACGAAG 
CTCAACAGGC 
TGTGTGTTCT 
CTGCTAGCAC 
CATTTATGTC 
TAAAACCAGT 
GGGGAAAATC 
AAAAAAAAAA 



TGTTATCTCC 
TCTTGCTCTG 
CCTCCXOOAT 
CTGCC5CCCAT 
GGCTGGTCTC 
GATTACAGGC 
GGTTAAGGGG 
GAGATCAGTC 
TATATAT TTT 
AGAGATCTTA 
TACTTAACTA 
GGTATAGTCA 
GTGACATCTG 
CTTCTCTGAQ 
GTTTTCAGAG 
AAGATCAGTA 
CAAAGAAATG 
ATGAAATTTA 
ATCAAAATGG 

GATTGCCrrrr 

ACTGTATTCA 
T TACT TAT AT 
AACCAGTATT 
TATAGATATC 
GTTGGCAATT 
ATT TTACTT A 



CTTTTACTGG 
TTGCCCAGGC 
TCAGGCAACA 
GGCTAATTTT 
ACTCTCCTGA 
GTGAGCCACC 
AAGACGCAAA 

ATCAGrerCA 

GTCTATTTTT 
TTACAAAGTG 
ATGAAGACAT 
TAAAACCATT 
AGAGGTAATG 
TT ACT T TTAA 
CCCCATGTCT 
TTTCTGCCTA 
TGTTACAAAG 
AACCAGATAC 
CTTATTTTTC 
CTTCTCCCAC 
TTGCCAATGT 
AAAAAAAGAT 
TTGTAATTAA 
TATCGATCRT 
GGTATCATCA 
TTAATAAATA 



GACTTTTTTT 
TGGAGTGCAG 
CTCCTGCCTC 
TTGTATTTTT 
CC7TCAAGCAA 
GCCCOCAGCC 
TAACAGGACT 
BT ATT C CACA 
GTGCTTTTQG 
ATTTTCTACA 
TTCAGAACTC 
CATCCCCTTC 
CATTCTTTTT 
TTTTGCCTTG 
ATATAGTCCT* 
GATCTGATAA 
ATACTTAGTA 
GATTTACTTT 
AGGCACTAAG 
ACTGTTCTTG 
TCCAAATTAT 
AACTTTAAGG 
ACAAATCGCT 
CTTTCTATTC 
ATGATACTCA 
TTTTATGATG 



A40 Protein sequence



2160 
2220 
2280
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880
2940 
3000 
3060
3120 
3180
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 



Gene name: ESTs
Unigene number: Hs.128899
Protein Accession #: none found
Signal sequence: 1-11
Transmembrane domains: 402-424
Cellular localization: not determined






SGRQPRGPSD 
FRGKRRGTVS 
CCWLRCttREG 
ARLDQGGCDN 
SPKGRQLLRA 
LVAACCCRCI* 
GARAPPTRSQ 
HDSVPMTAVP 



11 
I 

TOERAQYTPG 
PPAWIRAQQQ 
CIPRFPSASA 
DRARG9P0PR 
RGPSGHYCHG 
DRQQGAGEFG 
FPGLLPRARR 
RPKQDPQQSR 
TNCCIiPBGTM 
PFMDGLQPGY 



21 
1 

GKKIiPWEASI 
PRPPPAGQAP 
THKAVPKGTG 
l*LGDRPALSG 
WUJAQGVWRI 
RADKDGPRRL 



APGGNRLMET 
NNVYVHMPTN 
EQI QSPFPHT 



GTAAGGAQDP 
PPAEDGDSLG 

haijSaphwp 

GFQCPERFDG 
GRASCLHQTQ 
PSPLQRPALP 
IPNIFSASTS 
FSVLNCQQAT 
NSBQKMYPAV 



41 
1 


51 




DRERESRPKA 


1 

AGLTjHDRAAA 


60 


RLRPGRSRGR 


VRLPVKPPEA 


120 


APGPRARRRR 


LLGVAAEGSG 


180


CGALAARPSP 


HPGTPLRSCS 


240 


GDATICCG3C 


ALRYCCSSAE 


300 


GDGEGAPPPV 


RAWQHCSPEG 


360 


1 WPFIiTVG 8 


VFVAFIILGS 


420 


RGSSSRQSST 


AAE6SSSANS 


480 


QIVPH&GQYIi 


HPPYVGYTVQ 


540 



A41 DNA SEQUENCE
Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]
Unigene number: Hs.100686
Probeset Accession #: AA487468
Nucleic Acid Accession #: AA487468
Coding sequence: 55-555 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

CGQCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60
CTACACTCAG CTTTGGGTCT [missing] CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120
GCAATAAAAA AGGAAAAGAG GCCTCCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 180
ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240
ATGGTTATTC ATCACCTGGA GGATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300
CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360
GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 420
GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480
TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540
CAGTCAGAGC TATAAGAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600
GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTA7ATAGA TTTTGTAATA 660
TTACTATTTA GTTTTTTTAA TGTGTTTQCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA



A42 Protein sequence
Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]
Unigene number: Hs.100686
Probeset Accession #: AA487468
Protein Accession #: none found
Signal sequence: 1-23
Transmembrane domains: none found
Cellular Localization: secreted






1 11 21 31 41 51

| | | | | |

MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEE GLFYAQKSKK 60
PLMVIHHLED CQYSQALKKV FAQNEEIQEM AQNKFIMLNL MHETTDKNLS PDGQYVPRIM 120
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL



A43 DNA SEQUENCE

Gene name: ATPase, Ca++ transporting, type 2C, member 1
Unigene number: Hs.106778
Probeset Accession #: N51919
Nucleic Acid Accession #: AF189723
Coding sequence: 1-2712 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

ATGATTCCTG TATTGACATC AAAAAAAGCA AGTGAATTAC CAGTCAGTCA AGTTGCAAGC 60 

ATTCTGCAAG CTGATCTTCA GAATGGTCTA AACAAATGTC AAGTTAGTCA TAOQOGAGCC 120 

TTTCATGGCT GGAATGAGTT TGATATTAGT GAAGATGAGC CAtTTQTGGAA GAAGTATATT 180 

TCTCAGTTTA AAAATCCCCT TATTATGCTG CTTCTGGCFT CTGCAGTCAT CAGTGTTTTA 240 

ATGCATCAGT TTGATGATGC CGTCAGTATC ACTGTGGCAA TACTTATDGT TGTTACAGTT 300 

GCCTTTGTTC AGGAATATCS TTCAGAAAAA Td'CTTOAAG AATTGAGTAA ACTTGTGCCA 360 

CCAGAATGCC ATTGTGTGCG TGAAGGAAAA TTGGAGCATA CACTTGCCCG AGACTTGGTT 420 

GCAGGTGATA CAGTTTGCCT TTCTGTTGGG GA TAG AG PTC CTGCTGACTT ACGCTTGTTT 480

GAGGCTGTGG ATCTTTCCAT TGATGA6TCC AGCTTGACAG GTGAGACAAC GCCTTGTTCT 540 

AAGGTGACAG CTCCTCAGCC AGCTGCRACT AATGGAGATC TTGCATCQAG AAGTAACATT 600 

GCCTTTATGG GAACACTGGT CAGATG1GGC AAAGCAAAGG GTGTTGTCAT TGQAACAGGA 660 

GAAAATTCTG AATTTGGGGA GGTTTTTAAA ATGATGCAAG CAQAAGAGGC ACCAAAAACC 720 

CCTCTGCAGA AGAGCATGGA CCTCTTAGGA AAACAACTTT CCTTTTACIC CTTTQGTATA 780 

ATAGGAATCA TCATGTTGGT TGGCTGGTTA CTGGGAAAAG ATATCCTGGA AATGTTTACT 840 

ATTAGTGTAA GTTTGGCTGT AGCAGCAATT CCTOAAGGTC TCCCCATTGT GGTCACAOTG 900 

ACGCTAGCTC TTGGTGTTAT GAGAATGGTG AAGAAAAGGG CCATTGTGAA AAAGCTGCCT 960 

ATTOTTGAAA CTCTGGGCTG CTGTAATGTG ATTTGTTCAG ATAAAACTGG AACACTGACG 1020 

AAGAATGAAA TQACTGTTAC TCACATATTT ACTTCAGATG GTCTGCATGC TGAGGTTACT 1080 

GGAGTTGGCt ATAATCAATT TGGC3GAAGTG ATTGTTGATG GTGATGTTGT TCATGGATTC 1140 

TATAACCCAG CTGTTAGCAG AATTGTTGAQ OCGGGCTGTG TGTGCAATGA TGCTGTAATT 1200 

AGAAACAATA CTCTAATGGG GAAGCCAACA GAAOGGGCCT TAATTGCTCT TGCAATGAAG 1260 

ATGGGTCTTG ATGGACTTCA ACAAGACTAC ATCAGAAAAG CTGAATACCC TTTTAGCTCr 1320 

GAGCAAAAGT GGATGGCTGT TAAGTGTGTA CACCGAACAC AGCAGGACAG ACCAGAGATT 1380 

TGTTTTATOA AAGGTGCTTA CGAACAAQTA ATTAAGTACT QTACTACATA CCAGAGCAAA 1440 

GGGCAGACCX TGACACTTAC TCAGCAGCAG AGAQATGTGT ACCAACAAGA GAAQGCAOGC 1500 

ATGGGCTCAG CGGGACTCAG AGTTCTTGCT TTGGCTTCTG GTCCTGAACT GGGACAGCTG 1560 

ACATTTCTTG GCTTGGTGOG AATCATTGAT CCACCTAGAA CTGGTGTGAA AGAAGCTGTT 1620 

ACAACACTCA TTGCCTCAGG AGTATCAA.TA AAAATGATTA CTGGAGATTC ACAGGAGACT 1680 

GCAQTTGCAA TCGCCAGTCG TCTGGGATTG TATTCCAAAA CTTCOCAGTC AGTOTCAGGA 1740 

GAAGAAATAG ATQCAATGGA TGTTCAQCAG CTTTCACAAA TAGTACCAAA GGTTGCAGTA 1800

TTTTACAGAG CTAGGOCAAG GCACAAGATG AAAAtTATTA AGTCGCTACA GAAGAACGGT 1860

rCAGTTGFAG CCATGACAGG AGATGGAGTA AATGATGCAG TTGCTCTGAA GGCTGCAGAC 1920 

ATTGGAGTTG CGATSGGOCA GACTGGTACA GATGTTTGCA AAGAGGCAGC AGACATQATC 1980 

CTAGTGGATG ATGATTTTCA AACCATAATG TCTGCAATCG AAGAGGGTAA AGGGATTTAT 2040 

AATAACATTA AAAATTTCGT TAGATTCCAG CTGAGCACGA QTATAGCAGC ATTAACTTTA 2100 

ATCTCATTGG CTACATTAAT GAACTITCCT AATCCTCTCA ATGCCATGCA GATTTTGTGG 2160 

ATCAATATTA TTATGGATGG ACCCCCAGCT CAGAGCCTTG GAfiTAOAACC AGTGGATAAA 2220 

GA7GTCATTC GTAAACCTCC TCGCAACTGG AAAGACAGCA TTTTGACTAA AAACTTGATA 2280 

CTTAAAATAC TTGTTTCATC AATAAT CAT T GTTTGTGGGA CTTTGTTTGT CTTCTGGCGT 2340 

G AGGCA CQAQ ACAATGTGAT TACACCTOGA GACACAACAA TGACCTTCAC ATGCTTTGTG 2400 

TTTTTTGACA TGTTCAATGC ACTAAGTTCC AGATCCCAGA CCAAGTCTGT GTTTGAGATT 2460 

GGACTCTGCA GTAATAGAAT GTTTTGCTAT GCAGTTCTTG GATCCATCAT GGGACAATTA 2520 

CTASTTATTT ACTTTCCTCC GCTTCAGAAG GTTTTTCAGA CTGAGAGGCT AAGCATACTG 2580 

GATCTGTTGT TTCTTTTGGG TCTCACCTCA TCAGTGTGCA TAGTGGCAGA AATTATAAAG 2640 

AAGGTTQAAA GGAGCAGGGA AAAGATCCAG AAGCATGTTA QTTCGACATC ATCSVTCTTTT 2700 
CTTGAAGTAT GA
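
Because the A43 coding sequence is stated to start at position 1, its first codons can be translated directly and compared with the paired protein entry. The sketch below uses the standard genetic code; the function and table names are hypothetical and not part of the listing, and unreadable codons are reported as X instead of being guessed. Translating the first three groups of A43 gives MIPVLTSKKA, which matches the first group of the A44 protein listing once scanning damage in that group is allowed for.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA read in frame from its first base, stopping at a stop codon.

    Whitespace between groups of ten is ignored. Codons that contain
    characters other than A, C, G, T (for example scanning damage)
    are reported as 'X'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the A43 listing (coding sequence starts at 1).
print(translate("ATGATTCCTG TATTGACATC AAAAAAAGCA"))  # MIPVLTSKKA
```

The same comparison can be repeated group by group to locate residues that were misread in either listing, although it cannot decide which of the two scans is at fault when they disagree.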



A44 Protein sequence
Gene name: ATPase, Ca++ transporting, type 2C, member 1
Unigene number: Hs.106778
Probeset Accession #: N51919
Protein Accession #: AAP27613
Signal sequence: none found
Transmembrane domains: 60-82, 86-108, 252-274, 282-304, 757-779, 827-849, 856-878
Pfam domains: Cation_ATPase_N [6-83], E1-E2_ATPase [89-324], Hydrolase [328-649], Cation_ATPase_C [744-889]
Cellular Localization: not determined

1 11 21 31 41 51

| | | | | |

MTPVLTSXKA SELPVSSVA3 XLQADLQNQL NKCEVSEJUtA FHGWNEFDIS EDEPEjWKKYI 60 

9QFKHPLIML IiWVSAVlSVL MHQFDDAVSI TVAILTWTV AHVQEYR8EK SLEBLfiKLVP 120 

PECHCVRBGK LEHTLARDLv PGDXVCLSrVG DKVPADUlLF EAVDLSIDES SI»TGETTPCS 180 

KVTAPQPAAT SKHXLASRSNT AFHGT LrVECQ KAKBWIGTG ENSEFGEVPK MMQABEAPKT 240 

PLQX9HDLLG KQL8PYSPG1 IGXlMIiVGWIi LGKDItEMP.T I8VSLAVAAI PEGLPlWTV 300 






TIALGVWRMV KKRAIVKKLP IVETLGCCNV ICHDKTGTLT KNEMTVTHIF TSDGLKAEVT 360 

GVOYNQFGEV IVDQDWHGF YNPAVSR1VE AGCVCmiAVr RNNTLMGKPT BGALIALAMK 420 

MGLDSLQQDY IRKAEYPFSS EQKWMAVKCV HRTQQDRPBI CFMKGAYEQV IKYCTTYQSK 480 

GQTLTLTQQQ RDVYQQEKAR MGSAGLRVLA IiASGPELGQL TFLGLVGIID PPRTGVKEAV 540

TTLIASGV6I KMITGDSQET AVAJASRlXJfc YSKTSQSVSG BEIDAMDVQQ LSQIVPKVAV 600 

FYRASPRHKM KIIKSLQKMO SVVAMTGDGV NDAVALKAAD IGVAMGQTGT BVCKEAADMI 660 

LVDDDFQTIM SAIEEQKGIY WNIKNFVRFQ LrSTS I AALTIi ISLATLMNFP NPIjHAMQIIjW 720

IHIIMDEPPA QSLGVEPVDK UVIRKPPKMW KDSILTKNLI LKJI,VSSIII VCGXliFVPWa 780

BLRDNVITPR STTMTFTCFV PPDMPHALSS RSQTKSVFEI GLCSNRMFCY AVLGSIMGQli 840 

IjVTYFPPIlQK VFQTESLSIL DLJWLLGLTS SVCIVAEIIK KVERSREKIQ KHVSSTSS8P 900 



A45 DNA sequence
Gene name: ESTs
Unigene number: Hs.157601
Probeset Accession #: W07459
Nucleic Acid Accession #: AC005383
Coding sequence: 328-2751 (underlined sequences correspond to start and stop codons)



1 

I 

GACAGTGTTC 
TTTTATTTGC 
CCIGGCGGTA 
ACAAACAGGT 
CCCCCTOGOC 
TCQCCGCTCT 
QTTTTCCTGT 
GAAACCATCO 
ATCAJTG7TTC 
CACTTTGCCA 
QCATTCCAGT 
CAGGAAGTGA 
CTTGCTCTGA 
CAGATCCTCA 
CAGCTGAAGG 
GAGCTGCATG 
GAGGATQCCA 
AOt3CCAGACT 
GAGTTCGCTG 
GCACACTGTC 
AGGACCACCT 
CCAGAAGGAC 
TOTGCCCTGA 
GGC3QQCAOCR 
GCGGTGCTGA 



GGCRTTOCCT 
CGTGGCTT CG 
CTCACTGAGT 
CAGCTGCTCC 
GGCAGCCCAA 
GAGCTGCAGG 
CTCOTCTTCA 
AQCmGXGA 
CTGGTQGTGT 
GCTOOGATGC 
ACOGCCCTQC 
GTCOCCAAAG 
GCOCAGAAGC 
AGTOAGGGTC 
GCCGACCTCC 
OCAGTCAACC 
GQGAGCTACC 
TGGAOCTCTT 
ATGGCTCCCG 
GGCACTGAAA 

TTcccGcoar 

ATGCTGCTTn. 
TTGATGTGTA 
CTGCCACCXT 
CGTTCCTTTG 
AGGCCTTTAC 
GCAGCTTTTC 
CTTGAGGGAC 
GGTCTCAQAC 
TGTGCATGGQ 
ACCTTGAAGG 



11 
1 

GCGGCTGCAC 
AGACCTGGGC 
GTTOCTCCGA 
GTCCCAOGTG 
G3AGCGC5CGC 
CCTTCCGTTA 
TTTCCAGAGT 
GGAAOATTTC 
TQTTAGATGG 
TCACAGTCTG 
TCAGTTCCAC 
AGGCAAGAAT 
AATACCTTCT 
TCATOGTCAC 
AAAGGGGTGT 
CACTGGCCAG 
CCAACGGCCT 
GCAGGGTCGA 
GCAATGCCCC 
CCTTCTACAQ 
GCCCAGGCCC 
TGGAOGGGTA 
AGCTGAGOCT 
CTCTGGACGG 
GCQAGGACTC 
TGCCTOTGGG 
TCCGTGGMG 
GGAGCGCCAC 
CACACTCCX3A 
TGCTGGGTGT 
AGCATGTGAT 
GGAAGCTGTG 
TGTTGGACAC 
GAAGCTGTGC 
ATGGCAGCCA 
TGCGGGCCAT 
TGCACATCTA 
CTGTGGTGGT 
TGAGGAACAA 
TG03GAGGCT 
GGTACCACCA 
TCIGC3\AACC 
GCIGCAAGTG 
GCTCTGTATG 



21 
1 

DGCTOBGAGG 
CGATGCOGCT 
CCTCAGCCGG 
GCAGCCGCGC 
COGGGTCTGT 

tatcaacatg 
qccc0catct 

AGCTGCCAGC 
GTCTAACAGC 
TGAOGOTCTG 
TCCTCATCTG 
CAAGAGGATG 
GCACAGAGGG 
TGATGGOAAG 
CACTGTGTTT 
CGAGGCTAGA 
CTTCAGCACC 
GGCTCACCCC 
ATGCtGGAGA 
CTGGAAGAGA 
CTGTGACTCG 
CCAGTGOCTC 
GGAATGCAQO 
CTTCCTGOGG 
TCGGGCCGGA 
GGAGTACCAG 
CCCCACCCTG 
CAGGACAGGC 
GGATGAGGTT 
AGGCAGTGAG 
GGTCTACTGG 
CAGCCGGCAG 

ctctgcctca 
cctccagttt 

(3GTGCAGACT 

tagccaggcc 
xoacaaagtg 
gctcacaggc 
tggcatctct 
tgcasgtccc 
ggaogtgctc 
cagcccqtgc 

TCGGGATGGC 
TGTGAGCCAG 



31 
I 

CTGGGTGACC 
TTAAAAAACG 
GTCGGGTCGT 
CCCGGGCGCC 
GAGTAGAGCC 
CCCCCTTTCC 

ctccctctoc 
aaaatoatgt 

GTCGGGAAAG 
GACATCAGCC 
GAAITCCCCT 
GTTTTCAAAG 
TTGCCTGGAG 
TCCCAGGGGG 
GCTGTGGGGG 
GGGCAGCACG 
CTCAGCAGCX 
TGTGAGCACA 
GGATCGCGGC 
GTGTTCCTAA 
CAGCCCTGCC 
TGCCOGCTGG 
GTCGACCTOC 
GCCAAAGTCT 
GTGGGTGTGG 
GATGTGCCTG 
ACGGGCAGTG 
CAOGACCGaC 
GOGGGCCCAG 



TGGTGCCTAC 
GGCCAGQACC 
GAGACAAGAA 
AGTAAATAOC 
TCCCITGAGG 
CACACAATCA 
TAGAGCATCC 
CACTTCCCCA 
GTTTGTGACT 
TGAATGrGAC 
CCCAGGTCTG 
TCTTC 



CTTCIGGAAI 
ACTATTCTQV 
AGCAGCTGAT 
CACTTTCTGT 
ATAAACAAGG 
ATGCTCGCCA 
TTTGGAOGGC 
GAGACATTCT 
TCTTGGOGAC 
CAATTAACCA 
GAGGGCCAOG 



A46 Protein sequence
Gene name:



GATCCTCAGG 
OGGCCAGGGT 
QTAGGGCCCG 
GAGGTGAACC 
GCCTTCGGGC 
COCTAOCTAG 
ATGACCGTCC 
GGGAGAGGOG 
GTCTTGGTCG 
OGGGATTOOC 
ATTQAGTGGC 
ATGAA1GAGG 
TGGGAGGGCC 
GGATGGATTC 
ACCCCTCCCA 
GTCTGTGCCC 
CTGAGGGAGG 
GTCACCCACA 
ACCTGCTGTG 
GGTCCTGAAG 
GAATGTTGTT 
GAAGGCCACG 
GGATGCATTT 
TGCCTTTTGT 
GCTTGGTTGA 
TAAAATOQTT 



ESTs 



42 

I 

CGCSTAGAAG 
CGAGGGGCTC 
GCCGCCCTCT 
CCTCCTGTGA 
GCOCGGQCAC 
TGTTGCTGGA 
AGGAAGTCCA 
GGTGCTCGGC 
GGAGCTTTGA 
OCGAGAGQGT 
TGGATTCATT 
GAGGQCGGAC 
GCAGAAATGC 
ATGTGGCACX 
TCAGGTTTCC 
TGCTGTTGGC 
CGGCCATCTG 
GQAQGCTGQA 
GGACCCTTGC 
CCCACCCTGC 
AGAA5X5QAGG 
CCTTTGGAGG 
TCTTOCTGCT 
TCGTGAAGCG 
OCAGAXACAG 
ACCTGGTGTG 
CCTTGCGGCA 
CACGTAGAGT 
0GCGTCAGOC 
CAGAOCTGGA 
ATCTGTTCAA 
GCGGGACACA 
AGAATTTTQC 
CTGACGTGAC 
TGGACACCAA 
GTGGGGIGGG 
AGAGGOGTGC 
CAGAGGATflC 
TGGGCGTOGG 
TGATCCACGT 
TGTGTGGAGA 
GCAGCXGGGT 
CCCACTGCGA 
TTGAGACGCC 
GCAACTACAG 
CAQQTOCT TA 
AGGATGTCCC 
AACGATGTTG 
CCTTGTTQAG 
ACTTAAATTT 
GACACAGTAA 
GCCTTTCAAG 
GCATTGAGTC 
GTGTGGAAGA 
TGATGGQGGA 
CTGAGTOGTG 



51 
I 

TGAAGTACTT 
TATQCACCTC 
CCCAGGAGAG 
TCCCGTAGCG 
CGAQCGCTGG 
GGCGGTCTGT 
TBTAAGCAAA 
TGCAGTGGAC 
AAGGTCCAAG 
CAGAGTGGGA 
TTCAACOCAA 
GGAGACGGAA 
TTCTOTGCCC 
GCCATCCAAG 
CAGGTGGGAG 
TGAGCAGGTG 
CTCCAGOGCC 
GATGGTOCGG 
GGTGCTGGCT 
CACCTOCTAC 
CAOVTGTGTT 
GGAGGCTAAC 
GGACAGCTCT 
GTTTGTGCGG 
CAGGGAGCIG 
GAGCCTCGAT 
GGCGGCAGAG 
GQTGGTTTTG 
AAGGGOGOGA 
GGAGATCACA 
CCAAATCCCT 
AacCCTGGAC 
TCAGATGCAG 
ACAGGTCGGC 
AOOCACCCGG 
CTCAGCCGGC 
OCGGOCTGGT 
AGGCGTTOCT 
GCCTGTCCXA 
GGCAGCITAC 
AGCCAAGCAG 
CCTGCSVQAAT 
GAACOGTGAG 
CCTGAGGCAC 
AGAAGGCCTG 
_GAATGTCTGC 
AACTGCAGCC 
TTGAAAAGTT 
GCTATGTCAT 
AGCGGCCTGA 
TGCCCAGCAG 
ATGGAAAGCA 
TQAAAGGGGG 
GACTTGGAAA 
GGOGCTGAGT 
AGCAGTGTCC 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260
1320
1380
1440 
1500 
1560 
1620 
1680 
1740 
1800
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180
3240 
3300 
3360 






Unigene number: Hs.157601
Protein Accession #: none found
Signal sequence: 1-17
Transmembrane domains: none found
VWA domains: 49-223; 341-516; 529-705
EGF domains: 298-333; 715-748
Cellular Localization: secreted



1 11 21 31 41 51

| | | | | |

MPPFLLLEAV CVFLFSRVPP SLPLQSVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 

SVGKQSFERS KHFATTVCDG LDISPERVRV GAFQFBSTPH LEPPLDSFST OQEVKARIKR 120 

MVFKGGRTBT EIiALKYLLKR GLPGGRNASV PQIIiirVTDG KSQGDVALPS JCQLKERGVTV 180 

FAVGVRFPRW EELHALASEP RGQHVXAAEQ VEDATHGliFS TLS9SAICSS ATFIXZRVEAH 240 

PCEKRTLEMV REFAGNAPCif RGSRRTLAVL AAHCPFYSWK RVFL1HPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGUJGYQC LCPLAFGGEA NCALKLSLBC RVDUjFIOjDS &AGTTIxDGFL 360 

RAKVFVKRFV HAVLSEDSRA KVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420

LTG9ALRQAA ERGFGSATRT GQDRPRKWV LLTESHSEDE VAGPARHARA RBLLLLGVGS 480

EAVRAELEBI TGSPKHVMVY 9DPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMIiBTSA 540 

SVGPEHFAQM QSFVRSCALQ FEVNPDVTQV GLWYGBQVQ TAFGLDTKPT RAAMLRAISQ 600 

APY1/3GVGSA OTAIiIJUYDX VMTVQRGARP GVPXAVW1.T GGRGAEDAAV PAQKLRNNGI 660 

SVLWGVGPV LSEGLRHIiAG PRDSLIEVAA YADLRYEQDV DIEWLCGEAK QPVNLCKPSP 720 

CMNBGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSC3VCVS QGWILETPLR HMAPVQEGSS 780
RTPPSHYREG LGTEHVPTFH NVCAPGP 



COLON 



A47 DNA SEQUENCE:

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]
Unigene number: Hs.100686
Probeset Accession #: AA487468
Nucleic Acid Accession #: AA487468
Coding sequence: 55-555 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

CGGCACCAAG AACACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120

GCAATAAAAA AGGAAAAGAG GCCTCCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 180

ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240

ATGGTTATTC ATCACCTGGA GGATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300

CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360

GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 420

GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540

CAGTCAGAGC TATAAGAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTCCA AGTATATAGA TTTTGTAATA 660

TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA
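
The "Coding sequence" field gives 1-based, inclusive nucleotide positions, so a range such as 55-555 spans 501 nucleotides, or 167 codons including the stop codon. The following is a minimal sketch of how such a range could be checked against a listed sequence. The function names and the toy sequence are hypothetical and are not part of the listing.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def extract_cds(sequence: str, start: int, end: int) -> str:
    """Return the coding region for 1-based, inclusive coordinates.

    Whitespace in the listed sequence (block separators, line breaks)
    is ignored, and the sequence is upper-cased before slicing.
    """
    cleaned = "".join(sequence.split()).upper()
    cds = cleaned[start - 1:end]
    if len(cds) != end - start + 1:
        raise ValueError("coordinates fall outside the sequence")
    if len(cds) % 3 != 0:
        raise ValueError("coding region length is not a multiple of 3")
    if not cds.startswith("ATG"):
        raise ValueError("coding region does not begin with ATG")
    if cds[-3:] not in STOP_CODONS:
        raise ValueError("coding region does not end with a stop codon")
    return cds


def residue_count(start: int, end: int) -> int:
    """Number of encoded residues, excluding the stop codon."""
    return (end - start + 1) // 3 - 1


# Toy sequence (hypothetical): two flanking bases on each side of ATG GCT TAA.
toy_cds = extract_cds("ccATGGCT TAAgg", 3, 11)
toy_len = residue_count(3, 11)
```

Applied to the range 55-555, residue_count gives 166, which can be compared with the length of the corresponding protein entry.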



A48 Protein sequence:
Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]
Unigene number: Hs.100686
Probeset Accession #: AA487468
Protein Accession #: none found
Signal sequence: 1-23
Transmembrane domains: none found
Cellular Localization: secreted



1 11 21 31 41 51

| | | | | |

MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEE GLFYAQKSKK 60
PLMVIHHLED CQYSQALKKV FAQNEEIQEM AQNKFIMLNL MHETTDKNLS PDGQYVPRIM 120
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL



A49 DNA SEQUENCE

Gene name: G protein-coupled receptor 56
Unigene number: Hs.6527

Probeset Accession #: AA470599
Nucleic Acid Accession #: NM_005682

Coding sequence: 163-2244 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

CGGCRGCAGG GTCtCGCTCT GTCACACAGG CTGGAGTGCA GTGGTGTGAT CTTGGCTCAT 60 
CGTAACCTCC ACCTCCCGGG TTCAAGTGAT TCTCATOCCr CAGCCTCCCG AGTAGCTGGG 120 
ATTACAGGTG GTGACTTCCA AGAGTGACTC CGTCGGAGGA AAATGACTCC CCAGTCGCTG 180 




CTGCAGACGA 
GGCCACAGGG 
CACTACAAAC 
GTCCATGCCC 
CTCTACCACT 
AAGCGTGACT 
GAGAGCCTOG 
CAGAACATCA 
ACGGCCOCTC 
AGOCAGTTCC 
CAGCAGTTGC 
TCCTTCGAGG 
CAGGACCTGC 
GTGCTGCTGC 
AGACTCCTCC 
OTCCTGGGTG 
GAGCCCGTGG 
GTGTTCTGGG 
GAGACCGTCA 
GTGCTGATGG 
TCCTACGTGQ 
TGCTCCAGGG 
ATGAACCTGC 
GCCCTGACAG 
CTCACCTGCC 
GTCITTGQCA 
CCCATCTTTC 
TTGGCTGTGC 

GCCATGCTAG 
TCACATGTGC 
TTCTTCTCCT 
ACCTCCTTCC 



AGCACCTCGT 
TGCGGCCTCG 
CCGCAQACTT 



ACGGGACTCA 
CCAGCTGGAG 
GGCCTTGGAT 
TGTGGCTCCA 
TTTTAACCTC 



CACTGTTCCT 
AAGACTTTCG 
CCACACCAGA 



TCTGCCTCTA 
TCTTGCTGAG 
CTCAGGGOCC 
GCCTGCCCAG 
ACAATGCCTC 
TGAAGCATOC 
AGAGCCTGGA. 
AGGACOGGAT 
ACATCCACTC 
CTCGAACACT 
TGGTGGACTT 
AGAAGGTCTT 
TGCTCACTTT 
TTGAAGACCC 
GGAGAGAAAC 
TCTCCTCGGT 
□CTGTGTCGT 
TGCCCCTGCC 
TGCTGGCCGT 
GCTTCTGAGGC 
TTTCCTGOAT 
CCTATGTCCC 
TGGTGACGCT 
ATAGGACTCC 
GCTACATCAC 
CCACCATGGT 
TGACACTGCT 
TTGCTTCTGG 
AAGGCTTCCT 
CCCCTCTGAA 
CCAGCCGGAT 
TCGCACACTG 
TGGAAAGCCC 
TGGGGCTTTT 
GAAGTGGGCC 
OCCIGGTCTC 
CTTGAGGGTC 
GTTQCTCTGT 
AGGTGGCACC 
GGAGAGGCCC 



GCTGAGTCTG 
CTTCTGCAGC 
CCTGCQCATC 
AGCCCACCCT 
CTGGAACCGA 
TQACAAAGCC 
CCCGCTGTTA 
TGCCGCCAGC 
GGTGGACATG 
CCAGAAGGCC 
GTCGAAACTG 
CAACGCCACG 
CCGGCAGGAG 
CTTCCAGAGG 
CAGCAGCCAA 
GOGGATTGTG 
CCAGCACCAG 
CACATTGAGC 
CCAAAjCATGC 
GOAGGTGGAC 
CTCSGCCCTG 
GTGCAGGAGG 
CTTCCTGCTG 



GGGCCTOGAG 
TGGCTACCTA 
GGTQGCCCTG 
AGAGGGCGTC 
CAACCTGGGC 
GGTGCAGATC 
GGOCCTCAGC 
CACCTTCCAG 
CATCTTCATC 
GAGCAACTCA 
C TAGGC CTCC 
CCTGTGGCCC 
AAGGACCATG 
GAATTGGCCT 
GCCATGCTGC 
TCCTTACAAC 
TBGCACATCC 
CTCTCGTGGT 
CAGGGCGAAT 
TTTGCCAGGA 



CTCTTCCTGG 
CAGCGGAACC 
TCCATCGAGA 
GCTTCCCGAT 
CATGCTGGGA 
TCTAGCCTCC 
GCCACTTCTG 
TTCACCTTCT 
TGCGAGCTCA 
TGAAGGAGGC 
ACCTCTQTOA 
GTATGGAAGC 
GAQGAGCAGA 
ACGAAAGGOC 
GCCCTOTTCC 
GTACAGAACA 
CTACAGCCQA 
AGCCCGGGGC 

GCCGTGCACA 

AAACCTCGGG 
GACACGAGCT 
□CCAGTGCCA 
GGGTACAACC 
CTCAAGCTGA 
GTGGATGTGG 
ATCTACCCTT 
CTCTT CAGCC 
CTGCGGCTGC 
CTGGTCCTTG 
CTTGTCGTCC 



GACTGCGCCA 
AGCCCACCTG 
CCGAGCCAGG 
GAGAGATGGG 
TGGGGACTAC 
CTAGGGTACT 



TTAATCCTGT 
CACCCTOAGG 



GCACAGCAGC 



TCCAAGGTGC 
AGACACACAO 
ACTCCGAAGA 
CCTTCCCTGA 
GATTACATCT 
TCTGCTTCCA 
TCACCTCCTQ 
CCTTCCACAG 
AAAGGGACCT 
CCTCGGCTGC 
GATTGATGGG 
TCCAGCOCAC 
BCQAGAT CAT 
GGAGCGGGGA 
AGGACAAGAA 
CCAAAGTAGC 
AGAATGTQAC 
ATTGGAGGAG 
ACCACTTGAC 
AGCACTACCT 
TCACCATTGC 
ACTACACCAT 
TCCTGCTCAG 

TCTACCGACT 
GCGCCATGGG 
ACAACTATGG 
CCATGTGCTG 
TGGTGTTTCT 
GCCCCCACAC 
GCCTGCCCTG 
TCTACCTTTT 
CCATGCGGCr 
GGCTCCCCAT 
CCCATGTGAT 
CCCAGCCCCA 
CCGTTGCCAT 
TCGGOTCTCA 
GTCCCCACAT 
AGCCTCATTG 
GCCCCTGCCT 
GCACTCTGCA 
GCAGACCITC 
AGCTCGCCTA 



ccacggcagg 
gagcagcctc 
ggcdctcaca 

CCCCAGGGGC 
TCTCTATGGC 
GCACGAGGAG 
GTGGAGCCCT 
TCCTCCCCAC 
CCAGCTGCTC 
OCOCGCCAGC 
GGACATGGTG 
AGCCGGCCTC 
GGAGTACTCG 
GGCTGAGAAG 
TTCCAGCGAA 
CAAGCTCACG 
TCTGCAATGT 
TQCTGGGTGT 
CTACTTTGCA 
GAGCCTCCTC 
CGCCTACCTC 
CAAGGTGCAC 
CGAGOCGGTG 

CaTGGTGGAG 
CTGGGGCTTC 
CCC CAT CATC 
GATCCGGGAC 
GTTCAACATG 
CCAAAAGTGG 
GGCCTTGATC 
CAGCATCATC 



CAGCTCGGGC 
GAAGCAGAGA 
GGCCAGTCAG 
GGrGGACGGA 
CTCAGCTCCC 
CTGTCCCAAC 
CTGGGGGCCA 
GGGACAGAAA 
TCCTCTGTCA 
AGGGCCAGAG 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820






A50 Protein sequence
Gene name: G protein-coupled receptor 56
Unigene number: Hs.6527
Protein Accession #: NM_005682.1
Signal sequence: 1-26
GPS domain: 342-394
Pfam domain: 7tm_2 [400-665]
Transmembrane domains: 410-432, 446-468, 482-504, 517-539, 575-597, 608-630, 638-659
Cellular localization: plasma membrane






1 
I 

MTPQSLIjQTT 
6SEALXVHAP 



RDLQLLSQFL 
QPTAGI47DLH 
SKN88QVLGB 
WfiSAGCETVR 
TIARYLCSRV 
PLHFfililiTCE. 
WYGPIIIiAVH 
PHTQKWSHVL 
MRLQARGGPS 



11 
I 

LFLLSLLFliV 
FPAAHPASRS 
QGPPLIATSV 



21 
I 



IH6RQEEEQS 
KVljGrWQKT 
RETQTSCFCEJ 
PLPCRRKPRD 
SWWGLEGYHIj 
RTPEGVIYPS 
TLL GLS LVLG 
PLKSKSDCAR 



FPDPRGIiYHP 
TSWWSPGK1S 
SAAPASQQLQ 
EIMBYSVUjP 
KVANLTSPW 
HLTYPAVLHV 
YTIKVHMNLL 
YRLWEVFGT 
MOTIRDSLVS 
L.PMALIPF3F 
LPISSGSTSS 



31 
I 

DFRFCSQPKQ 
CLYWNRHAGR 
LPSAASFTPS 
filiBSKLTSVR 
RTLFQRTKGR 
LTFQHQLQPK 
8SVBVDAVHK 
LAVFIiIiDT&F 
YVPGYLLKbS 
YITKLGLFSL 
ASGTFQIiWI* 
SHI 



41 

I 

THRS&LHYKP 
IHUjYGKRDP 
FH&PPHTAAH 
FMEGDMV9FER 
SGEABKRLLL 
NVTLQCVFWV 
HYLSLLSYVG 
IiliSEPVAl/TG 
AMGWGFPIFIj 
VFIoFUMAMLA 
YLFSIITSFQ 



51 
! 

TPDLRigiHKT 
LLSDKASSLL 
KASVDMCELK 
DRTHATVWKL 
VDFQSQALFQ 

EDP7L88PGH 
CWaALACLV 



VTIjVALVDVD 
TMWQILRLR 
GFliI FIWYW 9 



60
120
180
240
300
360
420
480
540
600
660



A51 DNA SEQUENCE

Gene name: Hypothetical protein FLJ20063
Unigene number: Hs.5940
Probeset Accession #: AA053660
Nucleic Acid Accession #: AA053660
Coding sequence: 218-1360 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

CCCCCATGAC TTTGTACAGC TACTTCACTG CTTCCCCCCA ATTAGTACAC ATAGTTCCTC 60



CACAATTCCT 
ACCTACTGCT 
AATCACOGCT 
TAACAATGAA 
CGCTTTATTG 
TCCCTGTGCA 
AGAAGGGTAT 
TTCAGTGACA 
CTTGCATftGT 
GACTGTAATT 

caagtttctt 
gactotgact 

TGATTTGACC 
TGGTTEAGCA 
CQTTGCTTOC 
AAAGAAGAGT 
TGGOAACTGC 
GCTGATCCTC 
ATTGATTGTC 
TGACGAAGAC 
AGGGAGCGTC 
CTATTCAAGA 
CGCCATGGCC 
GAAGTGAGCA 
CCTCTCTGAA 
TCTATGACAT 
TCAGTACAAA 
CAAC&AGAAC 
CATTGCTCTT 
TCCGAGGGGA 
AAGTCAOAAT 
CAGCCCAGAG 
CTTTTTCAGG 
ACTAAAAATG 
QGCAGAGAOT 
AGGATCAATT 
CAGCAAAGGA 
AACAAAAAAT 
CAAAGGQTGC 
TTTCTGAGGT 
ACCAACACAC 
CAAGCTTTTA 
GTTATTTTAT 
AAGTGCCTTT 



ACACCTGCTC 
GCAGACAGTG 
TCATCTCCAA 
ATGTCCCCCA 
GAGACCAGCA 
GATAATTCGT 
TACTACAACT 
GTATCAGAAA 
GAAATTACTA 
CTTACTGTAA 
GATGTAACAA 
GAGAAAATTA 
CTTCGGTGTG 
TGCGATTGCA 
AGTCTCAAGT 
GGTGGGGCCC 
CAAAAGTGTG 
ACTATTGTGG 
ACAGCAAGAT 
TTTCAAAATC 
TTTCCTAAGG 
CACAGCAGCA 
CCCAACCAAT 
CCAGTAAAGA 
TGGAAGTTGT 
TAAATGTAGT 
GTACTGAGAC 
CATTTCCAAT 
GAGTTAAGTG 
TGAGAAGGGA 
AGGGACACTG 
CTTGCAACCT 
AGGCGTGCCT 
TCCACTATGG 
AAAAAACATG 



GCACT*rCTCT 
CCAGGAGTAA 
GAGAGGGAGC 
GTGCCATTOG 
AACCACACAC 
CAAATGT TAT 
TTGTTATTAT 
GTTAGATGGT 



CCCCCATAAT 
AGTCAACCAC 
ATGATGGATT 
CCACAGAAGA 
CCCTAAACAG 
TATGTGTTAA 
CTTCTACATG 
CATTTGACCC 
GCTTG-TTTAA 
GCACATCTCT 
TAGTAACAAT 
ATAAAGCAAT 
ATTATTATGG 
AATCTGACCT 
GTCCTGATGC 
CTGAGTGTGC 
CATTTGGCTA 
GCACCATCGC 
CAAATAACAA 
TAAAACTGCG 
TCAGGATAAC 
TGCCCCGCCC 
GTACAAGCTA 
TCTGQOCTCC 
QAATGTTTGC 
AGATGCTATT 
AATGGT T AGG 
CTAGAGGAAA 
ACCTAATTOC 
TACCCACCAT 
CTTCTATCCC 
AGCCTCACCC 
GGGAATCCAG 
GGTGCACTCT 
AGCTGOTAGA 
CACCTGGGAT 
AATCATGCCC 
GAGCCTTAGG 
TTGTGCTCAG 
GGCCTCAGCC 
ACACAAOCAC 
TAGTGTCCTT 
TATTTGTTCT 



CAGTACACAT 
AAATGTAAAT 

aatcacaaxg 

CAATCAATCA 
CACAGGTCCC 
GCTGCATAAT 
TAAGAAAGGA 
AGAAGAGAAA 
AGATGTATTT 
GTCACCAAGA 
TTTGGCAGAA 
TAGAAGTAGC 
CTGTAACCAG 
GCAAAGGCCT 
CTGCAACGCA 
QTGCGTGCCC 
CAGTGGACTC 
TGGCATTGTC 
AACGAAGCAT 
GTCGACAGGC 
GGCCTCCAGA 
TGACTAT TAG 
TTATTTAGAO 
GGGGTTTTTC 
AACGAATCCA 
AGCGCTTQTC 
GTTQTTTTCT 
GCTCOCCAGC 
CCTGGGAGAC 
CTTTCAAGGG 
TCCAATGGAG 
AAGAAGACTG 
GAACTTTTTG 
ACAGTTTTTG 
AGGAAGAGAG 
CCACCTTCTT 
TCCCGAAGAC 
TCAGTTTGAA 
GAGTCGAGCC 
TTCTCTGGTG 
ACACACAAAT 
TTTTATTTCT 
TGACTGTTAA 



AGTTCCICCA 
TCATTAGCTA 
GTTCCTTCTG 
TCAGoacCTC 
AGCAATCCTT 
ACAAGTTTTT 
AAGGTATTCC 
CATTCCATGG 
GGCACATCTG 
TCTGAAATGC 
ACCACAAGTG 
TCAAGCAACT 
ACTG03GATG 
AACCCACAGA 
CAGCACAAGC 
GGCTACCAGG 
GACTGTAAGG 
ATTCTCAGCA 
ATTGAAGAAG 
TTCACCAATC 
GACAGCCAQA 
AAT CATAAGA 
TGTTTAGAAA 
TTCCATCTGA 
GCTCACTTGC 
AGAGAGGTGG 
TJUV3TCTTTT 
ATTGCTTGCT 
ATACGCATCA 
TCACAAGCTC 
AGATTCTGGC 
GAAAGAGACA 
ATGCTAATTA 
AAATGCTAGG 
GCAAAGGAAA 
CCTTAGGTCC 
TGGCTGGGAG 
ATTGGAGACA 
GTGCAGCCTC 
ACAGAGGCTC 
GGGGGCAAGC 
AATGCCTTGT 
TTGTGAATGG 
AAAAAAAAAA 



CAATTCCTAT 
GCTCTGACAT 
AAACACAAAG 
CCACTGGCAC 
GCCAAGATGA 
GCCTGTGTTT 
CTGGGAAGAT 
CCTATCAAGA 
TTTATGGACA 
GTGCTGATGA 
ACAATGAGAA 
TTCTAAACTA 
ACTGCCTCAA 
GCCCTTTCTG 
AATGCTTAAT 
AAGATGCTAA 
ACAAATTTCA 
TGATAATTGC 
AGAACTTQAT 
TTGGAGCAGA 
TGCAAAATCC 
ATGTGGAACC 
GACTGATGOA 
CAXCTGCCAG 
TAAATAAGAA 
TTTXCTTCAA 
OCTGGTAGGG 
CCTGGGCAAA 
ACTGTGGAGG 
ACTCTCTGAC 
AACCTTTGAA 
TATCTCTCAG 
GAAGGCCfGG 
AGGCAGAAGQ 
CTGGGTGGGG 
CCTCCTCCAT 
AAGGTTTAAA 
AACTGTCTGG 
GGQGTGTAGG 
AGCTGTGGCC 
ACATCCAGTA 
CCTCTlAAAA 
TAATGCAATA 
A 



A52 Protein sequence:
Gene name:
Unigene number:
Probeset Accession #:
Protein Accession #:
Signal sequence:
Transmembrane domains:
EGF domain:
SEA domain:

Cellular Localization: 



120 

180
240
300
360
420
480
540
600
660
720
780
840
900
960
1020
1080
1140
1200
1260
1320
1380
1440
1500
1560
1620
1680
1740
1800
1860
1920
1980
2040
2100
2160
2220
2280
2340 
2400 
2460 
2520 
2580 
2640 



Hypothetical protein FLJ20063

Hs.5940

AA053660

none found 

none found 

289-311 

45-74 

plasma 






i 
I 

MVPflETQSNN 
NTSFCLCLEG 
ETGTSVYGQTV 
SSSNFLHYDI* 
AOHKOCMKBC 
VILSKIIALI 
RDSCjMfcHPYS 



11 21 31 41 51 

I I 1 1 I 

EMSPTTEDNQ SSGPPTGTAL I/ETSTLNSTG PBNPOQDDPC ADNSLCVKLH 
YYYNSSTCKK. GKVFPGKTSV TVSETFDPKE KHSMAYQDLH SEZTSLFKDV 
ILTVBTSLSP RSKMRADDKF VDVTrVTIIiA ETTSDfoJEKTV TEKXNKAIRS 
TLRCUYTGCN QTADECLNGl* ACDGKSDLQR PHPQSPPCVA SSLKCPOACM 
SGOAPSXaCV PGYQBDANGN OOKCAPGYSG WXXDKFQLI LTIVGTIAGI 
VTARSNHKTK KIEKENLIDE DFQNIrKLRBT GETHDGABGS VPPKVRITAS 
&HSSMPRPDY 



A53 DNA SEQUENCE
Gene name:
Unigene number:
Probeset Accession #:
Nucleic Acid Accession #:
Coding sequence:



60 
120 
180 
240 
300 
360 



Hs.298241

AI538613

AB038157 

202-1566 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60
GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120
CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TOGAGGCGGA GCCGGATGTC 180
AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCQC CTGCTGTTGA AGCCOCCTTC 240



J^T? =aAT D3CTTT:rrGG CdTOATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 
GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA ASTnTTTCC AATCATCGTC 360
™I^ GATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TOGGCATCCA CTTCGACTGC 420 
TCAGGGAAGT ACAGATGTCG CXCATCCTTT AAGTGTATCG AGCTQAT AG C TCGATGTGAC 480 
GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCOGGGT GGGTGGTCAG 540 
AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTQGAAGA CCATGTGCTC CGATGACTOG 600
AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGQTT TCCCAAGCTA TGTGAGTTCA 660 
GATAACCCCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGOAGTTTGT GTCCATCGAT 720
CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 
IS^E 01 ^ GCCACGTGGT TACC1TGCAG TGCACAGCCT GTGGTCATAQ AAGGGGCTAC 840 
AGCTCAC5CA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGlGGCCCTG GCAGGCCAGC 900 
CTTCAGTTCC AGGGCTAOCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGOGT 1020 

CTAGTTTCCC TOTGGACAA TCCAGCCCCA TCCCACTTGG TOGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGOCAAAOAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGA* CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGC AGGTGACGCC 1260 

TCOCCTGTCC TGAACCACGC GGCCGTCCCT TTGATTTCCA ACAAGATCTG CAACCACAGG 1320 

GACGTGTACG GTGGCATCAT CTCCCCCICC ATGCfTCTGOG OSGGCTACCT GACGGGTGGC 1380 

GTGGACAGCT GCCAGGGGGA CAGOGGGGGG CCCCTGG1GT GTCAAGAGAG GAGQCTGTGG 1440 

AAGTTAGTGG GAGCGACCAG CITTGGCATC GGCTGCGCAG AGGTGAACAA GCCTGGGGTG 1500 

^S CCGTQ TCACCrCCrr CCTGGACTGG ATCCACGAGC AGATGGAGAG AGACCTAAAA 1560 

ACCTGAAGAG GAAGGGGACA AGTAQCCACC TQAGTTCCTG AGGTOATGAA GACAGCCCGA 1620 

TCCTCCCCTG GACTCCCGTG TAGGAAOCTB CACACGAGCA GACACCCTTQ GAGCTCTGAG 1680 

TTCCGGCACC AGTAGCAGGC CCGAAAGAGG CACCCTTCCA TCTGATTCCA GCACAACCTT 1740 

CAAGCTGCTT TTTGTTTTTT GTTTTTTTGA GGTGGAGTCT CGCTCTGTTG CCCAGGCTGG 1800 

AGTGCAGTGG OGAAATCCCT GCTCACIGCA GCCTCCGCTT COCTGGTTCA AGCGATTCTC 1860 

TTGCCTCAGC TTCCCCAGTA GCTGGGACCA CRGGTGCCCG CCACCACACC CAACTAATTT 1920

TTGTATTTTT ACTAOAGACA GGGTTTCACC ATGTTGGCCA GGCTGCTCTC AAACOCCTGA 1980 

CCTCAAATGA TQTGCCTGCT TCAGCCTCCC ACAGTGCTGG GATTACAGGC ATGGGCCACC 2040 

AmCCTAGCC TCACGCTCCT TTCTGATCTT CACTAAGAAC AAAAGAAGCA GCAACTTGCA 2100 

AGGGOGGCCT TTOCCACTGG TCCATCTGGT TTTCTCTCCA GGGGTCTTGC AAAATTCCTG 2160 

ACGAGATAAG CAGlTATGTG ACCTCACGrG CAAAGCCACC AACAGOCACT CAGAAAAGAC 2220 

GCACCAGCCC AGAAGTGCAa AACTGCAGTC ACTGCACGTT TTCATCTCTA GQGACCAGAA 2280

CCAAACCCAC CCTTTCTACT TOCAAGACTT ATTTTCACAT GTGGGGAGGT TAATCTAGGA 2340 

TCTTTGTftGC ATTTGGTGCT T^ACGTATTA 2400 

TTGTCCTTTG ATTCCAAATA ATATGTTTCC TTCCCTCAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAAAAA 



A54 Protein sequence:
Gene name: TMPRSS3a
Unigene number: Hs.298241
Probeset Accession #: AI538613
Protein Accession #: BAB20077
Signal sequence: none found
Transmembrane domains: 43-65, 239-261
Tryp_SPc domain: 216-444
Cellular Localization: plasma membrane



i i 1 f i 1 T f 

MGEHDPPAVE AFFSPRSfcFG LDDUQSPYA PDAtJAVAAQI LSIlUPLKFFP irVIGlIALI 60 

IAUUGLG1H FDCSGKYRCR SSFKC2HLIA RCDGVSDCKD GKDBYRCVRV GGQNAVLQVF 120 

^f^™^ 8 »DWKSHYANV ACAQLGFPBY VSSTOttRVSS U3GQEREEFV 180 

y™^HSVYy KEGCASGHW TLQCTACGHR RGYSSRIVGG NMSLLfiQWPH QASLQFQGYH 240 

LCGG8VITPL WHTAAHCVY DLYLPKSWTI QVGI>VSLLDN PAPSHLVBKI VYHSKYKPKR 300

LGWDIALMKL AGPLTFNEMX QFVCLPKfiEE HFPDQKVGWT flGKGATKDQA GDASFVLNHA 360 

AVPMSKKZC NHHDVYGGri SPSMLCAGYL TGGWSOQGD SGGPliVOQER fiLlTOVGATS 420 
PGIGCAEVNK 5GVYXHVTSF LDWIHKQMER BLKT 

A55 DNA SEQUENCE

Gene name: Putative G protein-coupled receptor GPCR150
Unigene number: Hs.97101

Probeset Accession #: AA215333
Nucleic Acid Accession #: NM_014373

Coding sequence: 322-1338 (underlined sequences correspond to start and stop codons)

i i 1 r r r r 

^SSSS SIS^^ GQOCCGOOOC CTGCAGTC03 GAGACGAACG CACGGACCGG 60 

GCAGGTTCGG CTGGAAGGAA COGCTCTCSC TTCGTCCTAC ACXTGCGCAA 120 

^^2^ GCTTACrCAC ATAGCATATT GGTATATCAA AATGAAATGC AAGGAACCAA 180 

AAATAACATA ATTGAAGGCA GTAAAAGTGA AAtTAAATAG GAAGAT CAT C AGTCAAGGAA 240 

GACCCACTGG AGAGGACAGA AAATGAAGCA GTGTTTTATC ATGTGTATTT CAGCAGOTCT 300 

MTGAAATT TAACTAAAAA TATGACTGCT CTCTCTTCAG AGAACTGCTC TTTTCAGTAC 360 

CAGTTACGTC AAACAAACCA GCCCCTAGAC GTTAACTATC TGCTATTCTT GATCATACTT 420 

TATTAAATAT CCTTACACTA GGAATGAGAA GAAAAAACAC CTGTCAAAAT 480

TTTATGGAAT ATTTTTGCAT TTCACTAGCA TTOGTTGATC TlTTACTTTT GGTAAACATT 540 

I^Z™* T Sf^™ kG aGATTTTGTA CTTTTAAGCA TTAGGTTCAC TAAATACCAC 600 

^?5^ T TTACTCAAAT TATTTCCTTT ACTTATGGCT TTTTGCATTA TCCAGTTTTC 660 

CTGACAGCTT GTATAGAT T A TTGCCTGAAT TTCTCTAAAA CAAOCAAGCT TTCATTTAAG 720 

TGTGAAAAAT TATTTTATTT CTTTACAGTA ATTTTAATTT GGATXTCAGT CCTTGCTTAT 780 




GTTTTGGGAG ACCCAGCCAT CTACCAAAGC CTGAAGGCAC AGAATGCTTA TTCTCGTCAC 840 

TGTCCTTTCT ATGTCAGCAT TCAGAGTTAC TGGCTGTCAT TTTTCATGGT GATGATTTTA 900

TTTOTAGCTT TCATAACCTG TTGGGAAGAA GTTACTACTT TGGTACAGGC TATCAGGATA 960 

ACTTCCTATA TGAATGAAAC TATCTTATAT TTTCCTTTTT CATCCCACTC CAGTTATACT 1020 

GTGAGATCTA AAAAAATATT CTTATCCAAG CTCATTGTCT GTTTTCTCAG TACCTGGTTA 1080 

C CATTTGT AC TACTTCAGGT AATCATTGTT TTACTTAAAG TTCAGATTCC AQCATATATT 1140 

GAGATGAATA TTCCCTGGTT ATACTTTGTG AATAGTTTTC TCATTGCTAC AGTGTATTGQ 1200 

TTTAATTGTC ACAAGCTTAA TTTAAAAGAC ATTGGATTAC CTTTGGATCC ATTTGTCAAC 1260

TGGAAGTGCT GCTTCATTCC ACTTACAATT GCTAATCttG AGCAAATTGA AAAGCCTATA 1320 

TCAATAATGA TTTGTTAATA TTATTAATTA AAAGTTACAG CTGTCATAAG ATCATAATTT 1380 

TATGAACAGA AAGAACTCAG GACATATTAA AAAATAAACT GAACTAAAAC AACTTTTGOC 1440 

CCCTOACTQA TAGCATTTCA GAATGTGTCT TTTGAAGGGC TATACCAGTT ATTAAATAGT 1500

GTTTTATTTI AAAAACAAAA TAATTCCAAG AAGTTTTTAT AGTTATTCAG GGACACTATA 1560 

TTACAAATAT TACTTTGTTA TTAACACAAA AAGTGATAAG AGTTAACATT 1GG C TAT ACT 1620 

QATGTTTGTG TTACTCAAAA AAACTACTGG ATGCAAACTQ TTATGTAAAT CTGAGATTTC 1680 

ACTGACAACT TTAAGATATC AACCTAAACA TTTTTATTAA ATGTTCAAAT GXAAGCAAGA 1740 
AAAAAAAAA 



A56 Protein sequence
Gene name: Putative G protein-coupled receptor GPCR150
Unigene number: Hs.97101
Protein Accession #: NP_055188
Signal sequence: none found
Transmembrane domains: 23-45, 59-81, 97-119, 138-160, 184-206, 241-263, 276-297
Cellular localization: plasma membrane



1 11 21 31 41 51

| | | | | |

MTATjSSENCS FQYQIjRQTKQ pldvnyixfi* IILGBCI1*LNI ltlgmhhxmt cdkpmeyfci 60 

SLAPVDLIiIiI. VNISTIIiYPR DFVLLSIHFT KYHICliPTQl ISFTYGFLHY PVFLTACIDY 120 

CLHFSXTTKI* SFKCQKIiFYF PTVILIHISV EAYVLGDPAI YQSUCAQNAY SRHCPFYVSI 180 

QSYWLSFFMV KllaPVAFITC WEEVTTLVQA IRITSYMHHT ILYPPFSSHS SyTVRSKKIF 240 

LSKLlVCPtiS THliPFVLLGV IIVIAKVQIP AYIEMWIPWL YFVNSFL1AT VYWENCHKLtf 300
I»KDIGLPLDP FVNWKCCPIP JLTIPNLEQIE KP1SIMIC 



A57 DNA SEQUENCE

Gene name: ESTs, Weakly similar to CGHU7L collagen alpha 1(III) chain precursor [H. sapiens]

Unigene number: Hs.19322

Probeset Accession #: AA088458
Nucleic Acid Accession #: AA088458

Coding sequence: 862-1995 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

GCCCTTQGAC ACTGACATGG ACTGAAGGAG TAGAATGGAG CACGAGGACA CTGACATGGA 60 

CTGAAGAAAA AGGAGCTGGA GCAGGAGAAG GAGGTGCTGC TGCAGGGTTT GGAGATGATG 120 

GCGCGGGGCC GCGACTGGTA CCAGCAGCAG CTGCAAOSAG TGCAGGAGOG GCAGCGCCGC 180 

CTGGGCCAGA GCAGAGCCAG CX3CCGACTTT GGGGCTGCAG GGAGCCCCCG CCCACTGGGG 240 

CGGCTACTGC CCAAGGTACA AGAGGTGQCC CGGTGCCTGG GGGAGCTGCT GGCTGCAGCC 300 

TGTGCCAGCC GGGCCCTGCC CCCGTCCTCC TCCGGGCCCC CCTGCCCTGC CCTGACGTCC 360 

ACCTGACCCC CGGTCTGGCA GCAGCAGACC ATCCTCATGC TGAAGGAGCA GAACCGACTC 420 

CTCACCCAGG AGGTGACCGA GAAGAGTGAG CGCATCACGC AGCTGGAGCA GGAGAAGTCG 480 

GCGCTCATTA AGCAGCTGTT TGAGGCCCGC GCCCTGAGCC AGCAGGACGG GGGACCTCTG 540 

GATTCCACCT TCATCTAGTC CTTGTGGGCC G05TGGGCCC CCAGGGCCAG CCTGGCACTC 600 

AGCCCTTCGA GGGTGGGCGC CCCATCGCAC CCACCCTCTC TGQCTGGAQA CCCCCGGCAG 660 

GCCCAGGCAC agtcccggag tgggcgcctt cctgcogccc TTGCCAGATG GGCTCCCCAG 720 

GCCTGCCCCC GGGTGGTCCC CGCACCGAGC GCTTGACTCC GTTTKGGCTC CTGGTTGYTG 780 

ACATGGGCTG GGGGCTCTCT TGAGTCCGCA TAGTCCGCA3 CTACTACTGG CCGCTGTCAG 840 

TGGACAGTGG GGTACCCCTC CATGAGTTAG C5TCCCCCCG TTTCCAGCGG K3CCGCCCTG 900 

GGTCCCATCT TCAGGOAAAO GCACTGCCCA CSCCAGGCTG CACITCCAAC AAOGGGCAGC 960 

AGAGGGCGCG GGGCGGCTCC GAGGGGGGTC CAAGGGCAGC TTCCCGCTCA AOCAGGGCAC 1020 

CAGGACGAGG TGGCTGTAGC TCGGACGGAC GGAAGTAGAT GGAGGQGGTG GGGACGGCCT 1080 

GTAAGCGQGG GGTGCCTGCC TGGCTGGGGA GCCOCAGGGA TAGOGGTOGG ACTTCAGGTT 1140 

CTGGCCAAGG CTGAGGGACC CTGGCTGCAG CGGATCQGCA GGCCGGGTGG GCGWSftOCTT 1200 

GGOCTGCATQ TGCCTCCCAC AGACGCTGGG GTGATGGCCT TCCCCCTCTT GGCCGGGACG 1260 

TTGCCCCACG TTGAGTCCCA CACAACATCC TGTGAGCCTG GCTCCOCAGG AGGGCCCCCA 1320 

GACAGCTOCC AGGCACGTCA TAGGCAAAGC CTOTTTCCCC CGACTCAGGA TTTCCAAGGC 1380 

CTGGGGTOCT GCTCACCCCC CTTTGCTCTC ACGCCCAGOC TCTCCCCRGG TTTCAGCTGG 1440 

GAGAGGCCAC CTCCCTCAGC CAAGGAAAAC GAGAACCCCC AGGGTACAGG AGGAGGCTGG 1500

GGCAGGTCCC CTTGGGTGTC ACTCCCTCAG CCCCTGCOCA GGCCCACTCC CQCTGQTGCT 1560 

GGAGTACQCA CTGGTGGGGG GGCOCTGCTC AGCCCAACCT GGAGGGTCCC AGTGTCACCA 1620 

QAACCAGGGG CACGGCAACA GCATCGATGG GTTCTGCAGG CCAGGOGCCC CGATGCGGGG 1680 

TCAGTGTGTG TGGGGCGCAG GBCCTCCGAT ©OGGGGTCAG TGOGTGGGGG GCGGAGGGCC 1740 

CCCGATQCGG GGTCAGTGCG TGGGGGGCGC AGGGCCCCCT CGTGTCCAGG GCACTTTGGT 1800 

ACACTGTCCC ACAAGGCACC TGTCTCAGAG GAGGGGCCCT GGCAGGCAGC GTGGCAACTC 1860 

CCTTCCGGA£3 CCCAGCICCA TGCTAACCTG CCCACAGCAA CCCCACAGAQ CCACATTCCC 1920 

TGCTGCACCT GGTCTGCAGG GGTGTCCCAG GACAGGCCCA AGTCA0CCCA GCATGCAGCT 1980

GCCCTCCTAC CCTGAAGATG GGAGTGGGCT TTCCAGGGGA CATAAGGATG TCAGGCCTGG 2040 

ACCTCCTGGG CAGGAAAGGG TGCAGGTCCT GAGGGCCTGT GCCCCACAGC CCCAGCACCC 2100 

AGGTGGACTG CAGCGCAGTG GGTGGGCCAG TGGCAGCCAG GGAGAAGCCC CCCGTCAGCA 2160 

GGCTGGGGTC rGCCCACCAG GGCCTCCCCA CGTCTGCCTT TGAGGCTTGCC TGCCATGCCC 2220 






TGGGGOATCC 
GGTGACXTCA 
GAGACAGGCT 
AAAGAAATAG 
CACOAGGGGA 
GCAGACCCTG 
GAGCAGCGTC 
GCGTGCACAC 
CAGAAGTGTC 



CTGGAATCCC 
CCGCATCTCT 
TCATAAACAC 
TAGACCCAGA 
AGAAATAAAA 



TGGCATC1TT 
TCAQGAGACC 
OGCACCTCCG 
GTCCTCCCRQ 

gaatttaaag 
cctggagcct 

CCTGGGCTCT 
TGTGATGACA 
CCCAGTTGAG 
ATCAAGTTCC 
AGCACTTGAG 
ACAARAAAAA 
CACAAGGAAA 
TACTAGAATT 
GAGATTTCTG 



ACTGGACTGG 
GCCCACATAG 
GAAAAACTGC 
TTTACAGCTT 
GC CCCGGCTG 
GCCCTAGGAC 
ATCCGCUAGG 
CCCGGAAATG 
AATCTGCCCC 
AAGGAAAAGG 
GCCAGGAOTT 
AAAAAGAAAG 
GAATACACTA 
ATCAGAGAGA 
GAAAGATOAA 



AAGCAGQAGA 
AGCTGGACCC 
CTTTCAGCCT 
GAAATCAGGC 
GCAGGGTCTA 
GCTGGGOGGG 
TGCCAGTAGC 
TCTCAGGATG 
AGAGGAACAC 
AACATCTCAG 
CCAGAGCAQC 
AAAGAAAATG 
TGAGACCCAG 
ATATAAAGTA 
AAAAAA 



CAGAACAGTG 
CGCAGCTGAA 
TGGTGTTCCG 
TAG1GAGTGG 
GGTGGCTGGC 
TCAGTCT003 
GTGTGCAGGT 
TTGAAATGTG 
ACCCACACCA 
COGGGCGTGG 
CTGGGCAACG 
AGAOATCCAG 
CAGAAGCAAC 
ACAGTGTTTT 



TCTGTCCCGG 
GCGOAAATGT 
TGCAAGGTGA 
CCCTGGAGAC 
AGAGGCACAT 
TGCAGGATGT 
ACATACACGT 
TCCTTGGGGG 
GGCCTCAGGA 
TGGTTCACGC 
CAGTGAGAGA 
GTTTAAAAAT 
AGATTGACTC 
ATATATCTAA 



2280 
2340 
2400
2460
2520
2580
2640
2700
2760
2820
2880
2940 
3000 
3060 



A58 DNA sequence
Gene name: ESTs
Unigene number: Hs.157601
Probeset Accession #: W07459
Nucleic Acid Accession #: AC005383
Coding sequence: 328-2751 (underlined sequences correspond to start and stop codons)






i 
I 

GACAGTG-TTC 
TTTTATTTGC 
CCTGGCGGTA 
ACAAACAGGT 
CCCCCTGGCC 
TGGCOGCTLT 
GTTTTCCTGT 
GAAACCATCG 
ATCATGrTTC 
CACTTTGCCA 
GCATTCC5W3T 
CAGG AAGTGA 
CTTGCTCTGA 
CAGATOCTCA 
CAGCTGAAGG 
GAQCTGCATG 
GAGGATGCCA 
ACGCCAGACT 
GAGTTOGCTQ 
GCACACTGTC 
AGGACCACCT 
CCAGAABQAC 
TQTGCCCTGA 
GCGGGCACCA 
GGCGTGCTGA 
CrGOTGOCGG 
GGCATTCCCT 
CGTGGCTTCG 
CTCACTGAGT 
OAGCTGCTCC 
GGCftQOCCAA 
GAGCTGCAGQ 
CTCGTCrrCA 
AGCTTTGTGA 
CTGGTGGTGT 
GCTQCOATGC 
ACCGCCCTGC 
GTCCCCAAAG 
GCCCAGAAGC 
AGTGAGGGTC 
GGCGAOCTGC 
CCAG1CAAOC 
GGGAGCTAOC 
TGGAGCTCTT 
ATOGCTCCDQ 
GGCACTGAAA 
TTCCOGCCGT 
ATGCTGCTTA 
TTGATGrGTA 
CTGCCACCTT 
OGTTCCTTTQ 
AGGCCTTTAC 
GCAQCTTTTC 
CTTGAGGGAC 
GGTCTCAGAC 
TGTGCATGGG 
ACCTTGAAGG 



11 
I 

GOGGCTGCAC 
AGACCTGGGC 
GTTCCTCOGA 
QTCCCACGTG 



TTTCCftGAGT 
GGAAGATTTC 
TGTTAGATGG 
TCACAGTCTG 
TCAGTTCCAC 
AGGCAAGAAT 
AATACCTTCT 
TCATCGTCAC 
AAAGGGGTGT 
CACTGGCCAG 
CCAACGOCCT 
GCAGGGTCGA 
GCAATGCCCC 
CCTTCTACAG 
GCCCAGGCCC 
TGGACGGCTA. 
AGCTGAGCCT 
CTCTGGACGG 
GCGAGGACTC 
TGCCTGTGGG 
TCCGTGGTGG 
GGAGGGCCAC 



TGCTGGGTGT 
AGCATGTGAT 
GGAAGCXGT3 
TGTTGGACAC 
GAAGCTGTGC 
ATGGCAGCCA 
TG03GGOCAT 
TGCACATCTA 
CTGTGGTGGT 
TGAGGAACAA 
TQCGGAGGCT 
GGTACCACCA 
TCTGCAAACC 
GCTGCAAGTG 
GCTCTGTATG 
TGCAGGAGGG 
TGGTGCCXAC 
GGCCAGGACC 
GAGACAAGAA 
AGTAAATACC 
TCCCT1GAG3 
CACACAATCA 
TAGAGCATCC 
CACTTOCCCA 
GTTTGTGACT 
TQAATGTQAC 
CCCAGOTCTG 
TCTTC 



21 
I 

CGCTCGGAGG 
CGATGCCGCT 
CCTCAGCCQG 
GCAGCOGCGC 
CCGGGTCTGT 
TATCAACATG 
GCCCCCATCT 
AGCTGCCAGC 
GTCTAACAGC 
TGAOGGTCTG 
TCCTCATCTG 
CAAGAGGATG 
GCACAGAGGQ 
TGATGGGAAG 
CACTGTGTTT 
CGAGCCTAGA 
CTTCAQCACC 
GGCTCAOCCC 
ATGCTGGAGA 
CTGGAAGAGA 
CTGTGACTCG 
CCAGTGCCTC 
GGAATGGAGG 
CTTC5CTGCGG 

GGAOTACCAG 
CCCCACCCTG 
CAGGACAGGC 
GGATGAGGTT 
AGGCAGTGAG 
GGTCTACTCG 
CAGCCGGCAQ 
CTGTGCCTCA 



31 
I 

CTGGGTGACC 
TTAAAAAACG 
GTCGGGTCGT 
CCCGGGCGCG 
GAGTAGAQCC 
CCCCCTTTCC 
CTCCCTCTCC 
AAAATQATGT 



GACATCAGCC 
GAATTCCCCT 
GTTTTCAAAG 
TTGCCTGGaG 



GCTGTGGGGG 
OOGCAGCACG 
CTCAGCRGCT 
TOTGAGCACA 
GGATCGCGGC 
GTGTTCCTAA 
CAGCCCTGCC 
TGCCCGCTGG 
GTCGACCTCC 
GCCAAAGTCT 
GTGGGTGTGG 
GATGTGCCTG 
ACGGGCAGTG 
CAGGACCGGC 



GGTGGAGACT 
TAGCCAGGCC 
TGACAAAGTG 
GCTCACAGGC 
TGGCAXCTCT 
TGCAGGTCCC 
GGACGTGCTC 
CAGCCCGTGC 
TCGGGATGGC 
TGTGAGCCAG 
CAGCAGCCGT 
CTTCTGGAAT 
ACTATTCTCA 
AGCAGCTGAT 
CACTTTCTGT 
ATAAACAAGQ 
ATGCTOGCCA 
TTTGGACGGC 
GAGACATTCT 
TCTTGGCGAC 
CAATTAACCA 

gagggccacb 



GCCGTGCGGG 
GATCCTCAGG 
CGGCCAGGGT 
GTAGGGCCCG 
GAGGTGAACC 
GCCTTCGGGC 
CCCTAOCTAG 
ATGACOGTCC 
GGGAGAGGCG 
GTCTTGGTCG 
CGGGATTCCC 
ATTGAGTGGC 
ATGAATGAGG 
TGGGAGGGCC 
GGATGGATtrC 
ACCCCTCCCA 
GTCTGTG0OC 
CTGAGGGAGG 
GTCACCCACA 
ACCTOCTGTG 



GAATGTTGTT 
GAAGGCCACG 
GGATGCATTT 



GCTTGGTTGA 
TAAAATCGTT 



41 
I 

CGCGTAGAAG 
CGAGGGGCTC 
GCCGCCCTGT 
CCTCCTGTGA 
GCCCGG3CAC 
TGTTGCTGGA 
AGGAAGXCCA 
GGTGCTCGGC 
GGAGCTTTGA 
CCGAGAGGGT 
TGGATTCATT 
GAGGGCGCAC 
GCAGAAATGC 
ATGTGGCACT 
TCAGGTTTCC 
TQCTGTTGGC 
CGGCCATCTG 
GGACGCTGGA 
GGACCCTTGC 
CCCACCCTGC 
AGAATG GAQQ 
CCTTTGGIAGG 
TCTTOCTGCT 
TGGTGAAGCG 
CCACATACAG 
ACCTGOTCTG 
CCTTGGGGCA 
CACGTAGAGT 
DGCGTCACGC 
CAGAGCTGGA 
ATCTGTTC3A 
GCOGGACACA 
AGAATTTTGC 
CTGACGTGAC 
TGGACACCAA 
GTGGGGTGGG 
AGAGGGGTGC 
CAGAGGATGC 
TGGGGGTGGG 
TGATCCAOST 
TGTGTGGAGA 
GGAGCTGCGT 
OOCACTGOQA 
TTGAGACGCC 
GCAACTACAG 
CAQGTCCTTA 
AGGATG7CCC 
AACGATGTTG 
CCTIGTTQAG 
ACTTAAATIT 
GACACAGTAA 
GCCTTTCAAG 
GCATTGAGTC 
GTGTGGAAGA 



51 
1 

TGAAGTACTT 
TATGCACCTC 
CCCAGGAGAG 
TCCCGTAGCG 
CGAGCGCTGG 
GGCCGTCTGT 
TGTAAGCAAA 
TGCAGTGGAC 
AAGGTCCAAG 
CAGAGTGGGA 
TTCAACCCAA 
GGAGACDGAA 

GCCATCCAAG 
CAGGTGGGAG 
TGAQCAGGTG 
CTOCAGCGCC 
GATGGTCCGG 
GGTGCTGGCT 
CACCTGCTAC 
CACATGTGTT 
GGAGGCTAAC 
GGACAGCTCT 
GTTTGTGCGG 



GAGCCTCGAT 
GGCGGCAGAG 
GGTGGTTTTG 
AAGGGCGCGA 
GGAGATCACA 
CCAAATCCCT 
AGCCCTGGAC 
TCAGATGCAG 



CTGAGTOOTG 



ACCCACCCGG 
CTCAGCCGGC 
CCGGCCTGGT 
AGCCGTTCCT 

GGCAGCTTAC 
AGCCAAGCAG 
CCTGCAGAAT 
GAACCGTGAG 
CCTGAGGCAC 
AGAAGGCCTG 
GAATGTCTGC 
AACTGCAGCG 
TTGAAAAGTT 
GCTAXGTCAT 
AGCGGCCTGA 
TGCCCAGCAG 
ATGOAAAGCA 
TGAAAGGGGG 
GACXTOGAAA 
GGGGCTGAGT 
AGCAGTGTCC 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780
840
900 
960 
1020 

1080

1140 
1200 
1260 
1320 
1380
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880
2940 
3000 
3060 
3120 
3180
3240 
3300 
3360 






WO 03/042661 



PCT/US02/36810 



A59 Protein sequence

Gene name: ESTs
Unigene number: Hs.157601

Protein Accession #: none found

Signal sequence: 1-17
Transmembrane domains: none found
VWA domains: 49-223; 341-518; 529-706
EGF domains: 299-333; 715-748
Cellular Localization: secreted

1 11 21 31 41 51

| | | | | |

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIHFLLDGSK 60 
9VGKGSFERS KHFAITVCDQ LDI8PERVRV GAFQPSSTPH LEFFLDSFST QQEVKARIKR 120 
MWKGGRTET SLMrKYLLHR GLPGGHNASV PQILIFVTDG KSQGDVALPS KQLKERGVTV 180 
FAVGVRFPRW KELEAIiASBP RGQHVKLAEG; VSDATNGLFS TLSSSAICSS ATPDCHVEAH 240 
PCBHRTDEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSKK RVF1/THPATC YRTTCPGPCD 300 
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA KCALKLSLEC RVDCbLFI/LiDS fiAQTTLDGFL 360 
RAXVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDWPpr,VWSIj DGIPFRGGPT 420 
I>TGSALRQAA ERGFGSATRT CQDRPRRVW LLTESHSEPB VAGPARHARA RELbLLGVGS 480 
EAVRAELfiBI TQSPKHVMVY SDPQDIiFNQI PELQQKLC5R QRPGCRTQAIi DLVFMLDTSA 540 
SVGPENFAQM QSFVRSCAIjQ FEVNPEWTQV GLWYGSOVQ TAFGLDTKPT RAAMLRAISQ 600 
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVWLT QGROAEDAAV PAQKLRKHQI 660 
SVI»W<3VGPV LSEGLRRLAG PKDSLIHVAA YADIiRYHQDV LIEWLOGEAK QFVNLCKPeP 720 
CMNEG6CVLQ NG9YRCKCRD GWEGPHCENR BWS8CBVCVS QGWILETPiR HMAPVQEGSS 780 
RTPPSHYRB3 LGTEWVPTFW KVCAPGP 

A60 DNA SEQUENCE

Gene name: Homo sapiens type II membrane serine protease mRNA

Unigene number: Hs.63325
Probeset Accession #: AA411502
Nucleic Acid Accession #: NM_016425

Coding sequence: 1-1314 (underlined sequences correspond to start and stop codons)



1 
I 

ATGTTACAGG 
AAACCCCGTA 
CTGAQCCTGG 
TACTTCCTCT 
CTGGACTGTC 
GCAGTGGCAG 
GGGAACTGGT 
AGGCAGATGG 
GATCTGGATG 
GGOCCCTGTC 
AAGACCCCCO 
AQCKTCCAiGT 
CTCACGGCAO 



11 
I 

ATCCTGACAG 
TCCCCATGQA 
CGAGTATCAT 
GCGGGCAGCC 
CCTTQGGGGA 
TCCGCCTCTC 
TCTCTGCCTG 
GCTACAGCAQ 
TTGTTGAAAT 
TCTCAGGCTC 

ACGACAAACA 
CCCACTGCTT 



ACTTTCTCAG 
GCCACCCCAC 
GACATACTGC 



GACACCTGCC 
GTGGGCATCG 
AAGGTCTCAG 



TGTACCCCAA 
GCACAGTCAG 
TCTQGATCAT 
TGCAGGCGTC 
GGGAAGTCAC 
AGGGTGACAG 
TTAGCTGQGG 
CCTATCTCAA 



21 
I 

TGATCAACCT 
GACCTTCAGA 
CATTGTGGTT 
TCTCCACTTC 
GGACGAGGAG 
CAAGGAC0GA 
TTTCGACAAC 
CAAACOCACT 
CACAGAAAAC 
CCTGGTCTCC 
TGGGGAGGAG 
GCACGTCTOT 
CAGGAAACAT 
CTTCCCATCC 
AGACAATGAC 
GCCCATCTGT 
TGGATGGGGC 
AGTCCAGGTC 



31 
I 

CTGAACAGCC 
AAGGTGGGGA 
GTCCrrCATCA 
ATCCCGAGGA 
CACTGTGTCA 
TCCACACTGC 
TTCACAGAAO 
TTCAGAGCTG 
AGCCAGGAGC 
CTGCACTGTC 
GCCTCTGTGG 
GOAGGGAGCA 
ACCGATGTGT 
CTGGCTGTGG 
ATCGCCCTCA 



TTTACGAAGC 
ATTGACAGCA 
ATGTGTGCAG 
CTGATGTACC 



41 
I 

TCGATGTCAA 
TCCOCaTCAtP 
AGGTGATTCT 
AGCAGCTGTG 
AGAGCTTCCC 
AGGTGCTGGA 
CTCTCGCTGA 
TGGAGATTGG 
TTCQCATGCG 
TTGGCTGTGG 
ATTCTTOGCC 
TCCTGGACCC 
TCAACTGI3AA 
CCAAGATCAT 
TGAAGCTQCA 
TTGATGAGGA 
AGAATGGAGG 
CACGGTGCAA 



TGGTGGGCCC 
CTATGGCTGC 
CTT3GATCTAC AATGTCTGGA 



GCACCCCAGG 
AGGCTGAGCT 



51
1 




ACCCCTGCGC 


60 


CATAGCACTA 


120 


GGATAAATAC 


180 


TGACGGAQAG 


240 


CGAAGGGCCT 


300 


CTCGGCCACA 


360 


GACAGCCTGT 


420 


CCCAGACCAG 


480 


GAACTCAAGT 


540 


GAAGAGCCTG 


600 


TTGGCAGGTC 


660 


OCACTGGGTC 


720 


GGTGCGGGCA 


780 


CATCATTGAR 


840


GTTCCCACTC 


900 


GCTCACTCCA 


960 


GAAGATOTCT 


1020 


TGCAGACGAT 


1080 


AGGGGGTGTG 


1140 




1200 


AGTATACACC 


1260 


GTAA 





A61 Protein sequence
Gene name: Homo sapiens type II membrane serine protease mRNA
Unigene number: Hs.63325
Probeset Accession #: AA411502
Protein Accession #: NP_057509
Signal sequence: none found
Transmembrane domains: 31-53
LDLa domain: 54-94
Tryp_SPc domain: 204-423
Cellular Localization: plasma membrane/ER



1 11 21 31 41 51 

| | | | | |

MLQDPDSDQP LNSLDVXPUl KPRIPMBTFR KVGIPIIIAL LSLASIIIW VLIKVIEDKY 60 

YFLCOQPIiHF IPRKQLCDGE LECFDGEDEE HCVXSFFBGP AVAVRL8KDR STLQVLDSAT 120 

GHWFSACFDW FTEAIAETAC RQMGYSSKPT FRAVBTGPDQ DLDWE1TEN SOBLRMHNSS 180

GPCLBGSLVS LHGAOBKSI. KTPRVVGGSE ASVDSWPWQfV SIQYDKQHVC GG3IW5PHWW 240 

LTAAHCFRKH TDVFNWKVRA QSDKJX38FPa LAVAKII1IE FUPMYPKDND lALKKLQFPIj 300 

TFBGTVRPIC LPFEDEBLTP ATPLWIIGWG FTKQMGGKMS D1LL0ASVQV 3USTRCHADD 360

AYQQEVTBKM MCAQIPEGGV DTCQBDSGGP LMYQSDQKHV VQIVSWGYGC GGP9TPGVYT 420 
KV3AYLKW1Y NVWSCAKL 






A62 DNA SEQUENCE

Gene name: ESTs; Weakly similar to LITHOSTATHINE 1 BETA PRECURSOR [H. sapiens]

Unigene number: Hs.105484

Probeset Accession #: AA314779
Nucleic Acid Accession #: none found

Coding sequence: 103-579 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

CCAAACAGAT TTGCAGATCA AGGAGAACCC AGGAGTTTCA AAGAAGCGCT AGTAAGGTCT 60 

CTGAGATCCT TGCACTAGCT ACATCCTCAG SGTAGGAGGA AGATGGCTTC CAGAAGCATO 120 

CGGCTGCTCC TATTGCTGAG CTGCCTGGCC AAAACAGGAG TOCTGGGTGA TATCATCATG 180

AGACCCAQCT GTGCTCCTGG ATGGTTTTAC CACAAGTCCA ATTQCTATGG TTACTTCAGG 240 

AAGCTGAGGA ACTGGTCTGA TGCCGAGCTC GAGTGTCAGT CTTACGGAAA CGGAGCCCAC 300 

CTGGCATCTA TCCTGAGTTT AAAGGAAGCC AQCACCATAG CAGAGTACAT AAQTGGCTAT 360 

CAGAGAAGCC AGCCGATATG GATTGGCCTG CACGACCCAC AGAAGAGGCA GCAGTGGCAG 420 

TGGATTGATG GGGCCATGTA TCTGTACAGA TCCTOGTCTG GCAAGTCCAT GGGTGGGAAC 480 

AAGCACTGTG CIGAGATGAG CTCCAATAAC AACTTTTTAA CTTGGAGCAG CAACGAATGC 540

AACAAGCGCC AACACTTCCT GTGCAAGTAC CGACCATAQA GCAAGAATCA AGATTCTGCT 600

AACTCCTGCA CCAGCCCCGT CCTCTTCCTT TCTGCTAGCC TGGCTAAATC TGCTCATTAT 660

TTCAGAGGGG AAACCTAGCA AACTAAGAGT GATAAGGGCC CTACTACACT GGCTTTTTTA 720 

GGCTTAGAGA CAGAAACTTT AGCATTGQGC CCAGTAGTGG CTTCTAGCTC TAAATGTTTG 780

CCCCGCCATC CCTTTCCACA GTATCCTTCT TCCCTCCTCC CCTGTCTCTG GCTGTCTCGA 840 

GCAQTCTAGA AGAGTGCATC TCCAGCCTAT GAAACAGCTG GGTCTTTGGC CATAAGAAGT 900

AAAGATTTGA AG&CAGAAGG AAGAAACTCA GGAGTAAGCT TCTAGACCCC TTCAGCTTCT 960 

ACACCCTTCT GCCCTCTCTC CATTGCCTGC ACCCCACCCC AGCCACTCAA CTCCTQCTTG 1020 

TTTTTCCTTr GGCCATAGGA AGGTTTACCA GTAGAATCCT TGCTAGGTTG ATGTGGGCCA 1080 
TACATTCCTT TAATAAACCA TTGTGTACAT AAGAGAAAAA AAAAAAAAAA AAAAAAAAAA 



A63 Protein sequence
Gene name: ESTs; Weakly similar to LITHOSTATHINE 1 BETA PRECURSOR [H. sapiens]
Unigene number: Hs.105484
Probeset Accession #: AA314779
Protein Accession #:
Signal sequence: 1-22
Transmembrane domains: none found
C-type lectin domain: 47-156
Cellular Localization: secreted



1 11 21 31 41 51 

| | | | | |

MASRSMHTiTiTi IiliSCLAKTGV LGDIIMRPSC APGWFYHK5H CYGYFRKLKN WSDAEDEOQS 60 
VGNGAHIiASI LSLKEASrlA BTISGYQRSQ PIWIGIiHDPQ KRQQWQWIDG AMYLYRSWSG 120 
KSMGGNKHCA EKSSNNNFI>T WSSNECWKSQ HPLCRYHP 



A64 DNA SEQUENCE

Gene name: Cadherin 3, P-cadherin (placental)

Unigene number: Hs.2877

Probeset Accession #: X63629

Nucleic Acid Accession #: X63629

Coding sequence: 54-2543 (start and stop codons are underlined)



1 11 21 31 41 51
| | | | | |

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCATGGGGC 60 

TCCCTCGTGG ACCTCTCGOG TCrCTCCTGC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 

CCTCCGAGCC GTGCCQGGCG OTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 180 

CGGAGCAGGA OCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAOAGC 240 

CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 300 

AAAGAAGGTC ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCT TAC 360 

QAAQACACAA GAGAGATTGO GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 420 

CCTTCCCCCA GAOACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 

ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 

AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 600 

TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 

TCATCGTGAC CGACCAGAAT GACCACAAGC CCRAGTTTAC CCAGGACACC TTCCGAGGGA 720 

GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 780 

ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840

AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 

CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 

tggatgggga cggctocacc accacggcag tggcagtagt GGAGATCCTT GATGCCAATG 1020 

ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGlGCCTGAG AATGCAGTGG 1080

GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1140 

GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA XTTTACCATC ADCACOCACC 1200

CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 

AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGXGCTGAAG CTCDCAACCT 1320 

CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCOCAC 1380 

CCTCCAAAGT OGTTGAGGTC CAGGAGGGCA TCCCCACTGG QGAGCCTGTG TGTGTCTACA 1440 

CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 






CAGGGTGGCT 
GTGAGOATGA 
ATGGAAGCDC 
ACCA.TGQCCC 
ACGTGCTGAA 
TCACAOATGA 
TCTTGTCCCX 
ACCATGGCAA 
ATGTCGAAAC 
TCCTGGCTCT 
TCAAGGAGCC 
AAGAGGGGGG 
AGGCCAGQCC 
TGTACCGTCC 
AGGCGGCTAA 
AGOSCAGOGG 
ACCAAGATTA 
ACG3TOGCGG 
CACAGAG CAT 
GOAAGTGGCC 
TCCTTAGCCT 
CACCTGGGCC 
TGCTCAACCC 
CTGGAATGOA 
TCAAAAOGTT 
TCCTGCATTT 
ATACTGAGTG 
GAAGGGTGAG 



AGCCATGGAC 
GCAGTTTGTG 
TCCCACCACT 
AGTCCCTGAG 
CATCRCGGAC 
CTCAGACATC 
QAAGAAGTTC 
CaAAGAGCAG 
ctqccxttgoa 
GCTGTTCCTC 
CCTCCTACTC 
TGGCGAAGAG 
GGAGGTGGTT 
TAGGCCAGCC 
CACAGACCCC 
CTCCGACGCC 
CGATTATCTG 
GGAGGAOGAC 
CTCCAAGGGQ 
GTAGCAACTT 
TTCAGGATGG 
AGGGTTGCCT 

ACCT TCTTAG 
AGAGAAAGTT 
CTQGTTTCCA 
TGCCTAGGTT 
GACAATCGTG 



CXZAGACAGTG 
AGGAACAACA 
GGCACGGGAA 
CCOCGTCAGA 
AAGGACCTQT 
TACTGGACGG 
CTGAAGCAGG 



OCCTGGAAAG 
CTGCTQOTQC 
CCAGAAGATG 
GACfZAGGACT 
CTCCGCAATG 
AACCCAGAT3 
ACAGCCOCGC 
GCGTCCCTQA 
AACGAGTGGG 
TAGGCQGCCT 
TCTCAGTTCC 
GGCGGAGACA 
AGGAATGTGG 
CAGAGGCCAA 
GCCTGGQCCT 
GCCTCCTGGT 
CTTCAAAAGT 
GACCCCAATG 
GCCCCTTATT 
TATATGTACT 



GGCAGGTCAC 
TCTATGAAGT 
CCCTTCTGCT 
TCACCATCTG 
CTCCCCACAC 
CAGAGGTCAA 
ATACATATGA 
TCAGGGCCAC 
GAGGTTTCAT 
TGCTTTTGTT 
ACACCCGTGA 
ATGACATCAC 
ACGTGGCACC 
AAATCGGCAA 
CCTACGACAC 
GCTCCCTCAC 
GCAGCCGCTT 
□CCTGCAGOQ 
CCCTTCRGCT 
GGCTATOAGT 
GCAGTTTGAC 
GTTTCCAGAA 
GCTGTGACTG 
GCAACTTAAT 
GCAGCCCAGA 
CCTCCCATTC 
TTTTATTTTC 
AGAACTTTTT 



AGCTGTGGGC 
CATGGTCTTG 
AACACTGATT 
CAACCAAAGC 
CTCCCCTTTC 
CGAGGAAGGT 
CQTGCACCTT 
TGTOTQCGAC 

GGTQAGAAAG 
CAACGTCTTC 
CCAGCTCCAC 
AACCATCATC 
CTITATAATT 
CCTCTTGGTG 
CTCCTCCGCC 
CAAGAAGCTG 



GAGGACTTCG 
CTGAOGTTAG 
TTCAGCACTG 
GCCTCTTACC 
ACCTACAGTG 
1TTTTTTTTT 
GCTGCTGGGC 
GGATGGATCI 
CCTGTTQCQT 
TATTAAAGAA 



ACCCTCGACC 
GCCATGGACA 
GATGTCAACG 
CCTGTGOGCC 
CAQGCCCAQC 
GACACAGTGG 
TCTCTGTCTG 
TGCCATGGCC 



AAOCGGAAGA 
TACTATGGCG 
CGAGGTCTGG 
CCGACACCCA 
GAGAACCTGA 
TTCGACTATG 
TCCQACCAAG 
GCAGACATOT 
AACBTCAGGC 
GAGCPTGTCA 
AGTGGTTGCT 
AAAACCTCTC 
TGCCGTAAAA 
GACTTTCTCT 
AA7GCTATCT 
OCACTGGOGG 
CIGCGTTTTT 
TGCTATAGAT 
A 



A65 Protein sequence
Gene name: Cadherin 3, P-cadherin (placental)
Unigene number: Hs.2877
Probeset Accession #: X63629
Protein Accession #: CAA45177
Signal sequence: 1-24
Transmembrane domain: 659-675 (second underlined sequence)
Cellular Localization: plasma membrane



1 
I 

KGLPSGPLAS 



21 
t 

CAASEPCRAV 



KGPFPQRLNQ 
YELFGEAVSE 
DEDDAIYTYN 
TDMDGDGSTT 



PTSTATIWH 
DPAGWIAMDP 
VNDHGPVPEP 
TWTjSLKKTI* 
GAVLALLFLL 
GLEARPEWIi 
DYEGSGSDAA 



11 
1 

LLLLOTCWLQ 
EDFTVRNGET 
LKSNKDRDTK 
NGASVEDFMSI 
GWAY3IHSQ 
TAVAWEILD 
GDDGDHFTIT 
VBDVNEAPVF VPPSKWEVQ 
DSGQVTAVGT UIREDEQPVR 
HQITXCNQSP VRHVLNITDK 
KQDTYDVHLS LSDHGNKEQL 
I>VLL1jLVRKK RKIKBPMiLP 
RNDVAPTIIP TPMYRPRPAK 
ELSSLTSSAS DQDQDYDYIJJ 



31 
I 

FREABVTLEA 



IPYSITGPGA 
I6IIVTDQND 
EPSCDPHDLMP 
ANDNAFMFDP 



DSPPEGVFAV 
HKPKFTQDTF 
TIHKSTGTIS 
QKYEAHVPEBT 



EGIPTGEPVC 
JWIYEVMVLA 
DLSPHTSPFQ 
TV1RATVCDC 
EDDTEDNVFY 
PDEIGNFIIE 



1560
1620 
1680 
1740 
1800
1860
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 



41 


51 




i 

GGAEQEPGQA 


1 

LGKVPMGCPG 


60 




VAPISVPEHG 


120 


EKETGWLLUJ 


KPLDRESIAK 


180 


KSSVIiBGVItP 


GTSVMQVTAT 


240 


VISSGLDREK 


VPEYTLTIQA 


300 


AVGHEVQKLT 


VTDLDAPNSP 


360 


KNQHTLYVEV 


TMEAPPVLKIj 


420 


VYTAEDPDKE 


NQKISYRJUiR 


480 


MDNGSPPTTG 


TGTLLLTLID 


540 


AQI/TBDSDlY 


WTAEVMEEGD 


600 


HGHVETCPGP 


WKGGFILPVL 


660 


YGKBGGGEKD 


QDYDlTQTiHR 


720 


NIiKAANTDPT 


APPYDTOMFF 


780 


DMYGGGEDD 










A66 DNA SEQUENCE

Gene name: ATPase, Ca++ transporting, type 2C, member 1

Unigene number: Hs.106778

Probeset Accession #: N51919
Nucleic Acid Accession #: AF189723

Coding sequence: 1-2712 (underlined sequences correspond to start and stop codons)



/U ATTI 






ATGATTCCTG 
ATTCTCCAAG 
TTTCATGGCT 
TCTCAQTT7A 
ATGCATCAGT 
GCCTTTGTTC 
CCAGAATGCC 
CCAQQTGATA 
GAGGCTGTGG 
AAGGTGACAG 
GCCTTTATGG 
GAAAATTCTQ 
CCTCTGCAGA 
ATAGGAATCA 



11 
I 

TATTGACATC 
CTGATCTTCA 
GGAATOAGTT 
AAAATCCCCr 
TTGATGATGC 
AGGAATATCG 
ATTGTGTGCG 
CAGTTTGCCT 
AXCTTTCCAT 
CTCCTCAGCC 
GAACACTGGT 
AATTTGGGGA 
AGAGCATGGA 
TCATGTTGGT 



21 
I 

AAAAAAAGCA 
GAATGGTCTA 
TGATATTAGT 
TAITATGCTG 
CGTCAGTATC 
TTCAGAAAAA 
TGAAGGAAAA 

TGATGAGTCC 
AGCTGCAACT 
CAGATGTGGC 
GGTTTTTAAA 
CCICTTAGGA 
TGGCTGGTTA 



31 
I 

AGTGAATTAC 
AACAAATOTG 
GAAGATGAGC 
CTTCTGGCTT 
ACTGTGGCAA 
TCTCTTOAAG 
TTGGAGCATA 
GATAGAGTTC 
AGCTTOAC3W3 
AATGGAGATC 
AAAGCAAAGG 
ATGATGCAAG 
AAACAACTTT 



41 
1 

CAGTCAGTGA 
AAQTTAQTCA 
CACTGTGGAA 
CTGCAOTCftT 
TACTTATCGT 
AATTGAGTAA 
CACTTGCCCG 
CTGCTGACTT 
GTGAGACAAC 
TTGCATCGAG 
GTGTTGTCAT 
CAGAAGAGGC 
CCTTTIRCTC 
ATATCCTGGA 



51 
I 

AGTTGCAAGC 



GAAGTATATT 
CAGTGTTTTA 
TGWACAGlT 
ACTTGTGCCA 
AGACTTGGTT 
ACS3CTTGTTT 
GCCTTGTTCT 
AAGTAACATT 
TGGAACAGGA 
ACCAAAAACC 
CTTTGGTATA 
AATGTTTACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 






ATTAGTGTAA 
AOQCTAGCTC 
ATTGTTGAAA 
AAGAATGAAA 
BGAGTTGGCT 
TATAACCCAG 
AGAAACAATA 
ATGGGTCTTG 
GAGCAAAAGT 
TGTTTTATGA 
GGQCAGACCT 
ATGGGCTCAG 
ACATTTCTTG 
ACAACACTCA 
GCAGTTGCAA 
GAAGAAATAG 
TTTTACAGAB 
TC2VGTTGTAG 
ATTGGAG7TG 
CTAGTGGATG 
AATAACATTA 
ATCTCATTGG 
ATCAATATTA 
GATGTCATTC 
CTTAAAATAC 
GAGCTACGAG 
TTTTTTGACA 
GGACTCTGCA 
CTAGTTATTT 
GATCTGTTOT 
AAGGTTGAAA 
CTTGAAGTAT 



GTTTGGCTGT 
TTGGTGTTAT 
CTCTGGGCTG 
TGACTGTTAC 
ATAATCAATT 
CTGTTAGCAG 
CTCTAATGGG 
ATGGACTTCA 
GGATGGCTGT 
AAOGTGCTTA 
TGACACrrAC 
CGGSACTCAG 
GCTTGGTGGG 
TTGCCTCAGG 
TCGCCAGTCG 
ATGCAATOGA 
CTAGOCCAAG 
CCATGACAGG 
CGATGGGCCA 
ATGATTTTCA 
AAAATTTCGT 
CTACATTAAT 
TTATGGATGG 
GTAAACCT CC 
TTGTTTCATC 
ACAATGTGAT 
TGTTCAATGC 
QTAATAGAAT 
ACTTTCCTCC 
TTCTTTTGGG 



AGCAGCAATT 
GAGAATGGIG 
CTGTAATBTG 
TCACATATIT 
TGGGGAAGTG 
AATTGTTGAG 
GAAGCCAACA 
ACAAGACTAC 
TAAGTGTGTA 
CGAACAAGTA 
TCAGCAGCAG 
AGTTCTTQCT 
AATCATTGAT 
AGTATCAATA 
TCTGGGATTG 
TGTTCAGCAQ 
GCACAAGATG 
AGATGQAOTA 
GACTGGTACA 
AACCATAAK3 
TAGATTCCAG 
OAACTTTCCT 
ACCCCCAGCT 
TCGCAACTGG 
AATAATCATT 
TACACCTCGA 
ACTAAGTTCC 
GTTTTGCTAT 
GCTTCAGAAQ 
TCTCACCTCA 
AAAGATCCAG 



CCTGAAGGTC 
AAGAAAAGGG 
ATTTGTTCAG 
ACTTCAGATG 
ATTGTTGATG 
GCGGGCTGTG 
GAAGGGGCCT 
ATCAGAAAAG 
CACCGAACAC 
AT TAAGTACT 
AGAGATGTGT 
TT GGCTTCTG 
CCACCTAGAA 
AAAATGATTA 
TATTCCAAAA 
CITTCACAAA 
AAAATTATTA 
AATGATGCAG 
GATGTTTGCA 
TCTGCAATCG 
CTGAGCACGA 
AATOCTCTCA 
CAGAGCCTTG 
AAAGACAGCA 
GTTTGTGGGA 
GACACAACAA 
AGATCCCAGA 
GCAGTTCTTG 
GTTTTTCAGA 
TCAGTGTGCA 
AAGCATGTTA 



TCCCCATTGT 
CCATTGTGAA 
ATAAAACTGG 
GTCTGCATGC 
GTGATGTTGT 
TGTGCAATGA 
TAATTGCTCT 
CTGAATACCC 
AGCAGGACAG 
GTACTACATA 
ACCAACAAGA 
GTCCTGAACT 
CTGGTGTGAA 
CTGGAGATTC 
CTTCCCAGTC 
TAGTACCAAA 
AGTCGCTACA 
TTGCTCTGAA 
AAGAGGCAGC 
AAGAGGGTAA 
GTATAGCAGC 
ATGCCATGCA 
GAGTAGAACC 
TTTTGACTAA 
CTTTGTTTGT 
rrGACCTTCAC 
CCAAGTCTGT 
GATCCATCAT 
CTGAGAGCCT 
TAGTGGCAGA 
GTTCGACATC 



GGTCACAGTG 
AAAGCTGCCT 
AACACTGACG 
TGAGGTTACT 
TCATGGATTC 
TGCTGTAATT 
TGCAATGAAG 
TTTTAGCTCT 
ACCAGAGATT 
CCAGAGCAAA 
GAAGGCACGC 
GGGACAGCTG 
AGAAGCTGTT 
ACAGGAGACT 
AGTCTCAGGA 
GGTTGCAGTA 
GAAGAACGGX 
GGCTGCAGAC 
AGACATQATC 
AGGGATTTAT 
ATTAACTTTA 
GATTTTGTGG 
AGTGGATAAA 
AAACTTGATA 
CTTCTGGCGT 
ATGCTTTGTG 
GTTTGAGATT 
GGGACAATTA 
AAGCATACTG 
AAXTATAAAG 
ATCATCTTTT 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800
1860
1920
1980 
2040 
2100 
2160 
2220 
2280
2340 
2400 
2460 
2520 
2580 
2640 
2700 



OA 



A67 Protein sequence
Gene name: ATPase, Ca++ transporting, type 2C, member 1
Unigene number: Hs.106778
Probeset Accession #: N51919
Protein Accession #: AAF27613
Signal sequence: none found
Transmembrane domains: 60-82, 86-108, 252-274, 282-304, 757-779, 827-849, 856-878
Pfam domains: Cation ATPase N [6-83], E1-E2 ATPase [89-324], Hydrolase [328-649], Cation ATPase C [744-889]
Cellular Localization: not determined



HIPVIjTSKKA 
SQFKNPLIMIj 



KVTAPQPAAT 
PLQKSMDliIXS 
TLALGVMRMV 
GVGYNQFGEV 
MGLDGLQQDY 
GQTIjTIjTQQQ 
TTLXA8GVSI 
FYHASPRHKM 
LVDDDFQTIK 
INIIMDGFPA 
ELiRDKVTTPR 
bVIYPPPLQK 
LEV 



11 
1 

SELPVSEVAS 
LLASAVISVL 
IiEHTLARDLV 
NGDLA&RSWI 
KQLSFYSFGI 
KKRAlTOKEiP 
XVDGDWHGF 
IRKAEYPFS3 
KDVYQQEKAR 
KHITGDSQET 
KIJKSLQKKG 



21 
I 

ILQADLQNGI> 
MHQFDDAVSI 
PGDTVCLSVG 
AFMGTLiVRCG 
IGllMIiVGRL 
IVCTLGCCNV 
YHPAVSaiVE 



31 
1 

NKCEVSHRRA 
TVAILIWTV 
BHVPADLTLLP 
KAJCGWIGTG 
liOKDILEMPT 
ICSDKTGTLT 
AGCVCNDAVI 



41 
I 



APVQEYRBEK 



ENSEFGEVFK 
ISVSLAVAAI 
KNEMTVTHIF 



QSLGVEPVDK 
PTTMTFTCPV 



MGSAGLRViA 
AVAIASHLGli 
3WAMTGDGV 
NWIKMFVHFQ 
DVIRKFPHMW 



DLLFLLGLTS 



LA6GPELGQL 
ySKT8QSV5G 
KDAVALKAAD 

KDSIIjTKWIiI 
RflOTKfiVFEI 
SVCrVAEI JK 



CPMKGAYEQV 

EBIDAMDVQQ 
IGVAMQQTGT 



IiKILVSSIII 
GLGSMRMFCY 



51 
I 

EDEPLWKKYI 
SLBELSKLVP 
3MTGETTPCS 
MMQAEEAPKT 
PEGLPIWTV 
TSDGLHAEVT 
EGALIAIAMK 
IKYCTTYQSK 
PPKXGVKEAV 
IiSQXVPKVAV 
DVCKKAADM2 
NPLNAHQILW 
VCGTLFVFWR 
AVTjGSIMGQL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 



A68 DNA SEQUENCE
Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)
Unigene number: Hs.170195
Probeset Accession #: BE616633
Nucleic Acid Accession #: NM_001719
Coding sequence: 123-1416 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

| | | | | |

GGGCGCAGCG GGGCCCGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTOCCCCCT 60 

CTGCCACCTG GGGCGGTGCG GGCCCGGAGC CCGGAGCOCG GGTAGCGCGT AGAGCCGGCG 120 

CGATGCACBT GCGCTCACTG CGAGCTGCGG CGCCGCACAG CTTCGTGGCG CTCTGGGCAC 180

CCCTGTTCCT GCTGCGCTCC QCCCTGGCCG ACTTCAGCCT GGACAACGAG GTGCACTCGA 240 

GCTTCATCCA COGGCGCCTC CGCAGCCAGG AGCGGCGGGA GATGCAGCGC GAGATCCTCT 300 

CCATTTTGGG CTTGCCCCAC CGCCCOCGCC CGCACCTCCA GGGCAAGCAC AACTCGGCAC 360 

CCATGTTCAIT GCTGGACCTG TACAACGCCA TGGCGGTGGA GGAGGGCGGC GGGCCCGGCG 420 

GCCAOGGCTT CTCCTACCCC TACAAGGCCG TCTTCAGTAC CCAGGGCCCC CCTCTGQCCA 480 

GCCTGCAAGA TAGCCATTTC CTCACCGACG CCGACATGQT CATGAGCTTC GTCAACCTCG 540 

TGGAACATGA CAAGGAATTC TTCCACCCAC GCTACCACCA TCGAGAGTTC CGGTTTGATC 600 

TTTCCAAGAT COCAGAAGGG GAAGCTGTCA CGGCAGCOGA ATTCCGGATC TACAAQGACT 660 






ACATCCGGGA ACGCTTCGAC AATGAGACGT TCCGGATCAG CGTTTATCAG GTGCTCCAGQ 720 

AGCACTTGGG CAGGGAATCG GATCTCTTCC TGCTCGACAG COGTACCCTC TGGGCCTGGG 780 

AGGAGGGCTQ aCTGGTGTTT GACAT CCACCAGCAA CCACTGGGTG GTCAATCCGC 840

GGCACAACCT GGGCCTGCAG CTCTCGGTGG AGACGCTGGA TGGGCAGAGC ATCAACCCCA 900 

AGTTGGCGGG CCTGATTGGG CGGCACGGOC CCCAGAACAA GCAGCCCTTC ATGGTGGCTT 960 

TCTTCAAGGC CACGGAGGTC CACTTCCGCA GCATCCGGTC CACGGGGAGC AAACAGCGCA 1020 

GCCAGAACCG CTCCAAGACG CCCAAGAACC AGGAAGCCCT GCGGATGGCC AACGTGGCAG 1080 

AGAACAGCAG CAGCGACCAG AGGCAGGCCT GTAAGAAGCA CGAGCTGTAT GTCAGCTTCC 1140 

GAGAQCTOGG CTGGCAGGAC TGGATCATCG CGOCTGAAGG CTACGCCGCC TACTACTGTG 1200 

AGGGGGAG1G TGCCTTCGCT CTGAAClCCT ACATGAACGC CACCAACCAC GCCATCGTGC 1260

AGACGCTGGT CCACTTCATC AACCCGGAAA CGGTGCCCAA GCCCTGCTGT GCGCCCACGC 1320 

AGCTCAATGC CATCTCX3GTC CTCTACTTCG ATGACAQCTC CAACGTCATC CTGAAGAAAT 1380 

ACAGAAACAT GGTGGTCCGG GCCTGTGGCT GCCACTAGCT CCTCCGAGAA TTCAGAOCCT 1440

TTGGGGOCAA GTTTTTCTGG ATCCTCCATT GCTCQCCTTG GCCAGGAACC AGCAGACCAA 1500 

CTGCCTTTTO TGAGACCTTC CCCTGCCTAT CCCCAACTTT AAAGGTGTGA GAGTATTAGG 1560 

AAACATGAGC AGCATATGGC TTTTGATCAG TTTTTCAGTG GCAGCATCCA ATGAACAAQA 1620 

TCCTACAAGC TGTGCAGGCA AAACCTAGCA GGAAAAAAAA ACAACGCATA AAGAAAAATG 1680 

GCCGGGCCAG GTCATTGGCT GGOAAGTCTC AGCCATQCAC GGACTCOTTT CCAGAGGTAA 1740 

TTATGAQCGC CTACCAGCCA GGCCACCCAG CCGTGGGAGG AAGGGGGCGT GGCAAGGGGT 1800 

GGGCACATTG GTGTCTGTGC GAAAGGAAAA TTOACCOGQA AGTTCCTGTA ATAAATGTCA 1860
CAATAAAACG AATGAATG 



A69 Protein sequence
Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)
Unigene number: Hs.170195
Probeset Accession #: BE616633
Protein Accession #: NP_001710.1
Signal sequence: 1-30
Pfam domains: TGFb_propeptide [37-201]
Transmembrane domains: none found
Cellular Localization: secreted



1 11 21 31 41 51 

| | | | | |

MHVRSLRAAA PHSFVALWAP LFLIaR3ALAD FSLDNEVHSS FIHRRI/RSQE RREMQREILS 60 

ILiOI/PHEPRP HLQGKHNBAP MFMU>LYWAM AVEBGGGPGG QGF6YPYKAV FSTQGPPIiAB 120 

LODSHFLTDA DMVMSFVKLV EHDXEFFKPR YHHRKFRFDTj SKI PJ2GBAVT AAEFRIYXDy 180 

IREREDNETF RISVYQVLQE HLGRESOLFL LDSRTLWASE BGWLVFDITA TSNHWWNPR 240 

HNL GLQIi SVE TEDGQSIHPK LAG^IGRHGP QNKQPFHVAF FKATEVHPHS IRSTGSKQRB 300 

QNHSBTPKNQ EALRMANVAE NSSSDQRQAC KKtfELYVSFR DCLGWQDWXIA PBQYAAYYCB 360

GECAPPLNfiY MKATNHAXVQ TLVEFINPBT VPKPCCAPTQ LNAISVLY2D DSSNVILKKY 420 
RNMVVRACGC H 

Cervical 

A70 DNA SEQUENCE



Gene name: bone morphogenetic protein receptor IB (ALK-6)

Unigene number: Hs.87223

Probeset Accession #: AA250737
Nucleic Acid Accession #: NM_001203

Coding sequence: 274-1782 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

CGCGGGGCGC GGAGTCGGOG GGGCGTCGCG GGACGCGGGC AGTGCGGAGA OCGCGGCfiCT 60 

GAGGACGCGG GAGCGGGGAG CGCACGCGOG GGGTGGAGTT CAGCCTACTC TTXCTTAGAT 120 

GTGAAAGGAA AGGAAQATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 180 

CAT A AC C ATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAOCC 240 

TGCCATAAGT OAGAAOCAAA CTTCCTTQAT AACATGCTTT TGCGAAGTGC AGGAAAATTA 300 

AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 

TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCAGCACA 420 

GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480 

GGTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTOGGG ACACTCCCAT TCCTCATCAA 540 

AQAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA 600 

CTGCCTCCAT TGAAAAACAG AGATTTTQTT GATGGACCTA TACACCACAG GGCTTTACTT 660 

ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCC3DG 720 

TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAGAAGAQGA TGAAACTTAC 780 

ATTCCTOCTG GAGAATCOCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 

TCAGGOCTOC CTCTBCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 

ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 

GTGAAAGTGT TCTTCAOCAC AQAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 1020 

ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 

GGGTOCTGGA CCCAGTXGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1140 

TATCTGAAGT CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGOCTA CTCTTCTQTC 1200 

AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260 

CATCGAGATC TGAAAAGTAA AAACATTCTG GTCAAGAAAA ATGGAACTTG CTOTATTGCT 1320 

GACCTG GQG G TQGCTGTTAA AT T TAT T AST GATACAAATQ AAGTTGACAT ACCACCTAAC 1380 

ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440 

AATCACTTCC AGTCTTACAT CATGGCTQAC ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1500 

GTTGCTAGGA GATGTGT AT C AGGAGGTATA GTGGAAGAAT ACCAQCTTCC TTATCATGAC 1560

CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAQTTA 1620 
CGOCCCTCAT TCCCAAACCG GTGGA3CAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680

ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 

ACACTTQCCA AAATGTCAGA GTCCCAGGAC ATTAAACT CT GA TAGGAGAG GAAAAGTAAG 1800 

CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 

TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 

CTTTCAGGGA GOGACCTGGG CAAAGACAGA GAAGCTDCCA GAAGGAGAOA TTGATCCGTG 1980 
TCTGTTTGTA GGCGGAGAAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT 

A71 Protein sequence

Gene name: bone morphogenetic protein receptor IB (ALK-6)

Unigene number: Hs.72472 / Hs.87223

Probeset Accession #: AA250737 / U89326

Protein Accession #: NP_001194

Signal sequence: 1-13

Transmembrane domains: 128-144

PFAM domains: activin_receptor [30-111], protein kinase [204-491]
Cellular Localization: plasma membrane



1 11 21 31 41 51

| | | | | |

MLJiRSAGKLN VGIKKSDGE9 TAPTPRPHVIi RCKCHHHCPB DSVNMICSTD GYCPTMIEED 60 

DSGDPWTSG CLQIiBGSDPQ CRDTPIPEQH XSIBCCTEKW ECNKDLHPTIi PPUKNRDFVD 120 

GPrHHRAXiLI SVTVCSLLLV LIII>FCYFRY XRQETRPRYS IGLBQOETYI PPGESI1KDI1I 180 

EQGQSSGSGS GLPLLVQRTI AKDIQKVKQI GKGHYGEVNM GKHRGEKVAV KVFFTTBEAS 240 

WFRE.TBIYQT VLMRHENILG FIAADIKGTO SWTQL,YLITT> YBENGSLYDY LKSTTLDAKS 300 

MLKLAYSSVS GLCHLKTEIP STQ0KPAIAH RDLKfiKNILV KKNGTCCIAD LGLAVXFISD 360 

1KEVDIPPHT RVGTKRYMPP BVLDESLBRH HFQ3YIMADM YSFOIiILWEV ARRCVSGGIV 420 

EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECHAHNPAS 480
RI>TAltRVKKT IiAKMSESQDI Kb 



Bladder 

A72 DNA SEQUENCE

Gene name: Homo sapiens type II membrane serine protease mRNA

Unigene number: Hs.63325

Probeset Accession #: AA411502
Nucleic Acid Accession #: NM_016425

Coding sequence: 1-1314 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

ATGTTACAGG ATCCTGACAG TOATCAACCT CTGAACAGCC TCGATGTCAA ACCCCraCGC 60 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TOCCCATCAT CRTAGCACTA 120 

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GOATAAATAC 180 

TACTTCCTCr GCGGGCAGCC TCTCOVCTTC ATCCCOAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CQAAQGGCCT 300 

GCAGIGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGAOCAG 480 

GATCTQGATG TTGTTGAAAT CACAOAAAAC AGCCAGGAGC TTCGCATGOG GAACTCAAGT 540 

GGGOCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGQ GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAfiGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCAOGTCTGT GGAGGGAGCA TCCTGGACOC CCACTGGGTC 720 

CTCAOGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGQCAG CTTOCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

T.TCAACCCCA TGTACCCCAA AGACAATGAC ATOGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTST CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GOCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AQAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAQAAGATG ATGTGXGCAO GCATCCCGGA AQGGGGTGXG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260

AAGGTCTCAG CCTATCTCAA GTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA



A73 Protein sequence
Gene name: Homo sapiens type II membrane serine protease mRNA
Unigene number: Hs.63325
Probeset Accession #: AA411502
Protein Accession #: NP_057509
Signal sequence: none found
Transmembrane domains: 31-53
LDLa domain: 54-94
Tryp_SPc domain: 204-429
Cellular Localization: plasma membrane/ER



1 11 21 31 41 51 

| | | | | |

MLQDPDSDQP LNSLEKVKPLR KPRIPHETPR KVGIPIIIAL LSLA8IIIW VL1KVILDKY 60
YPlXDGQPIiHF IPRKQ1XZOQE LDCPIX3KDEE HCVKSFPBGP AVAVULSKDR STLOVLDSAT 120 

GNWFSACFDN FTEAIAETAC RQMGYSSKPT FRAVBIQPDQ DLDWErTEH SQELRMRSSS 180

GPCLSBSIiVS LHCLACGKSL KTPRWGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240 

LTAAHCFRKH TDVFNWKVRA GSDKUGSFPS IAVAKIIIIE FNPMYPKDND IALMKLQFPL 300 

TFSGTVRPIC LPFFDSEJjTP ATPLWIXGWG FTKGNGGKMS DIUjQASVQV IDSTRCNADD 360 

AYQGEVTEKM MCAGIPEGGV DTCQGDGGGP UWQSDQHHV VGIVSWGYGC GGPSTPGVYT 420 
KVSAYU5WIY NVWKAEL 



A74 DNA SEQUENCE

Gene name: ESTs, Weakly similar to CGHU7L collagen alpha 1 (III) chain precursor [H. sapiens]
Unigene number: Hs.19322

Probeset Accession #: AA088458
Nucleic Acid Accession #: AA088458

Coding sequence: 862-1995 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

| | | | | |

GCCCTTGGAC ACTGACATGG ACTGAAOGAG TAGAATGGAG CACGAGGACA CTGACATGGA 60 

CTGAAGAAAA AGGAGCTGGA GCAGOAGAAG GAGGTGCTGC TGCAGGGTTT GGAGATGATG 120

GCGCX3GGGCC GCGACTGGTA CCAGCAGCAG CTGCAAOGAG TGCAGGAGCG CCAGCGCCGC 180 

CTGGGCCAGA GCAGAGCCAG G3GCCQACTTT GGGGCTGCAG GGAGCCCCCG CCCACTGGGG 240 

CGGCTACTGC CCAAGGTACA AGAGGTGGCC CGGTGCCTGG GGGAGCTGCT GGCTGCAGCC 300 

TGTGCCAGCC GGGCCCTGCC CCCQTCCTCC TCCGGGCCCC OCTGCCCTGC CCTGACGTCC 360 

ACCTCACCCC CGGTCTGGCA GCAGCAGACC ATCCTCATGC TGAAGGAGCA GAACCGACTC 420 

CTCACCCAGG AGGTGACCGA GAAGAiGTGAG CGCATCACGC AGCTGGAGCA GGAGAAGTCG 480 

GCGCTCATTA AGCAGCTQTT TGAGGCCCGC GCCCTGAGCC AGCAGGACGG GGGACCTCTG 540 

GATT0CACCT TCATCTAGTC CTTGTGGGGC GCGTGGGCCC CCAGGGCCAG CCTGGCACTC 600

AGCCCTTCGA GGGTGGGOGC CCCATCQCAC CCACCCTCTC TGGCTGGAGA CCCCCGGCAfi 660 

GCCCAGGCAC AGTCCOGGAG TGGGCGCCTT CCTGCCGCCC TTGCCAGATG GGCTCCOCAG 720

GCCTGOCCCC GGCTGGTCCC OGCACCGAGC GCTTGACTCC GTTTKGGCTC CTGQTTGYTQ 780 

ACATGGGCTG GGGGCTCTCT TGAGTCCGGA TAGTCCGCAG CTACTACTGG CCGCTGTCAG 840 

IGGACRGTGG GGTACCCCTC CKTGAGTTAG CGTCCCCCCG TTTCCAGCGG TGCCGCCCXG 900 

GGTCCCATCT TCAGGGAAAG GCACTGCCCA CGCCAGGCTG CACTTCCAAC AACGGGCAGC 960 

AGAGGGCGCG GGGCGGCTOC GAOGGGGGTC CAAGGGCAGC TTCCCGCTCA AOCAGGGCAC 1020 

CAGGACGAGG TGGCTGEAGC TCGGACGGAC GGAAGTAGAT GGAGGGGGTG GGGACGGCCT 1080 

GTAAGCGGGG GGTGCCTGGC 1GGCTGGGGA GCCCCAGGGA TAGCGGTCGG ACTTCAGGTT 1140 

CTGGCCAAGG CTGAGGGACC CTGGCTGCAG CGGATCGGCA CGCCGGGTGG GGGAGAGCTT 1200 

GGCCTGCATG TGCCTCCCAC AGACCCTGGG GTGATGGCCT TCCCCCTCTT GGCCGGGACG 1260 

TTQCCCCACG TTGAGTCCCA CACAACATCC TGTGAGCCTG GCTCCOCAGG AGGGCCCCCA 1320 

GACAGCTCCC AGGCAOGTCA TAG<3 CAAAGC CTGTTTCCCC CGACTCAGGA TTTCCAAGGC 1380

CTGGGGTCCT GCTCAOCCCC CTTTGCTCTC ACGCCCAGCC TGTCCCCAGG TTTCAGCTGG 1440 

GAGAGGCCAC CTCCCTCAfiC CAAGGAAAAC GAGAACCGCC AGGGTACAGG AGGAGGCTGG 1500 

GGCAGGTOCC CTTGGGTGTC ACTCOCTCAG OCOCTGCGCA GGCCCACTCC CGCTGGTGCT 1560 

GGAGTACGCA CTGGTGGGGG GGCCCTGCTC AGCCCAACCT GGAGGGTCCC AGTGTCAOCA 1620 

GAAOCAGGGG CACGGCAACA GCATOGATGG GTTCTGCAGC OCAGGGCCCC OGATGCGGGG 1680 

TCAGTGTGTG TGGGGCGCAG GGCCTCCGAT GCGGGGTCAG TG03TGGGGG GCGCAGGGGC 1740 

CCCGATGCGG GGTCAGTGCG TGGQGGGOGC AGGGCCCCCT CGTGTCCAGG GCACTTTGGT 1800 

ACACTGTCCC ACAAGGGACC TGTCTCAGAG GAGGGGCCCT GGCAGGCAGC GTGGCAACTC 1860 

CCTTOOGGAG CCCAGCTCCA TGCTAACCTG CCCACAGCAA CCCCACA0AO OCACATTCCC 1920 

TGCTGCACCT GGTCTGCAGG GGTGTOCCAG GACAGGCCCA AGTCAGCCCA GCATGCAGCT 1980 

GGCCTCCTAC CCTGAAGATQ GGAGTGGGCT TTCCAGGGGA CATAAGGATG TCAGGCCTGG 2040 

ACCTCCTGGG CAGGAAAGGG TGCAGGTCCI GAGGGOCTGT GCCCCACAGC CCCAGCACCC 2100 

AGGTGGACTG CAGGGCAGTG GGTGGGCCAG TGGCAGCCAG GGAGAAGCCC CCCGTCAGCA 2160 

GGCTQGGGTC TGCCCACCAG GGCCTCCCCA CGTCTGCCTT TGAGGGTGCC TGCCATGCCC 2220 

TGGGGGATCC TGGCATCTTT ACTGGACTGG AAflCRGGAGA CAGAAGA0TG TCTGTCCCGG 2280 

GGTQACTTCA TCAGGAGACC GCCCACATAG AGCXGGACCC GGCAGCTGAA GCGGAAATGT 2340 

GAGACAGGCT GGCACCTCCG GAAAAACTGC CTTTC5U30cr TGGTGTTCCG TGCAAGGTGA 2400 

AAAGAAATAG GTGCTCCCAG TTXACA3CTT GAAATCAGGC TAGTGAGTGG CCCTGGAGAC 2460 

CAOGAGGGGA GAATTTAAAG GCCCCGGCTG GCAGGGTCTA GGIGGCTGGC AGAGGCACAT 2520 

GCAGACCCTG CCTGGAGCCI GCCCTAGGAC GCTGGGCGGG TCAGTCTCCO TGCAGGATGT 2580 

GAGCAGCGTC CCTGGGCTCT ATCOGCGAGG TGCCAGTAGC GTGTGCAGGT ACATACRCGT 2640 

GOGTGGACAG TGTGATGACA CCCGGAAA1X3 TCTCAGGATG TTGAAATGTG TCCTTGGGGG 2700 

CAGAAGTGTC COCRGTTGAG AATCTGCCCC AGAGGAACAC ACCCACWXA GGCCTCAGGA 2760 

TTTTGTGTTG ATCAAGTTCC AAGGAAAAGG AACATCTCAG CCGGGGGTGG TGGTTCACGC 2820 

CTGGAATCCC AGCACTTGAG GCCAGGAGTT CCAGAGCAGC CTGGGCAACG CAGTGAGAGA 2880 

CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AGAGATGCAG GTTTAAAAAT 2940 

TCATAAACAC CACAAGGAAA CAATACACTA TGAGACCCAG CAQAAGCAAC AGATTGACTC 3000 

TAGACCCAGA TACTAGAATT ATCAGAGAGA ATATAAAGTA ACAGTGTTTT ATATATCTAA 3060 
AGAAATAAAA GAGATTTCTG GAAACATGAA AAAAAA 



A75 DNA SEQUENCE

Gene name: SCCA2 or 2b, serine (or cysteine) proteinase inhibitor

Unigene number: Hs.227948

Probeset Accession #: AB035089
Nucleic Acid Accession #: AB035089

Coding sequence: 9945-10219 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

| | | | | |

GGGCATGCAG CCATOGGGGA AAATCCATAG TGCAGATAAA GCAAGGAGGA AGAAGAAGGA 60 

CAGTTCTAGT AAAAGGQAGA ACATCAATAT AGGATGTTTC TTAGCAATAG AAAAAGAAGG 120 

CCAAGAGGAA TTAGGGAQAG AGTTATAAGA GATCAGCAAG GGGACAGGGT TAGATTTGGT 180 

TTGGTTTGAA AGCATACAGT AAATATGATG TCTGTCCCTG GCAGTGTTGG CAGAGTAGGA 240 
AGGAGGAAGA GAGGCAAGAG ATAATATCAT TTTCTCTGTG CTCCAACTGT ACTTACATAT 300 
GAGACTATTT COCTCTCTGC TTTTCAAACC TTACTOGAQT TGTTTTCCCT CATGAAAACC 360 
AAGAAAGGAA AGCTAGTTAG TCTTGTTCTG AGGTTGTTCA ATGTATACAT ATCTATATCT 420 
GTAGACAGAA TCCTTGGOAA TACAGTAATT GACATATATT CTGTTATTTG ATGCTTGAAA 480 
AATCTCCTCC ACTAACCAGT TTCCCTATAG ATTGCCACAA GCACATAATA AGAAACAATA 540 
AATAAAATGT TCTCTTGACT TTGTTACTTA ACAATGCTGA GAAAACTTTA CAGCCTTCAT 600 
AAGGAAGTGA GGTCCAGGAA AATCTAGGAG ATATTTCTTA ACCAATCTAT AAAGGCATTA 660 
GTAATGACAG GATATTTCCT GAAAGTGTAA TTTCCCATTG AGGATTTGTT TTTAATTTCT 720 
GGATTCCTGG AGCCAATGAA GTTGGTGTAT GTTTATGAAA TATCAAGAGA CATAAGTTGG 780 
CAAGTGTTCA TATGCAAAAA CTTCTTGGAA TTTCTGAGTT CTCTGTGGCA ATATATGACA 840 
TCAGGATATG TCCAGTCTCA CACACCABGA TATGTCCTTT CTAGCCTGTC TATCACATGC 900 
TAGGAGAACT ATTTAGGAAC AGAAAAAAAT GCCTGAAATG ATTTCTCATT TGAACTCATC 960 
CAAGCTTTCT CTAAATTTAA GCAAACTCCT GGTCATTTTC AGTTAGTACC TTTCCTTAAG 1020 
TTCAACCTTC AGGGCAAACC TCOGTGCCTC AGACGTTTAG CCATAGTCTQ AAATTCTCTT 1080 
CCATAGATTG GTCCCCTGTA ACCCCGGTTT GTCTCAGCTT GTTATCCTGT TTTTTTCTTC 1140 
CCTCCATTCC CAGGATGAGC TTGTTGCTTC TGTCCTATGA GACATTAQAT TCCTTTTCTT 1200 
TGGTACCCGA GTAAATCCAT CCTACTCCAA TAGAGGAAGG TCCATTTTTG TCTTATAGOG 1260 
CTBGATGCAG ACTCAGCTGA GAAGACCATT ATTCATTTTT GGAATTCTXT ATCTCAGATA 1320 
TTTCCTCTTC TTTCTTTTTC TTCTATCTTT GGATTTTTAG TCCATCAAGG CCCCATTAGT 1380 
CTATTCCCCG ACTTCAATCA GGGAACTTAT ACCTCTTAAA CTCATTCAGA GACTCAAAAC 1440 
ATATATATTG ATACAGGAGA CCTAAGAAGA GCATGTCTXG GGGGTTGAGG AAACAGGCAG 1500 
GTGAGAAATT TCCAGATTGG AAACACAGCT TC5CTTTCTCC CATCCAGCCC CTACTTTCAG 1560 
CCTATGTGTT TCTGGCACCT TGTTGTAGAT AAATCTCCCT TGACTTTGTG ATGTGCTGAG 1620 
AAAACAAACT CACGGCTOGT GTTAAAAAGG GCCCATQACA ATACCAAGTG TTGGGGAGAA 1680 
TGTGGAGAAA TCAGAACTCT ATTCACGGTC GGTTGGAATG CACACTTGTG CAGAATTCTA 1740 
TGGAGAAGAG TCTGGCATTT CCTCAAAATG TTAACCTGGA TTTACCATAT GACCCAGCGA 1800 
TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA TGCCATGCAA AAAAATQTAC 1860 
ATGAAAGGTC ACAACATCAT TATTCATAAT AGTAAAAGGA TGGAAACAAC ACAAATGTCC 1920 
ATCAACTTAT GATTAAAGAA AATCTGGTCT ATTCATAGAA TGGAATATTA TTCGACCACA 1980 
AAAAGGAATG ATGTACTGAT CCATGCAATG ATGTGGACAA ACCATGAAAA TAACACTAGA 2040 
TTAAAGAAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTGGA 2100 
ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 2160 
AOAATGAAGT ACAAQATTTC TTTTGGAGGT AGTQAAATTQ TTGTOGAATG AGATCATGAT 2220 
GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGG 2280 
TATATAAATT ATATGTTAAT AAAAAOOGGG TCCACAAAAC AAACAGCOCC CCACTCTGGT 2340 
TGTCAGGGAG ATATTGGATT AAATGGCCTT GGACAACAAC CCCTCTCCCT GGCCACAGAC 2400 
ATTCTTCAGA TTACAAOATA TTCCAGGGGA AACACTGGAA TGAGTCTGAA GCCAGGTGCT 2460 
AAACAGAAGG ACCATTGAGA AATGTTGTGA TCCTGACAGG TCAAGCAATT TATTTTT0GO 2520 
CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCQTC TQTTTCAATT 2580 
GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 2640 
AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAOAGTT CCACOTGATT 2700 
GGGGAGGCCT CAGAATCACA GTAGGAGGCA AAAGTT AT X C TTACATGGTG GCTGCAAGAG 2760 
AAGATOAGGA AGAAGCAAAA GAAGAAACCC CTGATAAACC CATCGGATCT CCTGAGGCTT 2820 
ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC ATGATTCAAT TACCTCTACC 2880 
TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 2940 
GGGAACACAG CCAAACCATA TCIACTCAGCA AGGCAGATAA CTTTCTCACT GAGCCTATGC 3000 
AACAGAAAAC CATCT0C3GAT GGTTGTAAQG GGCACAGQAA GTQACTGGTA GGATCACTGC 3060 
CAAAGCTGAG CACTCAGGAG AAGGCAATAG AATCCTATTC TCCATAGTAT GCTATAAGAT 3120 
ACTGAAGTAjC ACTTCTTCAC TATCTCTTTO GACTTAGAAT TAGCAOTACA TTCCTTGTTA 3180 
TACAGAAAAA TTACTAAGGA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGAAAAACAG 3240 
GAAATGTAAG CTTTTTAGTT CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 3300 
CAAGAAAAGA ATGGTGGGGT TTTTGTTTGT TTGGTTTTGT TTTTGTTTTA CAGCTGGAGT 3360 
AGAArACAAA GGGATGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACTTATTCT 3420 
CATTGGCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA 3480 
TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG GTAGAACTGT AATCCTGTCC 3540 
TTTGTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTGA AATCT43AAGT 3600 
TGAGTAACTT CAAATACTAA CCACAGAGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 3660 
GGATCTCACC CTTCATTCCA CAQACACACA CAOCCTCTCT GCCCACCTCT GCTTCCTCTA 3720 
GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 3780 
AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 3840 
TATTCTGCCA GAGCAAAATT AAAATACCTA TTTCATCTGA TTTGTCCTTT ATCTAAATTO 3900 
CTTAGTTCCA AGTAAACCAA GGCACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 3960 
GAGAGTCITG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATQ TOATCCACAC3 4020 
AGGTCTCCTG TTAGCAXTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTGT AACCAGCAAT 4080 
GAAAGAAAGA TAAAGAGGGT CGATTACTTA TTTACAATAG TCTTTAAAAA CGTAGTTTTG 4140 
TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 4200 
CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 4260 
TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 4320 
TGATGCTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC AXGAATTCAC TCAGTGAAGC 4380 
CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCAAAAG AGAACAACAT 4440 
CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAGA 4500 
CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 4560 
TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTGATGGGT 4620 
AGCACAGGGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAGATCTT 4680 
TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTGATCAAGT 4740 
CACAGAGAAC ACCACAGAAA AAGCTOCAAC ATATCATGTG AGTCACAGAG CACTCTGATT 4800 
CAGCTTTAGA TCCCTGAACA GGTCATAGTT TAAACCTGGA ACTTCACAAA AACTAAQAAA 4860 
AGGCCAGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAGAGT GGGTTGGCAT 4920 
TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA 4980 
CCTCAGAGCA TAGGTCTGGA TCAGGATAGG CTGGGTTCAG ACTOCAGCTT TGCTCTTCAC 5040 
AAATGATGAA TAAGAGCAGQ ACACAACTGC TCGGAGTCCC AGTGACCTCA TCCCAGAAAA 5100 
CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATGT TTACAACAGT 5160 
GOCTTGCCCA TAAAAGTCAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 5220 
TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 5280 






ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 5340 
CCTTAATAGT GCAAAATGCT TTGCTGGAAG GTAGAAAGTT CTAGATTTAA ACAGGCTTAG 5400 
GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT CCCCATTCTC 5460 
TGAGCXTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTCC 5520 
TTAGCCATGG ACCTGGCATA CACTCITCTT ACGTGCAOAG AATGACCATC ATGAGGAAAG 5580 

AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 5640 
CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAC TCGTATCCTT GATGACTGTT 5700 
AGAAGTGAAA TATGOTCCTT QCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT 5760 
AACOCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GCGCTGCATC TTTAGTTCAG 5820 
AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA QAAGAATTTT CTGOAAGAAG TCAAATATAA 5880 

GGTGGATTTT GAAGGGTATT TGAGGTGAAA XACACCAATT ATCAGGGAAT AACATCAAAG 5940 
GTCCTCAATG AGACTACCAG CATTTAGGGA CTGATCTAAC AGACTTAGCA TGGGTTTAGT 6000 
ATTTACATTG ATACAGCAAr TGAATGATCT OCTTTTTTGA TGTTTGAAGG TTGATAGGTC 6060 
AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTaACTGAA TTCAACAAAT CCACTGATGC 6120 
ATATGAGCTG AAGATCGCCA ACAAGCTCTT CGGAGAAAAG ACGTATCAAT TTTTACAGGT 6180 
AATTTCACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 6240 
AAATGGAAGA AAGCAAGGCA GATGAGOCTG GCCGACCCAG GTGGAGAGCA TTTACTCAGA 6300 
GTGCATTAGC TCCATTTCCA CAACTCTCCC CCACTGOAGT QTCCCAGACC CCAACGATAC 6360 
ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGGTTG TGTAATAGAG 6420 
TGAOTftAOAG TAATAAQTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 6480 

CGATACATCT TGGTGGGAAA TGTATGACTA ATGGGATATT ATTGGAATGG GCAGGCTTGQ 6540 
GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA GGATGAAAGA 6600 
CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 6660 
GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTGTT CTGCTTCTGC 6720 
CTTTTGCCAA GATGATATTG ACCTTCAGTQ ACCTCTTTCT TGTGCCAGGC CACATTCCCC 6780 

TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGACAGGAGA TACTGCATCT 6840 
ATTCAGGGTC TGQATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 6900 
TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGCJAAT ATTTAGATGC 6960 
CATCAAGAAA TTTTACCAGA CCABTGTG0A ATCTACTGAT TTTGCAAATG CTOCAGAAGA 7020 
AAGTOGAAAG AAGATTAACT CCTGGGTGGA AAGTCAAACG AATGGTAGGA GAGCCACCCA 7080 
TTATAGAAAC ACCTTTOAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGGGG 7140 
GAACAGGTGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGGGCA CTCCAGATOA 7200 
AGGATTTGTC OAAATGAATA TGAAGAGAGC CTAGGGGAGC CAAGGAGGAA ATCACAGGAA 7260 
GCCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCOCTGCA TGACAATAGC 7320 

TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT ACTTTAAATC 7380 
AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCGTC TCtCGTCTDG 7440 
TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTGTCTC TGAGCTGGAG 7500 
CTGCCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTATCACAGO TCTCTTCCCC 7560 
ATCAAQGGCA AAGAGCTTGA CAAAGTCTOO ATTCTACAGA CATCTXTCTT AOCTCCCACC 7620 
TCTCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATACCCGCO 7680 
GAAGTAGTGT CTOACAGCAC AGGACATGCG TTTCATATTA CAGAGCTCAA GTCACTCATC 7740 
CTAAAATGCA ATCAGGGCCT CCTTOCTCTG AATGGGGACC COGTAGTTAA AAAAAAATAA 7800 
AAGTAQGAAG AGGAGGGAGG GAGAAAGGAA AGACACATGT TGGAAGAGTA GACAAAATCA 7860 
GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACAGAGAA CGTGAACTCC 7920 
TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATACGACT GGAGCCTTGA 7980 
AGAAAGCTGC ATCTGGTGAA CCACTGGTCA GTGAGTCTAA CAATTCAAAG ATCAAAGTCA 8040 
G7IGAGTCTCA AGCAGGGATT TGGGTCSATA ATTAACGATC AGTCACGAAC ATTTGCAAAG 8100 
CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAGACT CITTTRTTCT TTCJCCTTGCA 8160 
GAAAAAATTA AAAAOCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT 8220 
GTGAACGCAA TCTATTTCAA AGGGCAGTGG GAGAATAAAT TTAAAAAA0A AAACACTAAA 8280 
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TCTATATTTT ATTTATATAG TGTAATATGT 8340 
TAATACATGG AATGTTAAAC ATTTCTGATG GAATGXAACA TQATAAGTAA AAAATAAAAA 8400 
TTGTK3VTGT CTGTTATTTT GTTGTTTTAC TCTTATAAGT TTATTTAGTT AGGAATACCT 8460 
GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGqOTTAT TTCTTAGAAG AAGAAGGATG 8520 
TGTTGCTATC TCAATAATAT TATCTTTTTT GTCTTGTGTT TCACGTGTTA TTTGTTGGAC 8580 
ACATTGATTT ATTGCABAAT ACATAGAAAT CTGTACAGAT GA3GAGGCAA TACAATTCCT 8640 
TTAATTTTGC CTTGCTGGAG GATGTACAGG CCAAGGTCCT GGAAATACCA TACAAAGGCA 8700 
AAGATCTAAG CATGATTGTG CTGCTGCCAA ATGAAaTCGA TGGTCTGCAG AAGGTAAGAA 8760 
CTTGCATCXA CAACTCTTCC TTCPACTGCC GGACATTTTT CCAAAGATAC CAAGTTTAAA 8820 
CAAGGTAAAA GCTTATGACC GAGTTGCCTC AAAATGATGA AAAATTCTAA ATGAGGAATG 8880 
ATGACTCACC TTCATATTAC AAATATTTQA GCATAGGGCC TGACACAAAC TGAAAGCTTA 8940 
GTTTTTGTTT GTTTGTTTGT TTTTATTATT ATTATTATAA TACTTTAAGC TTTAGGGTAC 9000 
ATGTGCACAA TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGOTGG TGTGCTGCAC 9060 
CCATTAACTC ATCATTTAGC GTTAGGTATA TCTCCTAATG CTATCCCTCC CCCCTGCCGC 9120 
CACCGCACAA CAGTCCTCAG AGTGTGATGT TAOCTTCCTG TGTCCAAGTG TTCTCA2TGT 9180 

TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGX TAATGATTAA TTTATTAGAG 9240 
TTACATGGAT TGGATATCAA TTTGATGATA TTATTATGCA GCAATTTAAA CTTGACTGGG 9300 
AGAAATATAT ACCAATGTGA GGAAAGTTTA CAAATAGGCC GAGTAGAAAA GGGAATACAA 9360 
ATTTAGGAAT TIAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATGTATACA 9420 
GAAAAATATG ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTOTTGG CACAAGAAAT 9480 
AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA TGTGTATCAT 9540 
ACAGTTGTAT ACATACATAT GTACACATAT ACATATACGT AAAAACATGA TTCTGTTTTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTATATATTC AGGACTCATA 9660 
TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTOTATTTC ACATTTATTG 9720 
CCAAAATAAC GAATCTCCAC ATAGTCAATT CATTGTTAAG GTGTATTAGA GATCGACAGT 9780 

TAGTCATATC AGTTTCTTTT TTCCATTTGT ATAGCTTGAA GAGAAACIGA CTGCTGAQAA 9840 
ATTGATGGAA TGGACAAGTT TGCAGAATAT GAGAGAGACA TGTGTCGATT XACACTTACC 9900 
TCGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGGACACG TTGAGAAJZCA TGGGAATGGT 9960 
GAATATCITC AATGGGGATG CAGACCTCTC AflGCATOACC TGGAGCCACG GTCTCTCAGT 10020 
ATCTAAAGTC CTACACAAGG CCTTTGTGGA GGTCACTGAG GAGGGAGlGG AAGCTGCAGC 10080 
TGCCACCGCT GTAGTAGTAG TCGAATTATC ATCTCCTTCA ACTAATGAAG AGTTCTGTT9 10140 
TAATCACCCT TTCCTATTCT TCATAAGGCA AAATAAGACC AACAGCATCC TCTTCTATGG 10200 
CAGATTCTCA TCCCCATAGA TGCAATTAGT CTGTCACTCC ATTTAOAAAA TGTTCAOCTA 10260 
□AGGIGTTCT GGTAAACXGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 10320 
CTATCTCATC TTGATGATGA TAGTCATCAT CAAQAATTTA ATOATTAAAA TAGCATGCCT 10380 
TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG AAAAATTTCC 10440 
CTTGAGGAAA AATGTCCAAG ATAAGATGAA TCATTTAATA CCOTGTCTTC TAAATTTGAA 10500 
ATATAATTCT GTTTCTGACC TOTTTTAAAT GAACCAAACC AAATCATACT TTCTCTTCAA 10560 
ATTTAGCAAC CTAGAAACAC ACATTTCTTT GAATTTAGGT GATACCTAAA TCCTTCTTAT 10620 
GTTTCTAAAT TTTGTGATTC TATAAAACAC ATCATCAATA AAATAATGAC ATAAAATCAT 10680 
TTTTQCTTTA CCTGTTTTCT CXCTOGAAAG GGCAAGTGTC CAGTTACACA TAGGAAAGAT 10740 
AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG TTCATGATGA 10800 
GCCTGGAGTA GAAGGCATA? CCEAGAACAG GAGGAGCCTT GTAAACCACA TAGGAACTTC 10860 
CTATTTTATG CTAAAQGGAT AAGAAACTCA TTACAGGCTT TGATGGTTGT TTGTCAAAGA 10920 
GQGGCATAAA ATTATCATAT CCACATCTAG AAAATACATC TCTGGCTACG CTGATATCAA 10980 
TGGATGCQAa GAAAGAACAG TGTGGTTACC ATATATAAAT TAQGAAATCA TTAGAGTATT 11040 
GGGAGTGGAA ATGGAGAGAA AGAAAGAGCC TGGGGGAATT ATTTAGGAAA TAATAGTTAC 11100 
AGAAAGACAT CTAAGTTGCT GACCTATCT'G ACTGGATGGA TGGAAQAATA TCTTGTXTCT 11160 
GAGAGAAAAA AAGACTTTGG GTTTAAATTT GTACTTGATG AATTAAGG2A CTTTTAATAT 11220 
TCAAATGGAT TTGCCTGGCA GGCACTTGAA GATATTAGTC TAAATCTCAG AAACAGAATA 11280 
TGATCTGAAG CTCTAAATTT GTGATATTCA ATATAAATAC TTTAGAGTCA TTGGGATAAA 11340 
TATOGTAGTT GTAGCTAAAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAGCATG TGTAAAGATC AGGAATTGAT 11460 
CATTTTTATT TTCCAQAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC CATAGATTCT 11520 
TCCC 



A76 Protein sequence:

Gene name: SCCA2 or 2b, serine (or cysteine) proteinase inhibitor

Unigene number: Hs.227948

Probeset Accession #: AB035089

Protein Accession #: BAB21525

Signal sequence: none found

Transmembrane domains: none found
Serine Proteinase Inhibitor domain: 13-390

Cellular Localization: secreted



1 11 21 31 41 51 

| | | | | |

MNSLSEANTK FMFDLFQQFR X6KBWNIFYS PISITSALGM VLLGAKENTA QQISKVLHFD 60 

QVTENTTEJKA ATYHVDRSGN VHHQFQKLTiT EFNKSTDAYB JUKXANKIjFGE KTYQFIiQExIj 120 

DAIKKFYQTS VESTDFANAE EEflRKKlNSW VESQTttEiClK HLFPUGTIGN DTTLVLVKAI 180 

YPKGQWBNKP KKHSTKEEKP WPNKHTYKSV QMMRQYNSFN FAIO»EDVQAK VliEIPYKOKD 240 

LSMIVL1.PMB XDGLOjKXiEEK I/TAHKLMEWT 9LQKMRETCV DLHLPRFKME ESYDLKDTIiR 300 

TMGMVNIFKG DADLSGMTMS HGLSVSKVLH KAPVEVTEEG VEAAAATAW WBLS8PSTN 360 
EEFCCNHPF1. FFIRQNXTNS 1I*FW3RFS3F 

A77 DNA SEQUENCE

Gene name: hypothetical protein FLJ13459

Unigene number: none found

Probeset Accession #: XM_047266
Nucleic Acid Accession #: XM_047266

Coding sequence: 485-1471 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 

| | | | | |

CTGACCTCAA GTGATCCACC CACCTCTGCC TCCCAAAGTG CTGGGATTAC AGGTGTGAGC 60 

CACCATGCCA GGCCCTCTTA ACGTCTTCAA GTCXGTTTTC TCATCTGCAA AACAGAGGTA 120 

ATAAGATCAG TATCTTCTTA ATQGAAGCAC CTGGACTACA TTTTTTTCAT TCATTGTTAT 180 

CATAAATGAG GACTAACCTG TCTCCOSTTC GGAGTTTTGA ACCTAGACCT CATGTCTTCA 240 

TGACGTCATC ACTGQCCCAG GCCCAGCK3T GtfGCClACAC CAGCCCCAGC TGACGCATCT 300 

TCTTTTTCTG CCTGTAQAGA TGGTTACAAT GCCTGGCGTG ATGCATTCTG GCCTTCGCAG 360 

ATCCTGGCGG GGCTGTGCCA AGGCTGTGGC CTCCCTGCCC CTGAATACCG AGCCGGTGCT 420 

GTCAAGGTGG GCAGCAAAGT CTTCCTGACA CCACCGGAGA CCCTGCCCCC AGGGATCTCT 480 

TCACATGTGG ATTGACATCT TTCCTCAAGA TGTGCCTGCT CCACCCCCAG TTGACATCAA 540 

GCCTCGGCAG CCAATCAGCT ATOAQCTCAQ AGTTGTCATC TGGAACACGQ AGGATGTGGT 600 

TCTOGATQAC GAGAATCCAC 1CACOGGAGA GATGT0GAGT GACATCTATG TGAAGAGCTG 660 

GGTGAAGGGG TTGGAGCATG ACAAQCAGGA GACAQACOTT CACTTCAACT CCCTGACTGG 720 

GGAGGGGAAC TTCAA1TGGC GCTTTGTGTT CCGCTTTGAC TACCTGCCCA CGGAGCGGGA 780 

GGTGAGCGTC TGGCGCAGQT CTGGACCCTT TGCCCTOGAG GAGGCQGAGT TCCGGCAGCC 840 

TGCAGTOCTQ OTCCTGCAGG TCTGGGACTA TGAC0GCATC TCTGCCAATG ACTTCCTTGG 900 

ATCCCTGGAG TTGCAGCTAC CAGACATGGT GCGTGGQGCC CGG3GCCCCG AGCTCTGCTC 960 

TGTGCAGCTG GCCCGCAATG GGGCCGGGCC GAGGTGCAAT CTGTTTCGCT GCCGCCGCCT 1020 

GAGGGGCTGG TGGOGGGTAG TGAADCTGAA GGAGGCAGAG GACGTGGAGC GGGAGGOGCA 1080 

GGAGGCTCAG OCTGGCAAGA AGAAGCGAAA GCAGAGGAGG AGGAAGGGCC GGCCAGAAGA 1140 

CCTGGAGTTC ACAGACATGG GTGGCAAT0T GTACATCCTC ACGGGCAAGG TGGAGGCAGA 1200 

GTTTGAGCTG CTGACTGTGG AGGAGGCCGA GAAACGGCCA GTGGGGAAGG GGCQGAAGCA 1260 

GCCAGAGCCT CTGGAGAAAC CCAGCCGCCC CAAAACTTCC TTCAACIGGT TTGTGAACCC 1320 

GCTGAAGAOC TTTGTCTTCT TCATCTGGCG CCGGTACTGG CGCACCCTGG TGCTGCTGCT 1380 

AOTGGTGCTG CXCACOGTCT TOCTCCTCCT GGTCTTCTAC AOCATCCCTG GCCAGATCAG 1440 

CCAGGTCATC TTCCGTCCCC TCCACAAGIG ACTCTCGCTG ACCTTGGACA CTCACCCAGG 1500 

GTGCCAACCC TTCAATGCCT QCTCCTGQAA GTCTTXCTTA CCCATGTGAG CTACCCCSiGA 1560 
GTCTAGTGCT TCCTCTGAAT AAACCTATCA CAGCCACTG 
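
The coding-sequence range in each record header appears to use 1-based, inclusive positions that include the stop codon: for this entry, position 485 falls on the A of the ATG in the row ending at 540, and position 1471 on the A of the TGA in the row ending at 1500. The following is a minimal Python sketch, not part of the original filing, of how a listing in this layout (groups of ten bases followed by a running position count) could be joined and sliced by such a range. The function names `join_listing` and `coding_region` and the toy sequence are hypothetical. The sketch assumes clean text; damaged groups would need correcting first, since a dropped or extra character shifts every downstream coordinate.

```python
def join_listing(lines):
    """Concatenate the base groups of a numbered listing.

    Tokens that are not purely alphabetic (the trailing position
    counter on each row) are skipped.
    """
    bases = []
    for line in lines:
        bases.extend(tok for tok in line.split() if tok.isalpha())
    return "".join(bases).upper()


def coding_region(seq, start, end):
    """Return the bases from start to end, 1-based and inclusive."""
    return seq[start - 1:end]


# Toy listing for illustration only; it is not taken from the patent.
toy_rows = [
    "GGCACCATGG CTAAAGGTTG 20",
    "ACCTTAA",
]
toy_seq = join_listing(toy_rows)
toy_cds = coding_region(toy_seq, 7, 21)

# A well-formed coding region starts with ATG, ends with a stop codon,
# and has a length that is a multiple of three.
print(toy_cds, len(toy_cds) % 3 == 0)
```

The same length check can be applied to a header range before any slicing: 1471 - 485 + 1 = 987 bases, that is 329 codons, which matches the 328-residue protein of record A78 plus one stop codon.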



A78 Protein sequence:

Gene name: hypothetical protein FLJ13459

Unigene number: none found

Probeset Accession #: XM_047266

Protein Accession #: XP_047266

Signal sequence: none found

Transmembrane domains: 291-313

C2 domain: 27-86

Cellular Localization: plasma membrane / ER



1 11 21 31 41 51

| | | | | |

MWIDIFPQDV PAPPPVDIKP RQPISYELRV VIWHTSDWL DDENPLTGEM SSDIYVKSWV 60 
KGLEHDKQBT DVHFNSLTGE GNFrWRFVFR FDYLPTEREV SVWRRSGFFA LEEABFRQPA 120 
VLVLQVWUYD RISANDFLQS LKLQLPDMVR OARGPELCSV QlxARNGAGPR CNLFRCRKLR 180 
GHWPWKLKE AEDVEREAQE AQAGKKXRXQ RRRKGRPKDL EFTOMGGNVY IliTQKVEAHF 240 
ELliTVEEAEK RPVGKGRKQP EPLBKPSRPK T3FNWFVKPL KTFVFFIWRR YWRTLVLLLL 300 
VLLTVFLLLrV FYT I PGQ I SQ VIFRPLHK 



A79 DNA SEQUENCE

Gene name: Homo sapiens mRNA; cDNA DKFZp434K0322 (from clone DKFZp434K0322)

Unigene number: Hs.161031

Probeset Accession #: AL137708
Nucleic Acid Accession #: AL137708

Coding sequence: 1315-1791 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

| | | | | |

GGCATTGATG CTGTGTGCGC GTGCGTGTOT GTOTGTGTGT GTGTGTGTGT GTGTGTGTGT 60 

GTGTGTGTGT OTCTGGftGTC ATGGCAGGGT CCCTTTCTGT CTGTCTCCTT GCTCTGCCCC 120 

AGACTGGGGG GCTGCAGAGG TGAGGGTATC TGGCCTCAAC AGCTGCTTAT TCCOQATGGG 180 

ATGGCCTGGG CTGGGCCCCT GAGGCCAGGC TGACTTGGAC ATGGCAAGAG GGGTCCCAGG 240 

CTCTTGTGGG CAAAGCAGGG GAGGCGCCAA TGTGGAGGAA GAGAOTCTCC TGGCTGGCTG 300 

CTGCTCTCTG GAGCGGGTGG AGTCAGGGAA GAGCTGAGCT GGGGAGTCAC CCTGGGCCTG 360 

GGGTCACCGT AGGCCCCATG TAGCACCCTG GTTCCCCTGC CTGTAGGTGA CAGGAGCCAG 420 

CCCAGCCAGG TGTGCTCCCT CCCCAGGCCC TAGGCAGGCG GGTACAGGGG CCAGCAGCTG 480 

CGCCCGCCCC ACCTTCCTTC CCACCCACAT GCCGAAGGGT GGCCftGGCAG GCAGGTGGAC 540 

GAGTGCAGGC AGCGGCTGAG TCAGTGTGTG TGGAATGTTC TGGCCGCTCC CAGCTQCACC 600 

CTGCCCCTAC CTGCCACCAC CTCACCTTCA TCCTCAflGCG CTGCGGCCCT GAGCCOCTGC 660 

CAGGAATGCA CCTTTAGCCC AGGCCTGCTC AGTGAGCTCC GCCGACAGCC AGCCCTGCTC 720 

CTCCCGCCAT GACCCTGCAG ACCCCTCTGG GCTTCCAAGT TCCTGGGGQC TGCAGTGAAC 780 

ATGCTCCACC TGCATGGCTG GCAAACCATG GTGGGGCCCA GCTGTGGTGC GTGCTGGGGT 840 

AGAGGCAAGG AAGTGATGGG ACCGCAGAGA TGAGACCCCC AGGOATOAGA TGGGACCCCC 900 

AGGCAGGGCC CAGGGTCCAG GGOCCAGGAG AGAGAAGCAG GGAGGGAGAG AGCTTCCTGG 960 

TGGAGGACGC ATCCTACAGT GGGGGCAAGG GTGCTCTGAG GTCCGGTGAA GGCAGGGACT 1020 

AGGCTGCCCA GGCCCTGCGT GCTTGGCTGG GGCTGGGGGC TGCTGGGAGG TGGCTGGGAG 1080 

GCTGGGCCTG GGCAGCTAAG CTOGAGCTTT GGCCAGGGTC GAGAGCCTCC CTCCCTTCAG 1140 

CTTCCTGCTG CACAGAACCC TOGCCCCTGG CCACCCOGTG CTGCCTCCTT GCCCTGGEBG 1200 

ACCCAGCAC? GGCTGCTGCT AGTCAGATGG' GGTAGCGGGC AGGGGCCGGA GGGGCCACCG 1260 

TCCCAGCTGA CCCAGCCTCC TGGGCCGCTT CTTCCAAACC AGCAGGGTAG AAAGATGGGG 1320 

CACCCAGCAG TCTCTCCCAG TGCCCCGGCC CCAGCTGGCA CCACAGCTAT ACCTGGGCTT 1380 

ATTCCAGACC TTGTCGCOGG GACCCCCTGT GAGTTGTGGG ATTCCCAAGA GGGGTGTGGQ 1440 

GATAACCCAG GCAAGTGGGG GCTGCAGCTG TCCACAGATG CACTCAGCGT GGCCTCTACC 1500 

CCAGGGCCCC GCTGGGCTCT CATTGCCGGC GCCCTTGCCG CGGGCGTCCT CCTCGTCTCC 1560 

TGCCTCCTCT GTGCTGCCTG CTGCTGCTGC CQCCGCCACA GGAAGAAGCC CAGGGACAAG 1620 

GAGTCCGTGG GTCTGGGCAG TGCCGGCGGC ACCACCACCA CCCACCTGGT GAGGAGCGGC 1680 

TCCTTGCTCA CTCAGTCCAG AGAGGGCTTG AAATCCAGGC TCCAGAGCCC AGGGCAGCGA 1740 

GGGGAGTTCA GCCCCAGGGA TGGTTTAACC CCCAGAGAGG CAGGGCGTTG AGGACCTTCC 1800 

TGGGAGGGAA AGTGGGTGAA CAGAGGTGAG AAGGAGGCCA TGCAACAGGG GCTGCCCCAT 1860 

GGGCCCGAGG GAGCCAGAGC GGGTTCTTGA GGAAGGCAGG GGGTACCCCA GATGCCACGT 1920 

TTTGGGTGGG TTTGGOGGGT CTCACAQAQC GAAGCCGACQ ATTTGTT3CCT GTTGGGTGGC 1980 

CTGGCCTGGA GGCGGGGGGT CTTGACCCAT GTCATGGAAG GGCTGCCCGG GAGCCCAGGG 2040 

CTCTGATGAG GCATGATGTC AGCACCACCT GCCCCTXOTC CCARCTCACT CCAGGTGCAA 2100 

CCTGATGTGG ATGGCCTGGA GTCCAGCCCG GGGGATGCTC AGCAATGGGG GTGGCTGCAG 2160 

CTCTCCCTGG AGTTCGACTT TGGAAGCCAB GAGGTGAAGG GCCCCGCTGC GCAGGACCAG 2220 

OGGTTCTQCG AGTTTCCGGA AAGGGTGACG GGGGAAGGGC AGACCCCATG CCCTGGGTGG 2280 

TGGGGAGCTG ACAGGGCAGG GGCCCTTGGC TGAGCCCACC CCGCTGGCTC CCAGATCAGG 2340 

QTGQGCCTGA GGCAGGCAGC OGACCTGAGG GCTGGGGGGA CCGTGGACCC CTATGCCCGG 2400 

GTCAGCGTCT CCACCCAGGC CGGACACAGA CATGAGACAA AAGTGGACCG AGGCACGCTC 2460 

TGCCCCGTGT TTGACGAGAC CTGCTGCTTC CACGTGAGTC AGGGATGGTC GGCTGC3GTGQ 2520 

GCCTGGACGG CTGGATGGGC CTGGGCTGGG TGGGCCTGGG CAGCTGGGTG GGCCTGGGCA 2580 

GCTGGGTGGG CCTGAGCTAG GGCAGCAGGG CCTGGCTCAC GCCGCTGOCT CAGATCCGGC 2640 

AGGCGGAGCT GCCAGGGGGC ACCCTGCAGG TGCAGCTTTT CAACTTCAAG CGGTTCTCGG 2700 

GGCATGAGCC CCTGGGTGAG CTCCGTCTGC CACTGGGCAC CGTGGATCTG CAGCATGTTC 2760 

TGGAGCACTG GTACCTGCTG GGCCCGCCGG CTGCCACTCA GGTGAGGTGC TGGTCACCAG 2820 

GCCACRQCCC AAGGCAGAGG TGGCAGGGAC CCTGCCCTAT GGGCCATCGG AAAGACAGGC 2880 

CTGATGGGCA GCATTTTCQQ GGGTCTGAGC CCCAACTCGG CCAGAATCAC CCTCCCGGGC 2940 

TGAAGCCCCT CTTGCTGCCC ACAGCOCGAG CAGGTCGGGG AGCTGTGCTT CTCTCTCCGG 3000 

TACGTGCCCA GCTCAGGOCG GCTGACCGTG GTGGTGCTGG AGGCTCGAGG CCTGCGTCCA 3060 

GGACTTGGAG AGCCGTACGT GAAGGTCCAG CTCATGCTGA ACCAGAGGAA GTGGAAGAAQ 3120 

AGAAAGACAG CCACCAAAAA GGGCACGGCG GCCGCCTACT TCAATGAOGC CTTCACCTTC 3180 

CTGGTGCCCT TCAGCCAGGT CCAGAATGTQ QACCTGGTGC TGGCTGTCTG GGACCGCAGC 3240 

! GAACTOAGCC CGTAGGCAAG GTGCACCTGG GTGCCCGGGC CTCGGGGCAG 3300 
CCCCTGCAGC ACTGGQCAGA CATGCTGGCC CACGCCCGGC GGCCCATTGC CCAGCGGCAC 3360 

CCCCTGOGGC CAGCCAGGGA GGTGGACCGC ATGCTGGCCC TGCAGCCCCG CCTTCGCCTG 3420 

CGCCTGCCCT TGCCCCACTC CTGAATGCAC CACATGCCTC TGTCTCCCCG CTGAGCCCAG 3480 
GCACTTGCCC AGGCCGCCCT GCAGGACCAC TGCAATAAAC GCCTTCTCCT GCC 



A80 Protein sequence:
Gene name: Homo sapiens mRNA; cDNA DKFZp434K0322 (from clone DKFZp434K0322)
Unigene number: Hs.161031
Probeset Accession #: AL137708
Protein Accession #: CAB70885
Signal sequence: none found
Transmembrane domains: 69-85
Cellular Localization: plasma



1 11 21 31 41 51

| | | | | |

MGHPPVSPSA PAPAGTTAXE GlilPDLVAGT PCSLKDSQEG CGDNEAKHGL QLSTOAUSIA 60
STPGPRWALI AGALAAGVLI, VSCLLCAACC CCRRHRKKPR DKESVGLGSA RGTTTTHLVR 120
SQSLIiTQSRE GIJC3RLQSFG QHGEFSPHDG L.TPTEAGR



A81 DNA SEQUENCE

Gene name: Cadherin 3, P-cadherin (placental)

Unigene number: Hs.2877

Probeset Accession #: X63629
Nucleic Acid Accession #: X63629

Coding sequence: 54-2543 (start and stop codons are underlined)



1 11 21 31 41 51

| | | | | |

GCGGAACACC GGCCOGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCftTGGGGC 60 

TOCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 

CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCIGAAGT GACCTTGGAG GCGGGAGGCG 180 

CGGAGCAGGA GCCCGGCCAQ GCGCTGGGGA AAGTAT TCAT GOGCTGOCCT GGG CAAGAGC 240 

CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 300 

AAAGAAGGTC ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 

GAAGACACAA GAGAGAXTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 420 

CCTTCCCOCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 

ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAABG 540 

AOACAGGCTG GTT0TTGTTG AATAAGCCAC TGQACCQOGA GGAGATTGCC AAGTATGAGC 600 

TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 

TCATCGTGAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGAiCACC TTCCGASGGA 720 

GTGTCTTAGA GGGAGTCCTA CCAQGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAOG 780 

ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 

AGGACCCACA OGACCICATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 

CCAGTGGCCT GGACCGGGAA AAAGTC0CTG AGTACACACT GACCATCCAG GCCACAGACA 960 

TGGATGGGGA CGGCTCCACC ACCAOGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 

ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACQAGQCCCA TGTGCCTGAG AATGCAGTGG 1080 

GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1140 

GTGCCACCTA CCTTATCATG GGCGGTGACG AOQGOaACCA TTTTACCATC ACCACCCACC 1200 

CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 

AGCACACCCT GTACGTTOAA BTGACCAACG AGGOCCCTTT TGTGCTGAAG CTCCCAACCT 1320 

CCACAGCCAC CATAGTGGTC GACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 

CCICCAAAGT CGTIGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 

CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 

CAGGGTGGCT AGCCATGGAC CCM3ACAGTG GGCAQGTCAC AGCTGTGGGC ACCCTCGACC 1560 

GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGOTCTTG GCCATGGACA 1620 

ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 

ACCATGGCCC AGTCCCTGAG CCCCGTCAQA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 

ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGOCCAGC 1800 

TCACAGATGA CTCAOACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 

TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 

ACCATGGCAA CAAAGAGCAG CTGACGGXGA TCAGGGCCAC TGTGTGOGAC TGCGATGGCC 1980 

ATOTCQAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGG CTG 2040 

TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAQCGGAAGA 2100 

TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTQA CAACGTCTTC TACTATGGCG 2160 

AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGQTCTGG 2220 

AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280 

TGTACCGXCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 

AGGCGGCTAA CACAGACCCC ACAGCCOCGC CCTACGACAC CCTCTTQGTG TTCGACTATG 2400 

AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 

ACCAAGATTA CGATTATCTG AAOGAGTGGG GCAGCCGCTT CAAGAAGCTO GCAGACATGT 2520 

ACGGTGGCGG GGAGGACGAC TAGGOGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 

CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT QAGGACTTCG GAGCTTGTCA 2640 

GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGT TAG AGTGGTTGCT 2700 

TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 

CACCXGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 

TGCTCAACCC TGTGTCCTOG GCCTGGGCCT GCTGTOACTG ACCTACAGTG GACTTTCTCT 2880 

CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 

TCCTGCATTT CTGGTTTCCA GACCCCAATG OCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 

ATACTGAGTG TGCCTAGGTT GC3X3CTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TAT AT GT ACT AGAACTTTTT TATTAAAOAA A 






A82 Protein sequence:
Gene name: Cadherin 3, P-cadherin (placental)
Unigene number: Hs.2877
Probeset Accession #: X63629
Protein Accession #: CAA45177
Signal sequence: 1-24
Transmembrane domain: 655-675 (second underlined sequence)
Cellular Localization: plasma membrane



1 11 21 31 41 51

| | | | | |

MGLPRGPKAS LLLLQVCWLQ CAASEPCRAV FHEAEVTLEA GQAEQEPGQA LGKVFMGCPG 60
QEPALFSTDN &DFTVRNGET [missing] HPUCIFP6KR ILRRHKRDWV VAPISVPENQ 120
KGPFPQRLKQ liK&ElKDimTK IFYSITGPGA DSPPEGVFAV EKBTOWIfLLN XPLDREEIAK 180
YELFGHAVSE NGASVEDPMH ISIXVTTJQJ3D HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240
DEDDAI1CT5M GWAYSIHSQ EPKDPHDLMF TIHRSTQTIS VISSGLDREK VPEYTliTIQA 300
[missing] TAVAWSIUJ ANDNAPMFDP QKYEAHVPEN AVGHEVQRI*T VTECLDAPN9P 360
AWRATYLIHG GDDGDHFTlT THPESNQGII. TTRKGTjDFF-A KNQHTLYVEV TMEAPFVLKL 420
PTSTATIWH VEDVNEAPTTF VPP9KWEVQ EGIBTGEPVC VYTAEDPDKE UQKI3YRIL-R 480
DPAGtfLAMDP DSGQVTAVGT LDRBDEQFVR NNIYEVMVLA. HDNGSPPTTG TGTLLLTLID 540
VHDHGPVPBP RQITICSQSP VRHVIjMITDK DL6PETSPFQ AQLTDDSDIY WTABVHESGD 600
TWLSLKKFL KQDTYDVHLS [missing] TVIPATVCDC HGHVETCPGP WKGGFILPVL 660
GAVLALlLFLL LVIiLLLVRKK RKIKEPLLIiP EDDTHDNVFY [missing] QDYDITQLHR 720
GLBARPEVVIi RNDVAPTIIP TPMYRPRPAN PDEIGMFIIE NLKAANTDPT APPVUTIiDVF 780
[missing] [missing] DQDQDYBYLM EVK3SRFKKLA [missing]



A83 DNA SEQUENCE

Gene name: putative G-protein coupled receptor
Unigene number: Hs.16085

Probeset Accession #: F07953
Nucleic Acid Accession #: NM_016334

Coding sequence: 104-1471 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 

| | | | | |

AGCACCTGGG AAAAGGCAGA OCGTGTGAGG GGGCCTOTOG OCCCAGCGIG CTGTGGCCTC 60 

GGGGAGTGOG AAGTOGABGC AGGAGCCTTC CTTACACTTC GOCATGR0TT TCCTGAxCGA 120 

CTCCAGCATC ATGATTACCT CCCAAATACT ATTTTTTGGA TTTGGGTGGC TTTTCTTCAT 180 

GCGCCAATra TTTAAAGACT ATGAGATACG TCAGTATGTT GTACAGGTGA TCTTCTCOGT 240 

GACGTTTGCA TTTTCTTGCA OCATGTTTGA GCTCATCATC TTTGAAATCT TAGGAGTATT 300 

GAftTAGCAGC TCCOGTTATT TTCACTGGAA AATGAACCTG TGCGTAATTC TGCTQATCCT 360 

GGTTTTCATG GTGCCTTTTT ACATTGGCTA TTTTATTGTO AGCAATATCC GACTACTGCA 420 

TAAACAACGA CTGCTTTTTT CCTGTCTCTT ATGGCTGACC TTTRTGTATT TCTTCTGGAA 480 

ACTAGGAGAT CCCTTTCCCA TTCTCAGCCC AAAACATOGG ATCTTATGCA TAGAACAGCT 540 

CATCAGCCGG (JlTGGTGTaA TTGGAGTGAC TCTCATGGCT CTTCTTTCTG GATTTGGTGC 600 

TGTCAACTGC CCATACACTT ACATGTCTTA CTTCCTCftGG AATGTGACTG ACACAGATAT 660 

TCTAGCCCTG GAACGGOGAC TGCTGCAAAC CATGGAT AT G ATCATAAGCA AAAAGAAAAG 720 

GATGGCAATG GCACGGAGAA CAATGTTCCA GAAGGGGGAA GTGCATAACA AACCATCAGG 780 

TTTCTGGGGA ATGATAAAAA GTGTTACCAC TTCAGCATCA GGAAGTGAAA ATCTTACTCT 840 

TATTCAACAG QAAGTGGATG CTTTGGAAGA ATTAAGCAGG CAGCTXTTTC TGGAAACAGC 900 

TGATCTATAT GCTACCAAGG AGAGAATAGA ATACTCCAAA ACCTTCAAGG GGAAATATTT 960 

TAATTTTCTT GGTTACTTTT TCTCTATTTA CTGTGTTTGG AAAATTTTCA TGGCTACCAT 1020 

CAATATTGTT TTTGATCGAG TTGGGAAAAC GGATCCTOTC ACAAGAGGCA TTGAGATCAC 1080 

TGTGAATTAT CTGGQAATCC AATTTGAK3T GAAGTTTTGG TCCCAACACA TTTCCTTCAT 1140 

TCTTGTTGGA ATAATCATCG TCACATCCAT CAOAGGATTG CTGATCACTC TTACCAAGTT 1200 

CTTTTATGCC ATCTCTAGCA GTAAGTCCTC CAATGTCATT GTCCTGCTAT TAGCACAGAT 1260 

AATGGGCATG TACTTTGTCT CCTCTGTGCT GCTGATCCGA ATBAGTATGC CTTTAGAATA 1320 

CCGCACCATA ATCACTGAAG TCCTTGGAGA ACTGCAGTXC AACTTCIATC ACCGTTGQTT 1380 

TGATGTQATC TTCCTQGTCA GCGCTCTCTC TAGCATACTC TTCCTCTATT TGGCTCACAA 1440 

ACAGGCACCA GAGAAGCAAA TGOCACCTTQ AAC TTAAG CC TACTACAGAC TGTTAGAGGC 1500 

CAGTGGTTTC AAAATTTAGA TATAAGAGGG GGGAAAAATQ GAACCAGGGC CTGACATTTT 1560 

ATAAACAAAC AAAATGCTAT GGTAGCATTT TTCAOCTTCA TAGCATACTC CTTCCCOCTC 1620 

AGGTOATACT ATGACCATGA GTAGCATCAG CCAGAACATG AGAGGOAGAA CTAACTCAAG 1680 

ACAATACTCA GCAGAGAGCA TCCCGTGTGQ ATATGAGGCT GGTGTAGAGG COGAGAGGAG 1740 

CCAAGAAACT AAAGGTGAAA AATACACTGG AACTCTGGGG CAAOACATGT CTATGGTAGC 1800 

TGAGCCAAAC ACGTAOGATT TCCGTTTTAA GGTTCACATG GAAAAGGTTA TAGCTTTGCC 1860 
ITGAGATTGA CTCATTAAAA TCAGAGACTG T 
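
As an illustration of how the coding-sequence coordinates in these entries can be read, the following is a minimal Python sketch. It assumes the coordinates are 1-based and inclusive and that the range covers both the start codon and the stop codon, which appears to hold for entry A83 (104-1471). The function names `cds_slice` and `codon_count` and the short example sequence are hypothetical and are not part of the listing.

```python
def cds_slice(seq: str, start: int, end: int) -> str:
    """Return the coding region of seq for 1-based, inclusive coordinates.

    Assumption: start points at the first base of the start codon and
    end points at the last base of the stop codon.
    """
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return seq[start - 1:end]


def codon_count(start: int, end: int) -> int:
    """Number of codons (including the stop codon) in a coordinate range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("coding range is not a whole number of codons")
    return length // 3


# Hypothetical toy sequence: two leading bases, ATG CCC TAA, two trailing bases.
toy = "AAATGCCCTAAGG"
toy_cds = cds_slice(toy, 3, 11)

# Coordinates as listed for A83.
a83_codons = codon_count(104, 1471)
```

For A83, `codon_count(104, 1471)` gives 456 codons, which is consistent with a 455-residue protein followed by a stop codon.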



A84 Protein sequence

Gene name: putative G protein coupled receptor

Unigene number: Hs.16085

Protein Accession #: NP_057418.1

Signal sequence: none found

Transmembrane domains: 5-27, 42-64, 76-98, 110-132, 144-166, 289-311, 342-364, 380-402, 424-446

Cellular Localization: plasma membrane

1 11 21 31 41 51 






WO 03/042661 



PCT/US02/36810 



| | | | | |

MSFIilDSSIM ITSQIL.FFGP GWL FFMHQLF KDYEIRQYW QVIFSVTPAF SCTHFELIXF 60

ElLOVUffSSS HYFHWKMNIiC VIM-ILVFMV PFYIGYFIVS NIFXLHKQRI* LFSCU^WLTF 120 

MYTFWKLGDP FPILSPKHGT L5IEQLISRV GVIGVTLMAL LSGFGAVNCP YTYMSYFLRN 180

VTDTDIIALE RRLLQTMDMI ISKKKRMAMA RRTMFQKGEV HNKPSGFWGM IXSVTTSA5G 240 

3EHLTLICQE VPALEELSRO LFLBTADLYA TKERIEY9KT FKGXYFNFLG rFFSIYCVWK 300

IFMATINTVP DEVGKTDPVT RGIEITUNYL GIOPDVKPWS QHISFILVGI IIVTSIRGKE, 360 

ITLTKFFYA1 SSSKSSNVIV LLLAQIMGMY FVSSViLIRM 8MPLEYRTH TEVLQELQFM 420 
FYHRWFDVIF LVSAI*SSII.F LYIiAHKQAPE KQMAP 

A85 DNA SEQUENCE

Gene name: TTK protein kinase

Unigene number: Hs.169840

Probeset Accession #: M86699

Nucleic Acid Accession #: NM_003318

Coding sequence: 1026-3551 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

GGAATTCCTT TTTTTTTTTT TTTGAGATGG AGTTTCACTC T TB TTO GCC& GGCTGOAGTG 60 

CAATGGCACA ATCTCAGCTT ACTGCAACCT CCGCCTCCCG GGTTCAAGCG ATTCTCCTGC 120 

CTCAGCCTCT CAAGTAGCTG GGATTACAGG CATGTGCCAC CACCCCTGGC TAACTAATTT 180 

CTTTTCTATT TAGTAGAGAT GGGGTTTCAC CATGTTGGTC AGGCTGCJTCT TGAACTCCTa 240 

ACCTCAGGTG ATCCACTTGC CTTGGOCTOC CAAAGTGCPA GGATTACAGC OGTGAAACTG 300 

TGCCTGGCTG ATTCTTTTTT TGTTGTTGGA TTTTTGAAAC AGGGTCTCCC TTG^TDQCGC 360 

AGGCTGGAGT GCAGTGGTGC GATCTTGGCT CACTATAACC TCCACCTCCT GGTTTCAAGT 420 

GATCCTCCCA CTTTAGCCTC CTGAGTAGCT GTGATTACAG GCGTGCACCA CCACACCCGG 480

CTAATTTTTG TATTTTTATT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGTTCTCAAA 540 

CTCCTGGACT CAAGGGATCC CCCTGCCTCC ACTTCCCAAA GTCCCGAGAT TACAGGTGTG 600

AGTCACCATG CCTGACCTTA TAATTCTTAA GTCATTTTTT CTGGTCCATT TCTTCCTTAG 660 

GOTCCTCACA ACAAATCTGC ATTAGGCGGT ACAATAATCC TTAACTTCAT GAITCACAAA 720 

AGGAAGATGA AGTGATTCAT G ATT TAG AAA GGGGAAGTAG TAAGCCCACT GCACACTCCT 780

GGATOATGAT CCTAAATCCA GATACAGTAA AAATGGGGTA TGGGAAGGTA GAATACAAAA 840

TTTGGTTTAA ATTAATTATC TAAATATCTA AAAACATTTT TGGATACATT 0TTGATQTGA 900 

ATGTAAGACT GTACAGAjCTT CCTAGAAAAC AGTTTGGGTT CCATCTTTTC ATTTCCCCAG 960 

TGCAGTTTTC TGTAGAAATG GAATCCGAGG ATTTAAGTGG CAGAGAATTG ACAATTGATT 1020 

eCAT AATGA A CAAAGTGaGA GACATTAAAA ATAAGXTTAA AAATGAAGAC CTTACTGATG 1080 

AACTAAGCTT GAATAAAATT TCTGCTGATA CTACAGATAA CTCGGQAACT GTTAACCAAA 1140 

TTATGATGAT GGCAAACAAC CCAGAGGACT GGTTGAGTIT GTTGCTCAAA CTAGAGAAAA 1200 

ACAGTGTTCC GCTAAGTGAT GCTCTTTTAA ATAAATTGAT TGGTCGTTAC AGTCAAGCAA 1260 

TTGAAGCGCT TCCCCCAGAT AAATATGGCC AAAATGAGAG TTTTGCTAGA ATTCAAGTGA 1320 

GATTTGCTGA ATTAAAAGCT ATTCAAGAGC CAGATGATGC ACGTGACTAC TTTCAAATGG 1380 

CCAGAGCAAA CTGCAAGAAA TTTGCTTTTG TTCATATATC TTTTGCACAA TTTGAACTGT 1440

CACAAGGTAA TGTCAAAAAA AGTAAACAAC TTCTTCAAAA AGCTGTAGAA CGTGGAGCAG 1500

TACCACTAGA AATGCTGGAA ATTGCCCTGC GGAATTTAAA CCTCCAAAAA AAGCAGCTGC 1560 

TTTCAGAGGA GGAAAAGAAG AATTTATCAG CATCTACGGT ATTAACTGOC CAAGAATCAT 1620 

TTTCCGGTTC ACTTGGGCAT TTACAGAATA GGAACAACAG TTGTGATTOC AGAGGACAGA 1680 

CTACTAAAGC CAGGTTTTTA TATGGAGAGA ACATGCCACC ACAAGATGCA GAAATAGGTT 1740 

AGCGGAATTC ATTGAGACAA ACTAACAAAA CTAAACAGTC ATGCCCATTT GGAAGAGTCC 1800 

CAGTTAACCT TCTAAATAGC CCAGATTGTG ATGTGAAGAC AGATGATTCA GTTGTACCTT 1860

GTTTTATGAA AAGACAAACC TCTAGATCAG AATGCCGAGA TTTGGTTGTG CCTGGATCTA 1920 

AACCAAGTQQ AAATGATTCC TGTGAATTAA GAAATTTAAA GTCTGTTCAA AATAGTCATT 1980 

TCAAGGAACC TCTGGTGTCA GATGAAAAGA GTTCTGAACT TATTATTACT GATTCAATAA 2040 

CCCTGAAGAA TAAAACGGAA TCAA0TCTTC TAGCTAAATT AGAAGAAACT AAAGAGTATC 2100 

AAGAACCAGA GGTTCCAGAG AGTAACCAGA AACAGTGGCA AGCTAAGAGA AAGTCAGAGT 2160 

GTAIIAACCA GAATCCTGCT GCATCTTGAA ATCACTGGCA GATTCCGGAG TTAGCCCGAA 2220 

AAGTTAATAC AGAGCAGAAA CATACCACTT TTGAGCAACC TGTCTTTTCA GTTTCAAAAC 2280

AGTCACCACC AATATCAACA TCTAAATGGT TTGACCCAAA ATCTATTTGT AAGACACCAA 2340 

GCAGCAATAC CTTGGATGAT TACATQAGCT GTTTTAGAAC TQCAGTTGTA AAGAATGACT 2400 

TTCCACCTGC TTGTCAGTTG TCAACACCTT ATGGCCAACC TGCCTGTTTC CAGCAGCAAC 2460 

AGCATCAAAT ACTTGCCACT CCACTTCAAA ATTTACAGGT TTTAGCATCT TCTTCAGCAA 2520 

ATGAATGCAT TTCGGTTAAA GGAAGAATTT ATTCCATATT AAAGCAGATA GGAAGTGGAG 2580 

GTTCAAGCAA GGTATTTCAG GTGTTAAATG AAAAGAAACA GATATATGCT ATAAAATATG 2640 

TGAACTTAGA AGAAGCAGAT AACCAAACTC TTGATAGTTA CCGGAACGAA ATAGCTTATT 2700 

TGAATAAACT ACAACAACAC AGTGATAAGA TCATCCGACT TTATGATTAT GAAATCACGG 2760 

ACCAQTACAT CTACATGGTA ATGGAGTGTG GAAATATTGA TCTTAATAGT TGGCTTAAAA 2820

AGAAAAAATG CATTGATCCA TGGGAACGCA AGAGTTACTG GAAAAATATG TTAGAGGCAG 2880

TTCACACAAT CCATCAACAT GGCATTGTTC ACAGTGATCT TAAACCAGCT AACTTTCTGA 2940 

TAGTTGATGG AATGCTAAAG CTAATTGATT TTGGGATTGC AAACCAAATG CAACCAGATA 3000 

CAACAAGTGT TGTTAAAGAT TCTCAGGTTG GCACAGTTAA TTATATGCCA CCAGAAGCAA 3060 

TCAAAGATAT GTCTTCCTCC AGAGAGAATG GGAAATCTAA GTCAAAGATA AGCGCCAAAA 3120 

GTGATGTTTG GTCCTTAGGA TGTATTTTGT ACTATATGAC TTACGGGAAA ACACCATTTC 3180 

AGCAGATAAT TAATCAGATT TCTAAATTAC ATGCCATAAT TGATCCTAAT CATGAAATTG 3240 

AATTTCCOGA TATTCCAGAG AAAGATCTTC AAGATGTGTT AAAGTGTTGT TTAAAAAGGG 3300 

ACCCAAAACA GAGGATATCC ATTCCTGAGC TCCTGGCTCA TCCATATGTT CAAATTCAAA 3360 

CTCATCCAGT TAACCAAATG GCCAAGGGAA CCACTGAAGA AATGAAATAT GTTCTGGGCC 3420 

AACTTGTTGG TCTGAATTCT CCTAACTCCA TTTTGAAAGC TGCTAAAACT TTATATGAAC 3480 

ACTATAGTGG TGGTGAAAGT CAXAATTCTT CATCCTCCAA GACTTTTGAA AAAAAAAGGG 3540 

GAAAAAA ATG AT TTGCAGTT ATTCDTAATG TCAGATAGGA GGTATAAAAT ATATTGGACT 3600 

GTTATACTCT TGRATCCCTG TGGAAATCTA CATTTGAAGA CAACATCACT CTQAAGTGTT 3660 

ATCAGCAAAA AAAATTCAGT GAGATEATCr TTAAAAGAAA ACTGTAAAAA TAGCAACCAC 3720 

TTATGGCACT GTATATATTG TAGACTTGTT TTCTCTGTTT TATGCTCTTG TGTAATCTAC 3780

TTGACATCAT TTTACTCTTG GAATAGTGGG TGGATAGCAA GTATATTCTA AAAAACTTTG 3840 






TAAATAAAGT TTTGTGGCTA AAATGA 



A86 Protein sequence
Gene name: TTK protein kinase
Unigene number: Hs.169840
Probeset Accession #: M86699
Protein Accession #: NP_003309
Signal sequence: none found
Transmembrane domains: none found
Protein Kinase Domain: 510-775
Cellular Localization: cytoplasmic and nuclear



1 
I 

MNKVHDIKNK 
VPLSDALLNK 
AHCKKFAFVH 



NSLEjQTNKTK 
SGNDSCELRN 



PPISTSKWFD 
QII^ATPUQNL 
LEEADHQTLD 
KSIDPWBRXfi 
SWKDSQVGT 
IINQISKLHA 
PVNQHAKGTT 



11
I 

FKNEDLTDELi 
LIGRYSQAIE 
ISFAQFELSQ 
TVLTAQESFS 
QSCPFGRVPV 
LKSVQMSHFX 
WQAKRXSECI 
PKSICKTPSS 
QVIiASSSANE 
SYRNEXAYU4 
YWKHMtifiAVH 
VNYMPPEAIK 
IIDPNHEIEF 
EEMKYVLGQb 



21 
I 

SLKKJSADTT 
ALPPDKYGQN 
OHVKKSKQLIi 



NLLNSPDCDV 
EPLVSDEKSS 



NTLDDYMSCF 
CISVKGRIYS 
KLQQHSDKIJ 
TIHQHGIVHS 



PDIPEKDLQD 
VGmSFNSlIi 



31 
I 

DNSGTVNQIM 
E8PARIQVRF 
QKAVERGAVP 
NSCDSRGQTT 
KTDD9WFCF 
ETjIITDSITTj 
WQIPELAEKV 
RTPWJMDFP 
ILKQIGSGGS 
RLYDYEITDQ 
DLKPANFLIV 
SKSXXSPKSD 
VkKCCLKRDP 
KAAKTLYEHY 



41 
1 

t-MANNPEDWL 
ABLKAIQEPD 
LEMLEIALRN 



KNKTESSLIA 
NTSOKHT IFB 
PACQLSTPYG 
SKVFQVXNEK 
YIYMVMECGN 
DGMUCfrTPFG 
VHSU3CILYY 
KQRISIPEIiL. 



51
I 

SLLX,KJ>EKNS 
DAHDYFQMAR 
LNLQKKQLLS 
PPQDAEIGYR 
RDLWPGSKP 
KLHETKEYQE 
QPVFBVSKQS 
QPACPQQQQH 
KQIYAIKYVH 
IDLHSWIrKKK 
IANQMQPDTT 
MTYGKTPFQQ 
AHPYVQIQTH 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840



A87 DNA SEQUENCE
Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)
Unigene number: Hs.170195
Probeset Accession #: BE616633
Nucleic Acid Accession #: NM_001719
Coding sequence: 123-1418 (underlined sequences correspond to start and stop codons)



l 
1 

GGGOGCAGCG 
CTGCCACCTG 
O0RTQCAC0T 
CCCTGTTCCT 



CCATTTTGGG 
GCATGTTCAT 
GCCAGGGCrr 
GCCTGCAAGA 
7QGAACA7GA 
TTTGCAAGAT 
ACATCOGGGA 
AGCACSTTGGG 
AQOAGGGCTG 
GGCACAACCT 
AGTTGGCGGG 
TCTTCftAGGC 
GGCAGAACCG 
A8AACAGCAJ3 
GAGACCTGGG 



AGACGCTGGT 
AdClCAKTOC 
ACftGAAACAT 
TTGGGGCCAA 
CTGCCTTTTG 
AAACATGAGC 
TCCTACAAGC 
GCOGGGCCAG 
TTATGAGCGC 
GGGGACAITTG 
CAATAAAACG 



11 
I 

GGGCCCGTCT 
GGGCGGTGCG 
GCGCTCACTG 
GCTGCGCrCC 
CC5GCGCCTC 
CTTGCCCCAC 
GCTGOACCTG 
CTCCTACCCCT 
TAGCCATTTC 
CAAGGAATTC 
CCCAGAAGGG 
ACGCTTCGAC 
CAGGGAATOS 
GCTGGTGTTT 
GGGCCTGCAG 
CCTGATTGGG 
CACGGAGGTC 
CTCCAAGAOG 
CAGCQACCAG 
CTGGCAGGAC 

tgccttccct 

OCACTTCATC 
CATCTCCGTC 
GGTGGTCCGG 
GTTTTTCTGG 
TGAGACCTTC 
AGCATATGGC 
TGTGCAGGCA 
GTCATTGGCT 
CTACCAGCCA 
GTGTCTGTGC 
AATGAATG 



21 
I 

GCAGCAAGTG 
GGCCCGGAGC 
CGAGCTGCOG 
GCCCTGGCOG 
CGCAGCCAGG 
OGCCCGCGOC 
TACAACGCCA 
TACAAGQCCG 
CTCACCGACG 
TTOCAOCCAC 
GARGCTGTCA 
AATGAGAOGT 
GATCTCTTCC 
GACATCACAG 
CTCTCGGTGG 
CGGCACGGGC 
CACTTCCGCR 
CCCAAGAACC 
AGGCAGGCCT 
TGGATCATCG 
CTGAACTOCT 
AACCCGGAAA 
CTCTACTTCG 
GCCTGTGGCT 
AT CCTCCATT 
OCCrOCCTAT 
TTTTGATCAG 
AAACCTAGCA 
GGGAAGTCTC 
GGCCACCCAG 
GAAAGGAAAA 



31 
1 

ACCGACGGCC 
CCGGAGCCCG 
GGCCGCACAG 
ACTTCAGCCT 
AGCGGCOGQA 
CGCACCTCCA 
TGQCGGTGGA 
TCTTCAGTAC 
CCGACATGGT 
GCTACCACCA 
GGGCAGOCGA 
TCCGGATCAG 
TGCTCGACAG 
CCACCAGCAA 
AGACGCTGGA 
CCCAGAACAA 
GCATCCGGTC 
AGGAAGCCCT 
GTAAGAAGCA 
CGCCTGAAGG 
ACR1GAACX3C 
CGGTGCCCAA 
ATGACftGCTC 
GCCA CTAGC T 
GCTCGCCTTG 
CCCCAACTTT 
tTlTTTCAGTG 
GGAAAAAAAA 
AGCCATGCAC 



41 

I 

GGGACGGCGG 
GGTAGCGCGT 
CTTCGTGGCG 
GGACAACGAG 
OftTGCAGGGC 
GGGCAAGCAC 
GGAGGGOGGC 
OCAGGGCCCC 
CATGAGCTTC 
TCGAGAQTTC 
ATTCCGGATC 
05TTTATCAG 
CCGTACCCTC 
CCACTGGGXO 
TGGGCAGAGC 
GCRGCJCCTTC 
CRCGGGGAQC 
GCGGATGGCC 
DOaGCTGTAT 
CTACGCCGCC 
CACCAACCAC 
GCCCTGCTGT 
CRACGTCATC 
CCTCCGAGAA 
GCCAGGAACC 
AAAGGTQTGA 
GCRGCATCCA 
ACAACGCATA 
GGACTCGTTT 



TTGACCCGGA AGTTCCTGTA 



51 
i 

CCTGCCCCCT 1 

AGAGCCGGGG 

CTGTGGGCAC 

GTGCACTCOA 

GAGATCCTCT 

AACTCGGCAC 

GGGOCGGGGG 

CCTCTGGCCA 

GTCAACCTCG 

OGGTTTGATC 

TACAAGGACT 

GTGCTOCRGG 

TGGGCCTCGG 

OTCAATCOGC 

ATCAACCCCA 

ATGGTGGCTT 

AAACAQCGCA 

AAOGTGGCAG 

GTCAGCTTCC 

TACTACTGTG 

GCCATCGTGC 

QCGCCCAOGC 

CTGAAGAAAT 

TTCAGACCCT 

AGCAGAOCAA 

GAGTATTAGG 

ATGAACAAGA 

AAGAAAAAXG 

CCAGAGGTAA 

GGCAAGGGGT 

ATAAATGTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



A88 Protein sequence

Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)
Unigene number: Hs.170195
Probeset Accession #: BE616633
Protein Accession #: NP_001710.1
Signal sequence:
Pfam domains: TGFb_propeptide [37-281]
Transmembrane domains: none found
Cellular Localization: secreted






1 11 21 31 41 51

| | | | | |

MHVRSUtAAA PHSFVALWAP LFLI.RSALAD FSLDNEVHSS FIHRRUlSQE RREMQREILS 60 

IliGIiPKRPRP HIX2GKHN8AP MFMLDJjYNAM AVEEGGGPGG QGFSYPYKAV FSTQGPPLAS 120 

liQDSHFLTDA DHVMSFVNLV EEDKEFFHPR YHHHEFRFDL 8KIPEGEAVT AAEFRIYKDY 180 

IRERFDNETF RISVYQVLQE HLGRESDLFL. LDSRTLWASE BSMLVFDITA TSNKWWHFR 240 

HNLGLQL5VE TfcDGQSINPK IAGklORHGP QNKQPFMVAF FKATSVHFRS IRSTGSKQRS 300 

QNRSKTPKNQ EALRMANVAE NSSSDQRQAC KKHELYVSFR DLGWQDWIIA PEGYAAYYCE 360

GECAFPLNSY MNATNHAIVQ TLVHFIKPET VFXFCCAPTQ LNAISVLYFD DSSNVILKKY 420 
RMMWRACGC H 

A89 DNA SEQUENCE

Gene name: Homo sapiens mRNA; cDNA DKFZp564O1763 (from clone DKFZp564O1763)

Unigene number: Hs.27373

Probeset Accession #: F13036

Nucleic Acid Accession #: AC012478

Coding sequence: 1-1896 (underlined sequences correspond to start and stop codons)



1 11 21 

I 1 1 

ATGC GCGCCO TGCCGCTGCC CGCCCCGCTC 
GCTCCCGCCG CGCGCGCCAG CAGAGCCGAG 
CaCGAOfTCTC GGCCACCGCC CGGCCQ3GGG 
GOGGCGGGCG GCAGCGGCAG CTCCAGCTCC 
ATTTCCATOC TCCTCCGCGA CCTACCCACC 
TTTACCACCC TCCTCATCGC CTGOCTGCTG 
AA6AAGACAC GGAAGTATGA TATCATCACC 
CTAAATGAAG AGGATGATGA AGATGAGGAC 
TCCTTGCCGG CTGCACTGAG ACGTCAGCTG 
GTQCCCCCAC CCTTCATCCT GGACATTGAC 
GGTGGAATCA GACCTGGTAA AACCTGTTTC 
TCAQCTGCAA CCTGGGQTGT GAAGGACTGG 
GAAACCAAAA CGAACGTTAT GTATAAAACC 
TCAQACTGTC ACTGGCAAGC TCGTTTCCAC 
TTTGGGCATC CCTTTAAAGT GCCCCCTACT 
CTGAATCTCA TGGAAAAGCT GGATTCCTCT 
GCCAGGTGCT TGCCACTGGT CCTGGCAGAA 
CC ITGGTGGC ACTTGAQOGC CACAGGCTCT 
AGTAOCTTGG GCTTGGATGT TTTCTGTGGT 
AGASCA£3TQA CTAAGOTTCTT CCAGGGTAGC 
GCCCTAGAGA GTGGGTTTOC CCATCATCTC 
ACCCATCCTG TCAGGTTGGC TCXJJTCAGAT 
AGGGTGTTTC GGCGTCCGOG GCAGTCTCTG 
TGCCTTTTGG TTTTGAAGAT TCTGTTGAQG 
ATCTGTCTCC CCTGCTGTGC GGTGGAACAC 
GTCCrTGCGT CATTTGAGCA GAGCCCACAA 
AAACGAGGGC CCAGTGGGCA ATTGACCAGA 
GGGAAOCTGC AGACAATTCC AGATACCCAA 
CACCCTGGAG GAGACTTGGA TGGGGTGGCA 
GATGGCAGAT GCCAGAAGAT GGTCCTGATG 
TGTGAQAGGC TCACAGGTTC CCATCACTTC 
TCCCCCCGAC AGCCCCTGTT TCTGTCCAGG 




CCCGGGAACA 
AACAGCAGTG 
CTCAAGGCAG 
CTGG3CQTCT 
ACTCCAGCAG 
TCCACAGTAT 
CCAGGGTGCC 
CTTCCAGCAA 
CCAGGCTGGT 
ACCTGGAAGC 
CCAGCTCCAT 
GTCACCACAA 
TCTACTCCOC 
GCCTTACGCA 
ATGGCGGCTG 
CCAATAAAAA 
GCCGGCCAGC 
TCTTTCTCCA 
AGGCTTCTCA 
GCCCGGGGAC 
CATGGCGGAG 
CGCCATCCTC 
CTACGGGAAG 



CCACCCGGTT 
GOGACGCCTT 
COGTGATOGT 
TCAGGTCGGG 



TCGACATCAA 
AGACGCTACT 
GATGCAGTGG 
GGCATCCTGT 
OCTCTTGCGT 
CGTGCGTGTC 
TGGAGTTGCT 
ATGGTTTTCG 
GAAACACOCG 



cccrrxTACAC 

GGGGCAC2CTT 
AACAGCTGCG 
GAGAGTOTCC 
AAGCCAGCCT 
GGTCAQCGGG 
ACCTTGACCT 
CCAAQAQAAG 



CACACATGCC 
GGCCAGGAAG 
AATTTCTATT 
TCIGAGGAAG 
TCCAGCCATT 
GCCTGA 



GCCCACGTGA 
TGGAGGAAGA 
GGOCACCTAG 
CCAAGTCTTG 



51

I 

GCTCCTGGOC 
CGAACCCGAG 
TGGGTCTGGG 
GGTGACCCGC 

AAAGAGGTTA 
AATGGCGCCA 
ATACAGAGTG 
GACAGTTCCT 
AAGGCCTGAT 
GGAAAGTTGG 
CGGAGGTGTT 
AGGCATCTGC 
TCTGCCACCC 
ACAACTGCAG 
GGCTCCATCT 
CCTTCCAAAT 
ACAAACCATG 
TTGTGAAGAC 
CTGGAAGCCA 
TCOGCTGAGC 
GACGGGGAGG 
TACCGCAACT 
CTTCTACAAA 
CTCAGTGACT 
GGAGCCTGTC 
GATCACACAT 
GGATOTCACT 
GGGTTTCCAG 
T7TGACAGGA 
GTCCTTCCTT 



60
120
180
240
300
360
420
480
540
600
660
720
780
840
900
960
1020
1080
1140
1200
1260
1320
1380
1440
1500
1560
1620
1680
1740
1800
1860



A90 Protein sequence
Gene name: Homo sapiens mRNA; cDNA DKFZp564O1763 (from clone DKFZp564O1763)
Unigene number: Hs.27373
Probeset Accession #: F13036
Protein Accession #: FGENESH predicted
Signal sequence: 1-27
Transmembrane domains: 94-115, 448-469
Cellular Localization: not determined



1 11 21 31 41 51 

I 1 1 I 1 I 

MRAVPXiPAPL LPLLLLAUA APAARASRAB SVSAPWPEPE RESRPPPGPG PGHTTRFGSG 60 

AAGGSGSSSS USSGDALVTR ISIIMDItPT IiKAAVIVAFA FTTIAiIACLli LRVFRSGKRIt 120 

KKTRKYDIIT TPAERVEMAP I^EEDDEDED STVFDIKYRV SLPAALRRQIj PGCQTLLTVP 180 

VPFPPIU5E0 LPARCSGRPD GGIRPGgTCP PAWWHFVESK SAATWGVKDW TWKPSCVGGV 240 

ETKTNVMYKT PAPSCVSGIC 6DCHWQARFH VTTMELLLPP FGHPFK^PPT STPfflSFHQLQ 300 

UJLMEKLD3S ALRHNTRAPS ARCJ.PLVIAE MAAAESDLFN PKWHTSATGS PIKTT»YTQTH 360 

6TEGLDVFCG AGQRGTFCED RAVTKVLQGS SFSKQXJftKKP ALESGPPHRT. RLLRBCPPLS 420 

THPVPJjARSD ARGQASLTGR RVFRRPRQSL HGGGSAGTAT CIJLVIiKlUA RHPHLDLFYK 480 

ICI.PCCAVEH LRBAKRSSVT VLA3FBQ-SPQ KAAAAHGEPV KRGPSGQLTR HTCPGWGITM 540

AKLQTIPDTQ GQEGPREDVT HPGGDLDGVA BFYLEEEGFQ DGRCQKMVLM SEEGPPSLTG 600 
CERIiTGeHHF SBHSKfiWfiFls BPRQPLFliSR P 

BWIKG 

A91 DNA SEQUENCE

Gene name: G protein-coupled receptor 64
Unigene number: Hs.184942
Probeset Accession #: AA435577
Nucleic Acid Accession #: NM_005756
Coding sequence: 73-3117 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51



AGCCftGCCCG AGGACGCGAG CGGCAGGTGT QCACAGAGGT TCTCCACTTT GTTTTCTGAA 60 

CTCGCGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGGCC ATGTTGGCAG AACTGAAGAA 120 

GTTTTACTGA CGTTCAAGAT ATTCCTTGTC ATCATTTGTC TTCATGTCGT TCTGGTAACA 180

TCCCTGGAAG AAGATACTQA TAATTCCRGT TTGTCACCAC CACCTGCTAA ATT AT CTGT T 240 

GTCAGTTTTG CCCCCTCCTC CAATGAGGTT GAAACAACAA GCCTCAATGA TGTTACTTTA 300 

AGCTTACTCC CTTCAAACGA AACAGAAAAA ACTAAAATCA CTATAGTAAA AACCTTCAAT 360 

GCTTCAGGJGG TCAAACCCCA GAGAAATATC TGCAATTTGT CATCTATTTG CAATGACTCA 420 

GCATTTTTTA GAGGTGAGAT CATGTTTCAA TATGATAAAG AAAGCACTGT TCCCCAGAAT 480

CAACATATAA CGAATGGCAC CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAACGCTCA 540 

GAGCTCAACA AAACCCTGCA AAOCCTAAGT GAGACTTACT TTATAATGTQ TGCTACAGCA 600 

GAGGCCCAAA GCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC AATGAATGCA 660 

TGTGCTGCAA TAGCCGCTTT GGAAAGAGTA AAGATTCGAC CAATGGAACA CTGCTGOTGT 720 

TCTGTCAGGA taocctgccc ttcctcccca gaagagttgg gaaagcttca GTGTGACCTG 780

CAGGATCCCA TTGTCTGTCT TGCTGACCAT CCACGTGGCC CACCATTTTC TTCCAGCCAA 840
TCCATCCCAG TGGTGOCTCG OGCCACTGTG CTTTCCCAGG TCCCCAAAGC TACCTCTTTT 900
GCTQAQCCTC CAGATTATTC acctgtgacc cacaatgttc cctctocaat AGGGGAGATT 960
CAACCCCTTT CACCCCAGCC TTCAGCTCCC ATAGCTTCCA GCCCTGCCAT TGACATGCCC 1020
CCACAGTCTG AAACGATCTC TTCCCCTATG CCCCAAACCC ATOTCTCCGG CACCCCACCT 1080
CCTGTGAAAG CCTCATTTTC CTCTCCCACC GTGTCTGCCC CTGCGAATGT CAACACTACC 1140 
AGCGCAGCXC CTGTCCAGAC AGACATCGTC AACACCAGCA GTATTTCTGA TCTTGAGAAC 1200 
CAAGTGTTGC AGATGGAGAA GGCTCTGTCC TO3GGCAGCC TGGAGCCTAA CCTOGCAGGA 1260 
GAAATGATCA ACCAAGTCAG CAOACTCCTT CATTCCCCGC CTQACATGCT GGCCCCTCTG 1320 
GCTCAAAGAT TGCTGAAAGT AGTGGATGAC ATTGGCCTAC AGCTGAACTT TTCAAACACG 1380 
ACtATAAGTC TAACCTCCCC TTCTTTGGCT CTGGCTGTGA TCAGAGTGAA TGCCAQTAGT 1440 
TTCAACACAA CTACCXXTGT GGCCCAAGAC CCTGCAAATC TTCAGGTTTC TCTGGAAACC 1500 
CAAGCTCCTG AGAACAGTAT TGGCACAATT ACTCTTCCTT CATCGCTGAT GAATAATTTA 1560 
CCAGCTCATG ACATGGAGCT AGCTTCCAGG GTTCAGTTCA ATTTTTTTGA AACACCTGCT 1620 
TTGTTTCAGG ATCCTTCCCT GGAGAACCTC TCTCTQATCA GCTACGTCAT ATCATCGAGT 1680 
GTTGCAAACC TGACCGTCAG GAACTTGACA AGAAACGTGA CAGTCACATT AAAGCACATC 1740 
AACCCGAGCC AGGATGAGTT AACAGTGAGA TGTGTATTTT GGGACTTGGG CAGAAATGGT 1800
□GCAQAGGAG GCTGGTCAGA CAATGGCTGC TCTGTCAAAG ACAGGAGATT GAATGAAACC 1860
ATCTGTACCT GTAOOCATCT AACAAGCTTC GGCGTTCTGC TGGACCTATC TAGGACATCT 1920 
GTGCTGCCTG CTCAAATGAT GGCTCTGACG TTCATTACAT ATATTQQTTG TGGGCTTTCA 1980 
TCAATJTTTC TGTCAGTGAC TCTTGTAACC TACATAGCTT TTGAAAAGAT CCGGAGGOAT 2040
TACCCTTCCA AAATCCTCAT CCAGCTGTGT GCTGCTCXGC TTCTGCTGAA CCTGGTCTTC 2100 
CTCCTGGACT CGTGGATTGC TCTGIATAAG ATGCAAGGCC TCTGCATCTC AGTGGCTGTA 2160 
TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG GCCTAGAAGC ATTCCATATG 2220 
TACCTGGCCC TTGTCAAAGT ATT TAAT ACT TACATCCGAA AATACATOCT TAAATTCTGC 2280
ATTGTCQGTT GGGGGGTACC AGCTGTGOTT GTGACCATCA TCCTOACTAT ATCCCCAGAT 2340 
AACTATGGGC TTGGATCCTA TGGGAAATTC CCCAATGGTT CACCGGATGA CTTCTGCTGG 2400 
AICAACAACA ATGCAGTATT CTACATTACG GTGGTGGGAT ATTTCTGTGT GATATTTTTG 2460 
CTGAADSTCA GCATGTTCAT TGTGGTCCTG GTTCAGCTCT GTCGAATTAA AAAGAAGAAG 2520 
CRACTGGGAG CCCAGCGAAA AACCAGTATT CAAGACCTCA GGAQTATCGC TGGCCTTACA 2580
TTTTTACTGG GAATAACTTG GGGCTTTGCC TTCTTTGCCT GGGGACCAGT TAACGTGACC 2640 
TTCATGTATC TGTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2700 
TGTGTGGCCA AA3AAAATGT CAGGAAGCAA TGGAGGCOGT ATCTTTGTTG TBGAAAGXTA 2760 
CGGCTGGCTG AAAATTCTGA CTGGAGTAAA ACTGCTACTA ATGGTTTAAA GAAGCAGACT 2820
GTAAACCAAG GAGTGTCCAG CTCTTCAAAT TCCTTACAGT CAAGCA0TAA CTCCACTAAC 2880
TCCACCACAC TGCTAGTGAA TAATGATTGC TCAGTACACG CAAGCGGGAA TGGAAATGCT 2940 
TCTACAGAGA GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG CCTTCACGAT 3000 
TTCACTGGAA AACAGCACAT GTTTAACGAG AAGGAAGATT CCTGCAATGG GAAAQGOCGT 3060
ATGGCTCTCA GAAGGACTTC AAAGCGGGGA AGCTTACACT TTATTGAGCA AATGTGATTC 3120 
CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACRGTGTGAA ATGTCCAATT TTACCTTTTA 3180
CACAATGTGA GATGTATGAA AATCAACTCA TTTTATTCTC GGCAACATCT GGAGAAGCAT 3240 
AAGCTAATTA AGQGCGATGA TTATTATTAC AAQAAGAAAC CAAGACATTA CACCATGGTT 3300 
TTTAGACATT TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG GTTTTAAACA 3360 
ATACACTAAG AATGACTCCT ATAAAQAAAA CAAAAAAAGG TAGTGAACTT TCAGCTACCT 3420 
TTTAAAGAGG CTAAGTTATC TTTGATAACA TCATATAAAG CAACTGTTGA CTTCAGCCTG 3480 
TTGGTGAGTT TAGTTGTGCA TGCCTTTGTT GTATATAAGC TAAATTCTAG TGACCCATGT 3540 
GTCAAAAATC TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 3600 
TTGTAGAATC ATGGTTGTTT TGTCTCACGT GATAAXTCAG AAAATCCTTG CTOGTTCCGC 3660 
AAATCCTAAA 3CTCCTTTTG GAGATGATAT AGGATGTGAA ATACAGAAAC CTCAGTGAAA 3720 
TCAAGAAATA ATGATCCCAG CCAQACTGAG AAAATGTAAG CAGACAGTGC CACAGTTAGC 3780
TCATACAGTG CCTTTGAGCA AGTTAGGAAA AGATGCCCCC ACTGGGCAGA CACAGCCCTA 3840 
TQGGTCATGG TTTGACAAAC AGAGTGAGAG ACCATATTTT AGCCCCACTC ACCCTCTTGO 3900 
OTGCAC5GACC TGTACAGCCA AACACAGCAT CCAATATGAA TACCCATCCC CPGACCGCAT 3960 
CCCCAGTAGT CAGATTATAG AATCTGCACC AAGATGTTTA GCTTTATACC TTGGCCACAG 4020 
AGAGQGATGA ACTGTCATCC AGACCATGTG TCAGGAAAAT TGTGAACGTA GATGAGGTAC 4080 
ATACACTGCC GCTTCTCAAA TCOCCAGAGC CTTTAGGAAC AGGAGAGTAG ACTAGGATTC 4140
CTTCCCTTAA AAAGGTACAT ATATATGGAA AAAAATCATA TTGCCGTTCT TTAAAAGGCA 4200
ACTGCATGGT ACATTGTTGA TTGTTATGAC TGGTACACTC TGGCCCAGCC AGAGCTATAA 4260 
TTGTTTTTTA AATGTGTCTT GAAGAATOCA CAOTGACAAG GGGAGTAGCT ATTGGGAACA 4320 
GGGAACTGTC CTACACTGCT ATTGTTGCTA CATGTATCGA GCCTTGATTG CTCCTAGTTA 4380 
TATACAGGGT CTATCTTGCT TCCTACCTAC ATCTGCTTGA GCAGTGOCTC AAGTACATCC 4440 
TTATTAOGAA CATTTCAAAC CGCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 4500 
ATTTCAAGTG AATGTTGGAT CTCAGACTAA CCATAOTAAT AATACACATT TCTGTGAGTG 4560 
CTGACTTGTC TTTGCAATAT TTCTTTTCTG ATTTATTTAA TTTTCTTGTA TTTATATGTT 4620 






AAAATCAAAA AT GTTAAAAT CAATGAAATA AATTTGCAGT TAAGA 



A92 Protein sequence

Gene name: G protein-coupled receptor 64
Unigene number: Hs.184942
Protein Accession #: NP_005747.1
Signal sequence: 1-38
GPS domain: 564-615
Transmembrane domains: 624-646, 660-682, 688-710, 733-755, 783-805, 828-850, 858-880
Cellular Localization: plasma membrane



1 
1 

MVFSVRQCGH 
PSSNEVETTS 
GE1MFQYDKE 
TLiNCTFTIKX* 
VCLADHPRGP 
FQPSAPIASS 
VQTDIVNTSS 
LEWDDI0LQ 
NSIGT1TLPS 
TVSNLTRKVT 
SHLTSFGVLL 
ILIQLCAALL 
VKVFNTYIRK 
AVFYITWGY 
ITWGFAPFAW 
NSDWSKTATN 
UGVSPSVQNG 



11 
I 

VGRTEEVLLT 
L*3DVTI*SLLP 
STVPQNQHIT 
JNNTMNACAAJ 
PFSSSQSlPV 
PAI35MPPQSE 
ISDIiENQVLQ 
LNFBNTTISr* 
SLMNNLPAHD 
VTLXKINPSQ 
DLSRTSVLPA 
IiINLiVFIjIiDS 
YXLKFCTVGW 
FCVIFLLNVS 
QFVNVTPMYI* 
GLKKQTVNQG 
EVCIiKDFTGK 



21 
1 

PKIFLVIICL 
SNETEKTKIT 
NGTLTGVLSL 
AALERVKIRP 
VPRATVLSQV 
TISSPMPQTH 
MEKALSLG5L 
TSPSLALAVT 
MELASEVQFN 
DELTVRCVPW 
QMMftl/TFITY 
WIALYKMQC3L 
GVPAWVTII 
MFIWLVQLC 
FAIFNTLQGF 
VSSSENSLQS 



31 
I 

HVVLVTSI.BE 
IVKTFNASGV 
SELKRSKLNK 
MEHCCC8VRI 
PKATSFAEPP 
VSGTPPPVKA 
EPHLAGEHIN 
RVNASSFNTT 
FFETPALFQD 



IGCGLSSIFL 
CISVAVFLHY 
LTISPDNYGL 
RIKKKSGQLGA 
FIFIFYCVAK 
SSNSTNGTTTj 
CNGKGRMALR 



41 

1 

DTDNSSLSPP 
KPQRN1CNLS 
TliQTLSETYF 
PCPSSFEELG 
DYfiFVTHNVF 
SFSSPTVSAP 
QVSRLLHSFP 
TFVAQDPANL 
PSLEHLSLIS 
KSPNGC8VKD 
SVTLVTYIAF 
FIiTjVSFTWHG 
GSYGKFPNGS 
QRKTSIQDLR 
ENVRKQWRRY 
LVMHDCSVHA 
RTBKRGSLHF 



51 
I 

PAKLSW9FA 
SICNDSAFFR 
IMCATAEAQ9 
KLQCDLQDPI 
SPlGBlQFliS 
ANVNTTSAPP 
DMLAPLAQRL 
QVSLETQAPE 
YVISSSVANI. 
RRLNETICTC 
EKIRRDYPSK 
LEAFHMYLAL 
PDDFCHINKN 
SIAGLTFLLG 
IiCCGKLRLAE 



60 
120 

180

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



IEQM 



Fibrosis 



A93 DNA SEQUENCE

Gene name: EGF-like-domain, multiple 6
Unigene number: Hs.12844
Probeset Accession #: W67551
Nucleic Acid Accession #: NM_015507
Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons)

1 11
| |

CCGCAGAGGA GOCTOSGCCA
CGCCCCTGCC GCGGTGCCTG
CGAGTGGAQC GGAGGACCCG
GGGTCCGGCC GGCGCCCTCC
ATGCCTCTGC OCTGGAGCCT
GGGAACGCGG CCAGTGCAAG
TGTCACTATG QAACTAAACT
TGTGAAGCTA CATGGGAACC
AGATGCTTTC CAGGATACAC
AAACCCOGGC CATGCCAACA
CTCAGTGGCC ACAT OCT CAT
ATAAACTGTC AGTACAGCTG
TCRGGACTCC GCCTGGCCCC
GGTAAAGTCA TCXGTCCCTA
AAATGTCACA TTGGTTTCOA
AA5CGAATGTA CTATGGATAG
GGGTCCTTCA AGTGTAAATG
ATCCCTGAAA ATTCTGTGAA
AAGAAOTTGC TTGCTCACAA
CCAGAACCCA CCAGGAGTCC
ATAGTTTCCA GAGGOGGGAA
GAGGGGCTTG AGGATGAGAA
AGCCTGCOAG GAGATGTGTT
CTGGTCCAAA GGAAAGCGCF
GACTGCACJCT TCAATCATGG
TGGAATCCTG CTGATCGAGA
GGTCACAA0A AAGACATTGG
AACTTCTGTT TGCTCTTTGA
TTTGTGAAAA ACAGTAACAA
TGGAAGACAG GGAAAATTCA
GAAOCAGAAC GTGGCAAGGG
TCAGGCTTAT GTCCAGATAG
TTGACTTTGT ATGTCAGTTC
TTAGAATTAC TAGCTOAAAA
TOCTGTATAA GATATGCCAA
TTTCTGAATC TTTCCACATr
CAGTATATCT {3ATTTGTATA
TAGAAAAAAA AGCACAGAGA
TATGACATCA AAGATAGACT
TGTATATTTA AATTCTTTGT



21 31 41 51

| | | |

GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 
GCCTOOCCTC CCAGACTGCA GGGACAOCAC CCBGTAACTG 
AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 
CGAGGGGGGC TCAOGAGGAG GAAGGAGGAC 
TGCGCTCOCG CIGCTGCTCT OCTGGGTGGC 
GCATCACGGG TTGTTAQCAT CGGCAGGTCA 
GGCCTGCTGC TACGGCTGGA GAAGAAACAG 
TGGATGTAAG TTSGGTGAGT GCGTGGGACC 
CGGGAAAACC TGCAGTCAAG ATGTGAATGA 
CAGATGTGTG AATACACACG GAAGCTACAA 
GCCAOATGCT ACGTGTGTGA ACTCTAGGAC 
TGAAGACACA GAAGAAGGGC CACAGTGCCT 
AAATGGRAGA GACTQTCTAO ATATTGATGA 
CAATCGAAGA TGTGTGAACA CATTTGGAAG 
ACTGCAATAT ATCAGTGGAC GATATGACTG 
CCATAOGTGC AGCCACCATG CCAATTGCTT 
CAAGCAGGGA TATAAAGGCA ATGGACTTOG 
GGAAOTCCTC AGAGCACCTG GTACCATCAA 
AAACAGCATG AAAAAGAAGG CAAAAATTAA 
TACCCCTAAG GTGAACTTGC AOCCCTTTCAA 
CTCTCATGGA GGTAAAAAAG GGAATGAAGA 
AAGAGAAGAG AAAGCCCTGA AGAATGACAT 
TTTCCCTAAG GTGAATGAAG CAGGTGAATT 
AACTTCCAAA CTGGAACATA AAQATTTAAA 
GATCTGTGAC TGGAAACAGG ATAGAGAAGA 
TAATQCTATT GGCITCTATA TGGCAQTTCC 
CCGATTGAAA CTTCTCCTAC CTGAOCTGCA 
TTACCGGCTG GCOGGAQACA AAGTCGGGAA 
TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 
GTTOTATCAA GGAACTGATG CTACCAAAAQ CATCATTTTT 
CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 
OCTTTTATCT GTOQATQA CT GAA TGTTACT ATCTTTATAT 
CCTGGTTTTT TTGATATTGC ATCATAGGAjG CTCTGGCATT 
ATTGTAATGT AOCAACAGAA ATATTATTGT AAGAXGCCTT 
TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 
ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCOCTCQT 
AGTAAGTTGA TGAGCTTCTC XCTACAACAT TTCTAGAAAA 
AATGT TTAAC TQTTTGACTC TTATGATACT TCTTGGAAAG 

tttgcctaag tggcttagct GGGTCTTTCA tagccaaact 

AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 



aggtggtttc 
gcczogggtc 
caagggagtc 
aaacaaatgc 
gtgtggaatg 

GTGCTTTTGC 
ATGTGGCATG 
GTGTCCATOC 
ATGTGCCTCT 
CTACTACTGC 
TATAGATATA 
CAATACCCAA 

A3ACAGAATC 
AAATGTTACC 
CTATGAAGAG 
GAAAATGAAA 
AGAGGAGCGA 
CGGOCTGATT 
TATCTCQGTT 
TGATTTTGAC 
GGCCTTGGCA 
ACCCCAAAGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860
1920
1980
2040
2100
2160
2220
2280
2340 






A94 Protein sequence
Gene name: EGF-like-domain, multiple 6
Unigene number: Hs.12844
Probeset Accession #: W67551
Protein Accession #: NP_056322.2
Signal sequence: 1-21
Transmembrane domains: none found
MAM domain: 402-546
EGF domain: 50-259
Cellular Localization: secreted



1 
I 

MPLPWSLALP 
CBATCEPGCK 
LSGHKLMPDA 
GKVECPYNRR 
GSPKOCCKQG 
PEPTRTPTPK 
SLRGDVFFPK 



11 
1 

LLL6WVAGGF 
FGECVGPNXC 
TCVNSRTCRM 
CVNTFQ3YYC 
YKGHGLRCSA 
VKLQPFNYHE 
VKEAGBFGLI 
GPYMAVPAIiA 



8GLCPDSKL9 VDD 

21
|

GNAASARHHG
RCFPGYTGKT
INCQYSCEDT
XCHIGFELQY
IPEWSVKEVL
IVSRGGNSHG
LVQRKALTSK
GHKKDIGRLK
WKTOKIQLYQ

31
|

LLASARQPGV
CSQDVNECGM
EEQPQCLCPS
ISGRYDCIDI
RAPGTIXDRI
GXKGNEEKMK
LEHKDLNISV
LLLPDLQPQ.fi
GTDATKSIIF

41
|

CHYGTKLACC
KPHPCQHRCV
SGLRLAPNGR
NECTMDSHTC
KKLLAHKETSM

DCSFNHGICD
KFCLLFDYRL
EAERGKGKTG

51
|

YGWRRNSKOV
NTHGSYXCFC
BCLDIDECAS
SHHANCFNTQ
KKKAKIXNVT
KAIfKNDIEER
WKQDStEDDFD
AGDKVGKURV
EIAVDGVLLV

60
120
180
240
300
360
420
480
540

GLIOBLASTOMA

A95 DNA SEQUENCE
Gene name: ESTs
Unigene number: Hs.293616
Probeset Accession #: AW043782
Nucleic Acid Accession #: none found
Coding sequence: 38-1075 (underlined sequences correspond to start and stop codons)



1 
) 

AGCAACGACG 
CCTGCTGCTG 
GTGCAACATA 
GTGTQACGGG 
GTCGAAATGT 
CTTCCGGTGC 
AAACCCTCTG 
GAQCJTCATC 
AAGTTCTCAA. 
TTACCCCAGC 
CCTGCTGGCR. 
GCACC3GCTG 
CTGCAACGTC 
GAATGOGTCG 
TGCQTGGTAT 
CGACCTGCCC 
CAGCAGCCTC 



AGTTATTCCA 
TGCXCATGGG 
AACTATCTCT 
TGACATGATC 
CACCCSCATT 
AAATAGGCTG 
CQCXGGACCC 
ATGATCTAAC 
ATCAAAACCT 
AAGAAAACTT 
AAGGACTCTG 
CTCAlTCTGA 
GAGCCCCTOC 
TACACCTGCC 



GTATGTCCCT 
CTCCAAAGTT 
ACTGGTTTCT 
CTGCACTGTG 
GGTCAGGGTC 
AGACAATTTO 
TGAAACAGTG 
AGCTGXCTCT 
ACACCCTTGC 
ACATTTGTGC 
AGAGGGACTC 
TTCTCTGTCT 
AGGTGTTGTT 
CCACTCCGGG 
AACCTGTTTG 



11 
I 

CCGGGCAGCG 
AGCAQCGCCG 
CCAGGCAACT 
CTGCCTOACT 
GGCCCAACCT 
AATGGGTTTG 
CTTTGCTCCA 
TGCBATGGAC 
GAACCCGGCA 
ATCACCTATG 
CTGGTCTTGC 
CAGCACCCTG 
ACCTACAACG 
GAAGTAQGCT 
GACCTTCCTC 
CCCTACCGCT 
CTGAGGGTGG 
GAGCCCAGGG 
AAGTCCATAT 
AAGCTCTTTA 
GCATTCCCCT 
TQTTGTGCGT 
TTTCACATTA 
GGAX3RQAGCA 

aattctctct 

CAGGAflGCCA 
GCTTTGCACA 
TGGAGGTGAG 
AAACCATCTA 
GAGCTTTCCT 
CATGAGTTTA 
CTGGCTCEAC 
AGCCAAGGAA 
GTGGCCCACA 
CCCTTAACAC 
ATCACAGQTG 
CACGCTCCTC 
AGGCCTCTCC 
GAGTCAAGAT 

tgtttbttt t 
cccgctgagc 

ATTGTTGCAC 
CTCTCTCCCT 
CCAGTCAGCC 
TGGCAAGAAA 
CAGCTC3TCAC 
ACGCTAATTA 



21 
I 

GGAGCGGCGG 
CGGAGAGCCA 
TCR.TGTGCAG 
GCTTCGACAA 
TCTTCCCCTG 
AGGACTGTCC 
CCGCCCGCTA 
AGAATAACTG 
GTGGGCAGGT 
CCATCATCGG 
AOCACCAGCG 
TGCTGCTGTC 
tTCAATAATGG 
CCCCACCCTC 
CACCQCCCTA 
CCCGGTCCGG 
AAGACACCAG 
ACTCTGAGCC 
GGGTTAATCT 
AQCACCTGTA 
CCTCCCCCAG 
CTTTTCTGTC 
TTCTGXTTCT 
ATGTTTCTGT 
GCXGGGTAGT 
TCACTGGATG 
ATCCTATTTG 
TAACACCCTT 
CCCTGTATAA 
CAGCAGCATA 
TCCAAGTTCT 
AGCCACTTAC 
TGAGGACCTA 
CCCAGCCTGT 
TTGCAAAGTC 
AGAGCCATOT 
TTCCCAAGGT 
CAACATCCCA 
TTTCCATTTG 
TTCCCTTCTA 
TTCCTTTAAC 
CCCGTGATAA 
TTTGAGGTTA 
CCGTGTATAG 
ACAGGGCCCG 
CCACACTGAC 
CCATTCAGAA 
AAACAGAGCC 



31 
I 

CCGCGCCATG 
GCTGCTCCCC 
CAATGGACGG 
GAGTGAIGAG 
TGCCAGCGGC 
CGATGGCAGC 
CCACTGCAAG 
rCAAQACAAC 
GTTTGTGACT 
CAGCTCCQTC 
GAAGCGGAAC 
COGCCTGGTQ 
CATTCCAGTAT 
CTACTCCGAG 
CTGTTCTGAC 
GAOTGCCAAC 
CCACAGOCCa 
CAGCCAGGGC 
GCTCTGACTT 
AGGATGTCTC 
ACTTCAGABA 
AGGTCACTCT 
GTTGGAGAGA 
GCTATATTGG 
3ACCTTATAG 

gtcacgcccc 
atgcccccag 
cagcagtcgc 
atfctogctt 
tatcatca0c 
cagctcctaa 

CTGGTTTCTG 
ACTTGAGTTG 
CTTGCTCATT 

ctttttacct 
tcaatacctc 
cccaatacca 

GTAGTTTCTC 
GATCTATTTT 
GTTAAGGGAC 
AAGGTCCAAA 
CAAGTCACTC 
rTATTTATCA 
TCTCTATGTT 
CCTCCCTGCA 
TGATCAGGGG 
CTTCTTTCCG 
TGCAGGAAGT 



41 
I 

TGGCTGCTGG 
GGGAACAACT 
TGCATCCCGG 
AAGGAGTGCC 
ATCCATTGCA 
GATGAAGAGA 
AACGGCCTCT 
AGTGATGAGG 
TCAGAGAACC 
ATTTTTGTGC 
AACCTCATGA 
GTCCTGGACC 
GTGGCCAGCC 
GCCTTGCTGG 
ACGGAATCTC 
AGTGCCAGCT 
GGGCAGCCTG 
ACTGAAGAAG 
GTTGCCATTC 
AAGTTACAGT 
TGTTTTTCTG 
TCCCTTGGGA 
CAGCATATAA 
ATGCTCAGAA 
CATTTGGGGA 
CAAAAAAATT 
TTCAGCAGAG 
AACGTTATTT 
TAGAAATTTG 
CTCATCCTAA 
AATGCAGGCT 
GACTGTCACC 
GCCCAAAGTC 
CATGCAGCCT 
GTGCATTTGG 
CAGCAAGCTC 
GCACCTCTAG 
CTCTGAGACA 
AAATCTTTTA 
TATTTATATG 
GAAAGATGCA 
CAGACTAACC 
AGTYCTTGAA 
TQTGCTAGTT 
GGAATAAGGG 
TAAAATGOAA 
CAQCTGAAGA 
GGGGCTAAAG 



51 
I 

GGCOGCTGK3 
TCACCAATGA 
GCGCCTGGCA 
CCAAGGCTAA 
TCATTGGTCG 
ACTOCACAGC 
GTAtTGACAA 
AAAGCTGTGA 
AACTTGTGTA 

CGCTGCCCGT 
ACCCCCACCA 
AGGCGGAGCA 
ACCAGAGGCC 

CCCAGGCAGC 
GCCCCCAGGA 
T ATAAQ TCCC 
TAACAATTTG 



CCCGAGATCA 

aacautattg 

GTGCAGGAGA 



CCMTTGAGC 
TCAGTGGCCA 
TGGTT3TGTG 
CCCAAGAATG 
AATAQGCAGG 
GCCAAGACCC 
CTCCCAGCTG 
TGACCTGGCT 
CAACACTGGC 
ACTTGAGGAC 
TCCTGGCTCC 
TTAGAGTTAG 
CATGGQCAAG 
GAAATGCATT 
TGTATAGGAA 
AAAGGAGATC 
TGTGTGDCAG 
GGAAGCAGAA 
TTTCTTTTTT 
GXAAAACGTT 
CCAGGTAGAQ 
AATGTICAGT 



60 
120 
180 
240 
300
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880






TGATCCTGTT 
RAG CT AG CCA 
AACGGAAAGG 
CTGAGTAATC 
CATTTCATCT 
TTTeTGGTGC 
GAGTTAATCT 
AGATAAGGGA 
TGATTTTTTT 
TTTTTGGGGG 
TTTTTAACTC 
AAAATAGTCT 
AGTTACACTG 
TTTTTAAAAA 
GAAAOGTTGT 
TTATACTTTC 
AAMMAAAAAC 
TGGGGCGGCG 
GAAACCCTGG 
GGAAACCGCA 
GGACTCAACC 



CTGTAGACTT 
CTGGTATTTT 
AACCTAGCTG 
CAATAAAGAA 
CCTGTGAQTC 
TCTGGAAGTT 
CACTCGCTTT 
TGCCTACTAA 
AATGAATGTT 
GAGGGGGTTT 
ATTCCAACCA 
CATCTCTTTT 
TQATGACTGG 
ATGCAACTAA 
GTSTOGTTGC 
TAATAAATTT 
AWYWTTGGGG 
GGGCCCACGT 
CCAAGAAAAA 
GAGTGTTGCG 
AGGAGGACCC 



TTCTTTCTTT 
GTTTTGTTTA 
CCTGTATCTT 
CTTTTOATGA 
AGAAGGGCTT 
GTTTAGAOGA 
TCTGCTTCCA 
TGCT 1TTTTA 
TTTAAAAATA 
GTTTTCCAAC 
GGAAGCTTTT 
TTTCTCAAAT 
OCTATTACCT 
GTGGTTAATA 
TTTTTGTGTT 
GCAGTTTCAT 
GGGCTTGGGC 
AGGTACGGCG 
GGTGGCGAGA 
TAAAOCACAC 
AAGGGAACCC 



TTTTAACCAA 
AAAAAAAAAA 
TCATTTTTAA 
CAGCCAGAAT 
TATTTCTCCC 
AAGAATTCTA 
GGCATCTTAG 
AAACAAACAG 
TATAAATAGG 
TCAAGATGGC 
TTATACATTG 
GAGATCCGTQ 
GACTCAGCTC 
GTGTGTGACG 
TTGOTTAGGC 
TCTTTCTGTT 
CTOOOAAAAA 
ACCACGCGGG 
ATTCTOCACA 
COGAAGAGA3 
GATAOAGTAC 



ATCCAAAGGA 
GAAAGAAAGA 
AATAGCACTT 
GTGTTAGAAC 
TTTGATGGGG 
ATTTTAATTA 
GAAAAACAAA 
GGACA1TTTT 
ACACCAAAGC 
ACATTAGTGG 
CCTAAATCTA 
TTTTATTTTA 
CCTCTACCTT 
CTCAAAGTTA 
TTGGTTTTGT 
TGTGCAAAWG 
GTTTTTAACA 
CCCAAACGGG 
CCAOAAAAAA 
AACTCAGAAG 
G 



TGTTACAGAA 
AAGAAAQAAA 
GAQTT AT TTT 
TCTGGCTGAA 
CCCCTTCTTC 
ATTGCGCAGT 
TGGTTTTAGT 
ATTATAGATT 
GGCAGGGTTT 
CCAGCAATAT 
CGCCAACCAG 
GCATTAAATT 
GAAATTOACA 
ATGTAAACTG 
TTTTTAATTT 
GWMCTAMAHM 
CCACTlCGGG 
ACCCCAGAAG 
ACGCGCCGGG 
CACACAAGCa 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780
3840 
3900 
3960 
4020 
4080 






A96 Protein sequence:
Gene name:
Unigene number: Hs.293616
Probeset Accession #: AW043782
Protein Accession #: none found
Signal sequence: 1-17
Transmembrane domains: 163-191
LDLa domains: 28-66, 70-108, 112-143
Cellular Localization: plasma membrane



1 11 21 31 41 51

| | | | | |

MWUjGPLCLIi L3 9AAE SQLL PGNNFTNECN IFGHFMCSNG RCIPGAWQCD GLFDCFDKGD 
EKECPKAKSK CGPTFFPCAS GIHGIIGRFR CNGFKDCPDG SDEENCTANP I*LCSTARYHC 
KNGLCIDKSF ICDGQSNCQD HSDEESGEBS QEPGSGQVFV TSHRQIiVxYP SITYAIIG6S 
VIFVLWAIiI* ALVLBHQRKH MNLMTDFVHR LQHPVLIiSRIj WLDHPHHCH VTYNVHtJGlQ 
YtfASQABQNA SEVGSPPBY3 EAXiLDQRPAW YDLPPPPYSS DTESLNQAJDL PPYESRBGSA 
NSASSQAASS LLSVEDTSH8 5GQPGPQEGT AEPRDSEPSQ GTESV 



LUEPG 



60 
120 
180 
240 
300 



A97 DNA SEQUENCE

Gene name: putative GPCR, Weakly similar to dJ365O12.1
Unigene number: Hs.256897
Probeset Accession #: BE001836
Nucleic Acid Accession #: BE001836
Coding sequence: 1-2955 (underlined sequences correspond to start and stop codons)



1 
I 

ATGG CCGTCA 
ATGTTAAGrA 
GGTTTAGGCC 
GGGAATATTQ 
AACATAGATG 
GAGAAGTCAA 
GAATATCAGC 
AATTTTCTGA 
AGAGCAAAGG 
GACAGCTACA 
GCTGGAGCAC 



ttgaattcat 

AAAGCATATG 
AGCATCGTTG 
ATTGAACATG 
GGCTCTXTCA 
AAGOATGATG 
TGTGAGTCCT 
CTGAACAAGA 
QTGCAAAATC 
TCGGTGGTGT 
TCCAATTCAA 
GTAACCAACT 
ACATTAGAAA 
AAATTCATTG 
TATCAGATTA 
GGGTCAGACC 



11 
I 

AATCTTTTTC 

TGATAGGGAT 
CACATAGCAC 
TTCACTCTAA 



TGCTGCrrCA 
AGCTCTTGAA 
CTACCACAGA 
CCTGGTTTCC 
TCCCAAGCTG 
CAAAGATTTG 
CTTCTGCTAT 
AAAGAATTCA 
GTGGGTATGA 
TTGCCGAGAA 
GAGTGTTCGG 
AATATACCCT 
CTCGGTGGCA 
ATTTCAGTAT 
TTTCTGTCAT 
CGATTCTGAG 
CAATGGAGGA 
GGACAGTCTT 
ACATCAGCAC 
ACTGGAAAGG 
AAATGTG2 C C 
AATTCCAGAG 
TTCTACCCGT 



21 
I 

CTTCAACACT 
CAGTGGCCCA 
TAAAGGTCTT 
AAAAGATGTT 
CCTGATTAGT 
AATOCAGCAA 
GGTGACCTAT 
GCCTCCATTA 
CTGCAACAGC 
TCCCTCATGC 
TGAATGTCAT 
OOGCACTTTC 
ATACTCCAAA 
AGGTTTTGAG 
AGTTGTTGGC 



31 
I 



AAAAGCCCAG 
GCCCTQCAGC 
GGTCATCAGG 
GATTGTAOGC 
CATTCGGCAA 
CAATATTTCA 
TGTCATCAGT 
ACTGCGGGAA 
TCTDGTGCCT 
OATTCCAGTG 
CCAAAATACA 
ATCCCTTCCA 
TTCCAAAAAT 



GCTGGTGTGG 
CAACATGCAA 
TCTATTAGAA 
GATGGCCATC 
GTAACTATAA 
AGAGATTCCA 
TTATGGTCAC 
CTQAATGGAG 
CTTGATCCCC 
CTCAACAACC 
AAAATTAATG 
TATGCAAATG 
TCGGTTCAGG 
TCCAGCAGTG 
GOCCTTCACA 
TGTAATGACA 
AGTGGCTACA 
GAGACTTGTG 
AATGCCACTG 
AACCCATCAA 
TCTCTGTCAC 
ATAGCTGACA 
GAAAAGTATG 
CCGACAGCTC 
AACAAAAGCC 
TCTATTCCCA 
OAAACTATTA 
GGAAATGCTC 



41 
) 

GGCTASACCC 
AAGACAAGTG 
TTAGAAGCTC 
TTTACTTACT 
AGTTAACCCA 



AGGAGAAAAG 
ATGGGCTAAT 
TCCTGCAGTG 
AGAACTGCTA 
TCAGCCAGAG 
AAAGGTTTAC 
GAATT0AAAT 
TCACCCAATT 
CATCTGAACT 
AGCTGTTTCC 
TTGTCTTTGG 
GGGGAAACAT 
TGCTCTCTCT 
AGGCAGCTGT 
CCACAGTGGG 
TGGCCAGCCA 
ATATCC3TAA 
CCAGCTCACG 
TTCCTCTGAA 
AACTCAAAAG 



51 
I 

AQACCCAATC 
GGTCATACTG 
AAATGTTGAC 
CTTTCATGAA 
AGTGCACTCA 
CCCAGTCGAA 
AGATTTGAGA 
TAOAATTATC 
TACCXGTQAA 
CCTTCACACG 
TGTCAATTTC 
AAATGACCTT 
TCAACTTAAA 
TCGAAATGGA 
GCTGXCAOCC 
ATTAGAAGAC 
ATTTGGGTCC 
CACAGCCAAG 
GCTTGAAGAA 
GTCATCCTIC 
GAATCTGGCT 
TTTCAGGGTG 
TTCAGCCTCA 
GTTACTAGAG 
TTTTTCTCGG 



TCAGCATOGC 
AGGTCAATGG 



TGTGTTAATT 
CTCGXTQACT 
ACCTGTGATA 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 






TCCACGGTTA 
TCAAACCTG/V 
GCftQGCTGCC 
ACCTCCTTCT 
TGGATCACCT 
GAGGCTTTGT 
ATGGTGAACA 
GTGGACACCA 
TTCTACCTCT 
ATCCTCGTGT 
TATGGGTGCC 
TACAAAAGGA 
TTTGTTGTCC 
ACAAAGCTCT 
ATCCGCGTGG 
GGAATAGGAA 
AATGCATTCC 
CAACTTCTGT 
TCATCAGATT 
QGCCATTATG 
GTCTCAAATG 



TTCAAAACTA 
GCCAGCCTCA 
ACCTAGTGAA 
CCATATTGAT 
AtaTGGGACT 
TTTGGAAGCA 
TAGCCCTGTC 
CGGTGAACCC 
CTTTGTTCTT 
TCCATCACAT 
CTCTCATTAT 
AAGATGTGTG 

ctgcactggc 
ggaggccgac 
ggaagagcct 
caatagtgga 

AGGGATTTTT 
TCAACAAGTT 
TATCTGCCAA 
CATTTTCTCA 
ARTAA 



TTCCATAAAT 
TTGTGTGTTT 
TGAAACTCAA 
GTCACCTTTT 
GGGTATCTCC 
GATTAAAAAA 
GCTCTTGATT 
TTCIGGAGTC 
CTGGATGCTC 
GGCCCAGCAT 
ATCTGTCATT 
TTGGCTTAAC 
TATTGTGGcr 
TGTTGGOGAA 
CCTCATTCTG 
CAGCCAGAAT 
TATCTTATGC 
GTCTGOCTTA 
ACCCAAATTC 
TACTOQAGAT 



OAAGTTTTCC 
TGGGATTTCA 
GACATCGTGA 
GTCCCCTCTA 
ATTGGAAGTC 
AGCCAAAOCT 
GCTGATGTCT 
TGCACAGYTG 
ATGCTTGGCA 
TTOATGATGG 
ACCATTGCTG 
TGGTCCAATG 
GTGAACTTOG 
AGACTGAGTC 
ACCCCTCTGC 
CTGGCTTGGC 
TTTGGAATAC 
AGTTCTTGGA 
TCAAAGCCTT 
TCCTCCGACA 



TATTTTTTTC 
GTCATTTGCA 
CGTGCCRATG 
CAATCTTCCC 
TCRTTTTATG 
CTCACACACG 
GGTTTATTGT 
CTGTGTTCTT 
TCCTGCTGGC 
CTGTTGGATT 
TCACGCAACC 
GAAGCAAACC 
TTGTGGTGCT 
GGGATGACAA 
TAGGGCTCAC 
ATGTTATTTT 
TCTTGGACftS 
AGCAAACAGA 
TCAACCCACT 
ACATCATGCT 



CAAGATAOAG 
GTGGAACGAT 
TACTCACTTG 
CGTTGTAAAA 
CCTGATCATC 
TCGTATTTGC 
TGQTGCCACA. 
TACACACTTC 
TTACOGGATC 
TTGCCTGGGT 
TAGCAATACC 
ACTCCTGGCT 
GCTAGTTCTC 
GGOCACCATC 
CTGGGGCTTT 
TGCTTTACTC 
TAAGCTGCGA 
AAAGCAAAAC 
GCAAAACAAA 
AACTCAGTTT 



1800
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 



A98 PROTEIN SEQUENCE
Gene name: putative GPCR, Weakly similar to dJ365O12.1
Unigene number: Hs.256897
Protein Accession #: none found
Signal sequence: 1-17
Pfam domain: 7tm_2 [561-820]
Transmembrane domains: 545-567, 571-593, 605-627, 641-663, 677-699, 721-743, 767-789
Cellular Localization: plasma membrane



MHALIlLCFSV 
LIRIIRAKAT 
QSVNFCERTK 
QFRNGSrVAG 



AVSBFVQNIiS 
1MSASVTNWT 
KRGVGYQ1KM 
SGFVISTVIQ 
QCTHLTSFSI 
TRRIQWNIA 
LAYRIIIjVFH 
KPLLAFWPA 
LTWGFGIGTI 



11 
I 

UJOASGLSbli 
TDCHfiUaGVX, 
IWGTFKUIER 
YEWG9SSAS 
TLPCSSGYRG 

VUtREEKYAS 
CPQHTSIPIR 
NY8 INEVFLF 

lmsffvfsti 

LSTjLIADVWF 
EKAQSEiMMAV 
LAIVAVHFW 
VDSQULAWHV 



21 
I 

QSPVEEYQLL 
QCTCEDSYTM 



ELIiSAIEHVA 
NITAKCEfiSG 
VGNI*ASW8I 
SRLLETLEN3 
GKVI*IGSDQF 
FSKIESKLSQ 
FFWKWITYV 
rVGATVDTTV 
GFCLGYGCPL 
VUjVLTKLHR 
IFALIiNAFQV 



31 
] 

LQVTYRDSKE 
FPFSCIiD'PCJff 
AIYSKYANGI 
SKAKTALHKL 

HGViasrcvii 

LSNISSLSLA 
BTLVFFTALP 
QRSLiPETriS 
PHCVFWDPSH 
GLGISIGSliI 
MPfiGVCTAAV 
IISVITIAVT 



RTVTITYCIV 



41 
I 

KRDLRNFLKIi 
CYIiHTAGALP 
EIQUCKAYER 
FPLEDGSFRV 
SrJiRRTiNKNF 
SHFKVSNSTM 
LNFSRKPIDW 
MA3Z«TLGNHi 
LQWNBAGCHL 
LCLIIEALFW 
FFTHFFYIiBl* 
QPSHHTYKSKD 
EJKATnUVGK 
X 



51 
I 

IiKPPLIiWSHG 
flCZECHLNNlrS 
IQGFESVQVT 
FGKAQC3SDIV 
SMIVGNATEA 
EDV1SIADNI 
KGIPVNKSQL 
PVSKNGUAQV 
VNETQDIVTC 
KQIKKSQT&n 
FFHMLHLGIL 
VCWUNWSRGS 
ELLILTFLLG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



A99 DNA SEQUENCE

Gene name: putative G-protein coupled receptor
Unigene number: Hs.16085
Probeset Accession #: F07953
Nucleic Acid Accession #: NM_016334

Coding sequence: 104-1471 (underlined sequences correspond to start and stop codons)



1 
I 

AGCAOCTGGG 
GGGGAGTGGG 
CTCCAGCATG 
GCGCCAATTG 
GACGTTTGCA 
GAATAGCAGC 
GGT 1TTCATG 
TAAACAACGA 
ACTAGGAGAT 
CATCAGCCGG 
TGTCAACTGC 
TCTAGCCCTG 
GATGGCAATG 
TTTCXGGGGA 
TATTCAACAG 
TGATCTATAT 
TAATTTTCTT 
CAATATTGTT 
TGTGAATTAT 
TCTTGTTGGA 
CTTTTATGCC 
AATGGGCATG 
CCGCACCATA 
TGATGTGATC 



11 

I 

AAAAGGCAGA 
AAGTGGAGGC 
ATGATTACCT 
TTTAAAGACT 
TTTTCTTGCA 
TCCCGTTATT 
GTGCCTTTTT 
CTGCTTTTTT 
CCCTTTOCCA 

CCATACACTT 
GAACGGCGAC 
GCACGGAGAA 
ATGATAAAAA 
GAAGTGGATG 
GCTACCAAGG 
GGTTACTTTT 
TTTGATCGAG 
CTGGGAATCC 
ATAAT CATCG 
ATCTCTAGCA 
TACTTTGTCT 
ATCACTGAAG 
TTOCTGGTCa 



21 
1 

CCGTGTGAGG 
AGGAGCCTTC 
CCCAAATACT 
ATGAGATACG 
CCATGTXTGA 
TTCACTGGAA 
ACATTGGCTA 
CCTGTCTCTT 
TTCTCAGCCC 
TTGGAGTGAC 
ACATGTCTTA 
TGCTGCAAAC 
CAATGTTCCA 
GTGTTACCAC 
CTTTGOAAGA 
AGAGAATAGA 
TCTCTATTTA 
TTGGGAAAAC 
AATTTGATGT 
TCACATCCAT 
GTAAGTCCTC 
CCTCTGTOCT 
TCCTTGOAGA 



31 
I 

GGGCCTGTGG 
CTTACACriXr 
ATXTTTTGGA 
TCAGTATGTT 
GCTCATCATC 
AATGAACCIG 
TTTTATTGTG 
ATGGCTGACC 
AAAACATGGG 
TCTCATGGCT 
CTTCCTCAGG 
CATGGATATG 
GAAGGGGGAA 
TTCAGCATCA 
ATTAAGCAGG 
ATACTCCAAA 
CTGTGTTTGG 
GGATOCTGTC 
GAAGTTTTGG 
CAQAGGATTO 
CAATGTCATT 
GCTGATCCGA 
ACTGCAGTTC 
TAGCATACTC 



41 
I 

CCDCAGCGTG 
OCCATGAGrr 
TTTGOGTGGC 
GTACAGGTGA 
TTTGAAATCT 



agcaatatcc 
tttatgtatt 
atct tat cca 

CTTCTTTCTG 
AATGTGACTG 
ATCATAAGCA 
GTGCATAACA 
GGAAGTGAAA 
CAGCTTTTTC 
ACCTTCAAGQ 
AAAATTTTCA 
ACAAGAGGCA 
TCCCAACACA 
CTGATCACIC 
GTCCTGCTAT 
ATGAGTATGC 
AACTTCTATC 
TTCCTCTATT 



51 
I 

CTGTGGCCTC 
TCCTGATCGA 
UTTTCTTCAT 
TCTTCTCCGT 
TAGGAGTATT 
TGCTGATCCT 
GACTACTGCA 
TCTTCTGGAA 
TAGAACAGCT 
GATTTGGTGC 
ACACAGATAT 
AAAAGAAAAG 
AACCATCAGG 
ATCTTACTCT 
TGGAAACAGC 
GGAAATATTT 
TGGCTACCAT 
TTGAGATCAC 
TTTCCTTCAT 
TTACCAAGTT 
TAGCACAGAT 
CTTTAGAATA 
ACCGTTGGTT 

tggctcacaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 






ACAGGCACCA 
CA3TGGTTTC 
ATAAACAAAC 
AGQTGATACT 
ACAATACTCA 
CCAAGAAACT 
TGAGCCAAAC 
TTGAGATTGA 



GAQAAQCAAA 
AAAATTTAGA 
AAAATGCTAT 
ATGACCATGA 
GCAGAGAGCA 
AAAGGTGAAA 
ACGTAGGATT 
CTCATTAAAA 



TGGCACCTTG 
TATAAGAGGG 
GGTAGCATTT 
GT AG CATCA.G 
TCCCGTGTGG 
AATACACTGG 
TCCGTTTTAA 
TCAGAGACTG 



AACTTAAGCC 
OGGAAAAATQ 
TTGACCTTCA 
CCAGAACATG 
ATATGAGGCT 
AACTCTGGGG 
GGTTCACATG 



TACTACAGAC 
GAAOCAGGGC 
TAGCATACXC 
AGAGGGAGAA 
GGTGTAGAGG 
CAAGACATGT 
GAAAAGGTTA 



TGTTAGAGGC 1500 

CTGACATTTT 1560 
CTTOCCCCTC 
CTAACTCAAG 

CQGAGAGGAG 1740 

CT ATGOTAGC 1800 

TAGCTTTQCC 1860



1620 
1680



A100 Protein sequence
Gene name: putative G-protein coupled receptor
Unigene number: Hs.16085
Protein Accession #: NP_057410.1
Signal sequence: none found
Transmembrane domains: 5-27, 42-64, 76-98, 110-132, 144-166, 289-311, 342-364, 380-402, 424-446
Cellular Localization: plasma membrane



1 
I 

MSFLIDSSIM 
EILGVlibTSSS 
MYPFWKLGDP 
VTDTDILAI»E 
SENLTIjIQQE 
IFMAT1NIVF 
ITLTKFFYAI 
FYHRWFDVIF 



11 

I 

TTSOILFPGF 
RYFHWKMKIiC 
FPILSPKHGI 
RRLLQTMDM1 
VDlAIiEBIiSRQ 
DRVGKTDPVT 
SSSKSSNVTV 
LVSALSSILF 



21 
I 

GWTjFFMRQLF 
VILLI LVFMV 
LSlEQIiISRV 
ISKKKRMAMA 
LFLETADLYA 
HGIElTVHYIi 
LLIiAQ 1MOMY 
LYIiAHKQAPE 



31 
I 

XDYEIHQYW 
PFYIGYPIVS 
GVIGVTIiMAli 
RRTMFQKGEV 
TKERI EYSKT 
GIQFDVKFWS 
FV8SVLL2RM 
KQMAP 



41 

I 

QVIFSVTFAF 
NIHLuLHKQRL 
LSGFGAVNCP 
HHKFSGFWGM 
FRGFCYFNFLG 
QHISFIIiVGI 
SMPLEYRTII 



51 
I 

SCTHFELIIF 60 

IiFSOJ^JLTF 120 

YTYMSYFLRN 1B0 

IKSVTTEASG 240 

YFFSrYCVWK 300 

IIVTSIBGLI, 360 

TEVU3ELQFN 420 



A101 DNA SEQUENCE

Gene name: ESTs, Weakly similar to CGHU7L collagen alpha 1(III) chain precursor [H.sapiens]
Unigene number: Hs.19322

Probeset Accession #: AA06845B
Nucleic Acid Accession #: AA0B845B

Coding sequence: 862-1995 (underlined sequences correspond to start and stop codons)



1 

i 

GCCCXTGOAC 
CTGAAGAAAA 
GCGCGGGGCC 
CTGGGCCAGA 
CGGCTACTGC 
TQTGCCAGCC 
ACCTCACCCC 
CTCACCCAGG 
GCGCTCATTA 
GATTCCACCT 
AGOCCTTOGA 
GCCCAGGCAC 
GCCTGCCCCC 
ACATGGGCTG 
TGGACAGTGO 
GGTCCCATCT 
AGAGGGCGCG 
CAGGACBAGG 
GTAAGCGGGG 
CTGGOCAAGG 
GGCCTGCATG 
TTGCCCCACG 
GACAOCTCCC 
CTGGGGTCCT 
GAGAGOCCAC 
GGCAGGTCCC 
GGAGTACGCA 
GAACCAGGGG 
TCAGTGTGTG 
CCCGATGCGG 
ACACTGTCCC 

ccftceggag 

TGCTGCACCT 
GCCCXCCTAC 
ACCTCCTGGG 
AGGTGQACTG 
GGCTGGGOTC 
TGGGGGATCC 
GGTGACTTCA 



11 
I 

ACTGACATGG 
AGGAaCTGGA 
GCGACTGGTA 
GCAGAGCCAG 
CCAAGGTACA 
GGGCCCTGOC 
CGGTCTGGCA 
AQGTGACCGA 



21 
I 

ACXGAAGGAG 
GCAGGAGAAG 
CCAGCAGCAG 
CGCCGACTTr 
AGAGGTGGCC 
COCGTCCTCC 
GCAGCAGACC 
GAAGAQTGAG 



AAAGAAATAG 
CACGAGGGGA 
GCAGACCCTG 
GAGCAGCOTC 
GCGTGCACAC 
CAGAAOTGTC 
TTTTGTGTTG 



TCATCTAGTC 
GGGTGGGGGC 
AGTCCCGGAG 
GGCTGGrOOC 
GGGGCTCTCT 
GGTACCCCTC 
TCRGGGAAAG 
GGGCGGCTOC 
TGGCTGTAGC 

CTGAGGGACC 
TGCCTCCCAC 
TTGAGTCCCA 
AGQCACGTCA 
GCTCAOCCCC 
CTCGCTCAGC 
CTTGGGTGT C 
CTGGTGGGGG 
CAOGGCAACA 
TGGGGCGCAG 
GGTCAGTGCG 
ACAAGGCACC 
COCAflCTCCA 
BGTCTGCAGG 
CCTGAAGATG 
CAGGAAAGGG 
CAGCGCAGTG 
TGCCCACCAG 
TGGCATCTTT 
TCAGGAGACC 
GGCACCTCCG 
GTCCTCGCAG 
GAATTTAAAG 
CCTGGAGOCT 
CCTGQOCTCT 
TGTOATGACA 
OCCAGTTGAQ 
AtCAAGTTCC 



CTXGTGGGCC 
CCCATCGCAC 
TGGGGGCCTT 
CGCACCGAGC 
TGAGTOCGCA 
CATGAGTTAG 
GCACTGCCCA 
GACGCGGGTC 
TCGGACGGAC 
TGGCTGGGGA 
CTQGCTGCAG 
AGACCCTGGG 
CACAACATCC 
TAGGCAAAGC 
CTTTGCXCTC 
CAAGGAAAAC 
ACTCCCTCAG 
GGCCCTGCTC 
GCATCOATGG 
GGCGTCCGAT 
TGGGGGGCGC 
TGTCTCAGAG 
TGCTAACCTG 
GGIGTCOCAG 
GGAGTGGGCT 
TGCAGGTCCT 
GGTGGGCCAG 
GGCCTCCCCA 
ACTGGACTGG 
GCCGRCATAQ 
GAAAAACTGC 
TTTACAGCTT 
GCOCCGGCTG 
GCCCTAGGAC 
ATCCGCGAGG 
CCCGGAAATG 
AATCTGCCCC 



31 
I 

TAGAATGGAG 
GAGGTGCTGC 
CTGCAACGAG 
GGGGCTGCAG 
CS3GTGCCTGG 
TCCGGGCCCX: 
ATCCTCATGC 
CGCATCACOC 
GGCCTGAGOC 
GCGTGGGCCC 
CCA03CICTC 
OCTGCCGCCC 
GCTTQACTCC 
TAGTGOGCAG 
CGTCCCCCCG 
CGCCAGGCTG 
GAAGGGCAGC 
GGAAGTAGAT 
GCCCCAGQGA 
CGGATOGGCA 
GTGATGGCCT 
TGTGAOCCTG 
CTGTTTCCCC 
ACGCCCAGCC 
GAOAACCCCC 
CCCCXGCCCA 
AGCCCAACCT 
GTTCXGGAGC 
GCGGGGTCAG 
AGGGCCCCCT 
GAQGGGCCCT 
CCCACAGCAA 
GACAGGCCCA 



41 

I 

CACGAGGACA 
TQCAGGGTTT 
TGCAGGAGCG 
GGAGCCCCCG 
GGGAGCTGCT 
CCTGGCCTGC 
1GAAGGAGCA 
AGCTGGAGCA 
AGCAGGAGGG 
CCAGGGCCAG 
TGGCTGGAGA 
TTGCGAGATG 
GTTTJKGGCTC 
CTACTACTGG 
TTTCCA0CGG 
CACTTCCAAC 
TTCCCGCTCA 
GGAGGGGGTG 
TAGCGGTCGG 
CX3CC3G0GTGG 
TCCC5CCTCTT 
GCTCCCCAGG 
COACTCAGGA 
TGTCCCCAGG 
AGGGTACAGG 
GGCCCACTCC 
GGAGGGTCCC 
CCAGGGOCCC 



CGTGTCCAGG 
GGCAGGCAGC 
CCCCACAGAG 
AGXCAQCCCA 



GAGGGCCTGT 
TGGCAGCCAG 
OGTCTGCdT 
AAGCAGGAGA 
AGCTGGACCC 
CTTTCAGCCT 
GAAATCAGGC 
GCAGGOTCTA 

TGCCAGTAGC 
TCTCAGGATG 
AOAGGAACAC 
AACATCTCAQ 



GCCCCACAGC 
GGAGAAGCCC 
TGAGGGTGCC 
CAGAACAGTQ 
03GAGCTGAA 
TGGTGTTCCG 
TAGTGAGTGG 
GGTGGCTGGC 
TCAGTCTCCG 
GTGTGCAGGT 
TTGAAATGTG 
AOCCACAGCA 
CCGGGCGTGG 



51 




) 

CTGACATGGA 


60 


GGAGATGATG 


120 


CCAGCGCCGC 


180 


COCACTGGGG 


240 


GGCTGCAGCC 


300 


CCTGACGTCC 


360 


GAACC3GACTC 


420 


GGAGAAGTCG 


480 


GGGAOCTCTG 


540 


CCTGGCACTC 


600 


G0CCOGGCAG 


660 


GGCTCCCCAG 


720 


CTGGTTGYTG 


780 


CCGCTQTCAG 


840 


TGCCGOCCTG 


900 


AACGGGCAGC 


960 


ACCAGGGCAC 


1020


GGGACGGCCT 


1080 


ACTTCAGGTT 


1140 


GGGAGAGC7T 


1200 


GGCCGGGACG 


1260 


AGGGCGGCCA 


1320 


TTTCCAAGGC 


1380 


TTTCAGCTGG 


1440 


AGGAGGCTGG 


1500 


CGCTGGTGCT 


1560 


agtgtcacca 


1620 


CGATGCGGGG 


1680 


GCGCAGGGCC 


1740 


GCACTTTGGT 


1800


GTGGCAACTC 


1860 


CCACATTOCC 


1920 


GCATGCAGCT 


1980 


TCAGGCCTGG 


2040 


OCCAGCAOCC 


2100 


CCCGTCAGCA 


2160 


TGCXZATGOCC 


2220 


TCTGTGCCGG 


2280 


GCGGAAATGT 


2340 


TGCAAGGTGA 


2400 


CCCTGGAGAC 


2460 


AGAGGCACAT 


2520 


TGCAGGATOT 


2580 


AOVTACACGT 


2640 




2700 


GGCCXCAGGA 


2760 


TGGTTCAOGC 


2820 






CTGGAATCCC AGCACTTGAG GCCAGGAGTT CCAGAOCAGC CTGGGCAACG CAGTGAGAGA 2880 

CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AGAGATCCAG GTTTAAAAAT 2940 

TCATAAACAC CACAAGGAAA CAATACACTA TCLAGACCCAG CAGAAGCAAC AGATTGACTC 3000 

TAGACQCAGA TACTAQAATT ATCAGAGAGA ATATAAAGTA ACAGTGTTTT ATATATCTAA 3060
AGAAATAAAA GAGATTTCTG GAAACATGAA AAAAAA 






A102 DNA SEQUENCE
Gene name: TMPRSS3a
Unigene number: Hs.298241
Probeset Accession #: AI538613
Nucleic Acid Accession #: AB038157
Coding sequence: 202-1566 (underlined sequences correspond to start and stop codons)



1 
I 

ACCGGGCACC 
GGAAAGGGCT 
CCATCTACAT 
AGAGGTCCTG 
TCATTCCGAT 
GATGCTGTTG 
ATTGGGATCA 
TCAGGGAAGT 
GGAGTCTCGG 
AATGCCGTGC 
AAGGGTCACT 
GATAACCTCA 
CACCICTTQC 
TQTGCCTCTG 
AG CT CACGCA 
CTTCAGTTCC 
ACTGCTGCAC 
CTAGTTTCCC 
AGCAAGTACA 
CTCACGTTCA 
GATGGAAAAG 
TCCCCTQTCC 
GACGTGTACG 
GXGGACAGCT 
AAGTTAGTGG 

tacacccgtg 
acctgaagag 



11 
1 

GGACGGCTCG 
GTGTTTATGG 
TTTTOGGACT 
AAATAGTCAC 
CGCTTTTTGG 
CTGCACAGAT 
TTGCATTGAT 
ACAGATGTCG 
ATTGCAAAGA 
TCCAGGTGTT 
ACGCAAATGT 
GAGTGAGCTC 
CAGATGACAA 
GCCACGTGGT 
TOGTGGGTGG 
AGGGCTACCA 
ACTGTOTTTA 
TGTTGGACAA 
AGCCAAAQAG 
ATGAAATGAT 
TGTGCTGGAC 
TGAACCAOGC 
GTQQCAT CAT 



21 
I 

GGTACTTTCG 
GAAGCCAGTA 
CGGGAATTAT 
C ATQG QGGAA 
CCTTGATGAT 
CCTGTCACTG 
ATTAQCACTG 
CTCATCCTTT 
CGGGGAGGAC 
CACAGCTGCr 
TGCCTGTGCC 
OCTGGAGGGG 
GGTGACTGCA 
TACCTTGCAG 
AAACATGTCC 

TGACTTGTAC 
TCCAGCCCCA 
GCTGGGCAAT 
CCAGCCTGTG 
GTCAGGATGG 
GGCCGTCCCT 
CTCCCCCTCC 






GAAGCTGCrr 
AGTGCAGTGG 
TTGOCTCAGC 
TTGTATTTTT 
CCTCAAATGA 
AGGCCTAGCC 
AGGGCGGCCT 
ACGAGATAAG 
GCACCAGCCC 
CGAAACCCAC 
ATGACTCGTT 
TTGTCCTTTG 
AAAAAAAA 



GAGGHACCAG 
TCACCTCCTT 
GAAGGGGACA 
GACTCCCGTG 
AGTAGCAGGC 
TTTGTTTTTT 
CGAAATCCCT 
TTCCCCAGTA 

agtagagaca 
tcacgctcct 

TTCCCACTGG 
CAGTTATGTG 
AGAAGTGCAG 
CCTTTCTACT 



CTTTGGCATC 
CCTGGACTGG 
AGTAGCCAGC 
TAGQAACCTG 



AT TCCAAATA 



GTTTTTTTGA 
GCTCACTGCA 
GCTGGGACCA 
GGGTTTCACC 
TCAGCCTOCC 
TTCTGATCXT 
TGCATCTGGT 
ACCTCACGTG 
AACTGCAGTC 
TCCAAGACTT 
TTTCATGATT 
ATATGTTTOC 



31 

I 

TTCTTAATTA 
ACACTGTGGC 
GAGGTAGAGG 
AATGATCCGC 
TTGAAAATAA 
CTQCCATTGA 
GCCATTGGTC 
AAGTGTATCG 
GAGTAOCGCT 
TOGTGGAAOA 
CAACTGGGTT 
CAGTTCCGGG 
TTACACCACT 
TGCACAGCCT 
TTGCTCTCGC 
GGCTCTGTCA 
CTCCCCAAGT 
TCCCACTTGG 
GACATCGCCC 
TGCCTGCCCA 
GGGGCCAC3W3 
TTGATTTCCA 
ATGCTCTGOS 
CCCCTGGTGT 
GGCTGCGCAG 
ATCCAGGAGC 
TQAGTTCCTG 
CACACGAGCA 
CACCCTTCCA 
GGTGGAGTCT 
GCCTCOaCTT 
CAGGTGCC CG 
ATGTTGGCCA 
ACAGTGCTGG 
CACTAAQAAC 
TTTCTCTCCA 
CAAAGCCACC 
ACTGCACGTT 
ATTTTCAC&T 
TCTTTGTAGC 
TTCCCTCAAA 



41 

1 

GGTCATGCCC 
CTACTATCTC 
TGGAGGCGGA 
CTGCTGTTGA 
GTCCTGTTGC 
AGTTTTTTCC 
TGGGCATCCA 
AGCTGATAGC 
GTGTCCGGGT 
CCATGTGCTC 
TCCCAAGCTA 
AGGAGTTTGT 
CAGTATATGT 
GTGGTCATAG 
AGTGGGCCIG 
TCACGCCCCT 
CATGGACCAT 
TGGAGAAGAT 
TTATGAAGCT, 
ACTCTGAAGA 
AGGATGGAGC 
ACAASATCTG 
CGGGCTACCT 
GTCAAGAGAG 
AGGTGAACAA 
AGATGGAGAG 
AGGTGATGAA 
GACACCCTTG 
TCTGATTCCA 
CGCTCTGTTQ 

CCACCACACC 
GGCTGCTCTC 
GATTACAGGC 

aaaagaagga 
ggggtcttgc 
aacagccact 

TTCATCtfCTA 
GTGQGGAGGT 
ATTTGGTGCT 



51
1 

GTGTGAGCCA 
TTCOGTGGTG 
GCCGGATGTC 
AGCCCCCTTC 
ACCAGATGCA 
AATCATCGTC 
CTTOGACTGC 
TCGATGTGAC 
GGGTGGTCAG 
CGATGACTGG 
TGTGAGTTCA 
GTCCRTOGAT 
GAGGGAGGGA 
AAGGGGCTAC 
GCAGGCCAGC 
GTGGATCATC 
CCAGGTGGGT 
TGTCTACCAC 
GGCCGGGCCA 
GAACTTCCCC 
AGGIGAGGCC 
CAACCACAGG 
GACGGGTGGC 
GAGGCTGTGG 
GCCTGGGGTG 
AGACCTAAAA 
GACAGCCCGA 
GAGCTCTGAG 
GCACAACCTT 
COCAGGCTGG 
AGCGATTCTC 
CAACTAATTT 
AAACCCCTGA 
ATGGGCCACC 
GCAAdTGCA 
AAAATTCCTG 
CAGAAAAGAC 
GGGACCAGAA 
TAATCTAGGA 
TGACGTATTA 
AAAAAAAAAA 



A103 Protein sequence:
Gene name: TMPRSS3a
Unigene number: Hs.258241
Probeset Accession #: AI538613
Protein Accession #: BAB20077
Signal sequence: none found
Transmembrane domains: 43-65, 239-261
Tryp_SPc domain: 216-444
Cellular localization: plasma membrane



1 
I 

MBENDPPAVE 
IAIiAIGLGIH 
TAASWKTMCS 
VTALHHSVYV 
LCGGSVTTPL 
LGHDIALMKL 
AVPLISHKIC 
FGIGCAEVMK 



11 
I 

APFSFRSLFG 



DDWKGHYAWV 
REGCASGHW 
MI1TAAHCVY 
AGPI>TFNBMZ 
NHHUUYGGII 



21 
I 

LDDLKrSPVA 
SSFKCIELIA 
ACACLGPPOT 
TLQCTACGHR 
DIiYIiPKSWTI 
QPVCtJWSEB 
SPSMLCAGYL 
LDHIHKQMEa 



31 
1 

PDADAVAAQ1 
RCDGVSDCKD 



HGYSSRrVGG 
QVGLVSl^LDN 
KFPDGKVCWT 



DIiKT 



41 
I 

ZifililiPIiKPFP 
GEDEYRCVRV 
liBGOFEBEFV 
NMSLLeOWPW 
PAPSHLVEKI 
SOWOATEDGA 
SGGPLVCQER 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



51 
I 

ITVIGIIAM 
GGQNAVLOVP 
STDHXrliPDDK 
QASLQFQGXH 
VYHSKYKPKR 
GDASPVLNHA 
RLWKLVGATS 



60 
120 
180 
240 
300 
360 
420 






A104 DNA SEQUENCE

Gene name: Homo sapiens G protein-coupled receptor (HOT7T175)
Unigene number: Hs.208229
Probeset Accession #: AI819198
Nucleic Acid Accession #: NM_032551.1

Coding sequence: 1-1197 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

ATGCACACCQ TGGCTACGTC CGGACCCAAC GCQTCCTGGG GGGCACCGGC CAACGCCTCC 60

GGCTGCCCGG GCTGTGGCGC CAACGCCTCG GACGGCCCAG TCCCTTCGCC GCGGGCCGTG 120 

GACGCCTGGC TCGTGCCGCT CTTCTTCGCG GCGCTGATGC TGCTGGGCCT GGTGGGGAAC 180 

TCGCTGGTCA TCTAOGTCAT CTGCCGCCAC AAGCCGATGC GGACCGTGAC CAACTTCtAC 240 

ATCQCCAACC TGGCGGCCAC GGACGTGACC TTCCTCCtGT GCTGOGTCCC CTTCAOGGCC 300

CTG CTGT ACC CGCTGOCCGG CTGGGTGCTG GGCGACTTCA TGTGCAAGTT CGTCAACTAC 360 

ATCCAGCAGG TCTCGGTGCA GGCCACGTGT GCCACTCTGA CCGCCATGAG TGTGGACCGC 420 

TGGTACGTGA CGGTGTTCCC QTTGCGCGCC CTGCACCGCC GCACGCCCCG CCIGGCGCTG 480 

GCTGTCAGCC TCAGCAICTG GGTAGGCTCT GCGGCGGTGT CTGCGCGGGT GCTCGOCCTG 540

CACCGCCTGT CACCCGGGCC GCGCGCCTAC TGCAGTGAGG CCTTOCOCftG CCGCGCCCTG 600 

GAGCGCGCCT TCGCACTOTA CAACCTGCTG GCGCTGTACC TGCTGCCGCT GCTCGCCACC 660

TGCGCCTGCT ATGCGGCCAT GCTGCGCCAC CTGGGCCGGG TCGCCGTGCG CCOCGCGCCC 720 

GCCOATAGCG CCCTGCAGGG GCAGGTGCTG GCAGAGCGCG CAGGCGCCGT GCGGGCCAAG 780 

GTCTCGCGGC TGGTGGCGGC CGTGGTCCTG CTCTTCGCCG CCTGCTGGGG CCCCATCCAG 840 

CTGTTCCTGG TGCTGCAGGC GCTGGGCCCC GCGGGCTCCT GGCACCCACG CAGCTACGCC 900 

GCCTACGCGC TTAAGACCTG GGCTCACTGC ATGTCCTACA GCAACTCCGC GCTGAACCCG 960 

CTGCTCTACG CCTTCCTGGG CTCGCACTTC CGACAQGCCT TCCGCCGCGT CTGCCCCTGC 1020 

GCGCOGCGCC GCCCCCGCCG CCCCCGCCGG CCCGGACCCT CGGACCCCGC AGCCCCACAC 1080 

GCGGAGCTGC ACCGCCTGGG GTCCCACCCG GC5CCCCGCCA GGGCGCAGAA GCCAGGGAGC 1140 
AOTGGGCTGG CCGCGCGCGG GCtGTGCGTC CTGGGGGAGG ACAACGCCCC TCTCTGA 



A105 Protein sequence
Gene name: Homo sapiens G protein-coupled receptor (HOT7T175)
Unigene number: Hs.208229
Protein Accession #: AI819198
Signal sequence: none found
Pfam domains: 7tm_1 [59-323]
Transmembrane domains: 43-65, 86-108, 122-144, 159-181, 203-225, 260-282
Cellular localization:



1 11 21 31 41 51

HHTVATSGPM- ABWGAPAMAS GCPGCGANAS DGPVPSFRAV DAWI.VPWFA ALMLLGLVGN 60 

SLVIYVICRH KPMRTVTNFY IANLAATDVT PLLCCVPFTA IiLYPLPGWVIj GDFMCKFVNY 120 

IQQVSVQATC ATMAMSVDR HYVTVFPLRA LERRTPRIAlr AVBL&IHVGS AAVfiAPVLAL 180 

HRLSPGPRAY CSEAFP3RAI. ERAFAbVHM, AIiVUjPLLAT CACYAAMLRH LGKVAVEtPAP 240 

ADSALQGOVL AERAGAVRAK. VSRLVAAWL IiPAACWGPIQ LFLVLQALGP AGSWHFR8YA 300 

AYALKTKAHC MSYSHSALHP LTjTAFIjGSHF RQAFRKVCPC APHRPRRPRR PGPSDPAAPH 360 
AELHRLGSHP APARAQKPG3 SGLAARGLCV LC3EDNAPL 

A106 DNA SEQUENCE

Gene name: integrin, beta 8
Unigene number: Hs.52620
Probeset Accession #: AA479726
Nucleic Acid Accession #: NM_002214
Coding sequence: 680-2990 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

CCCftGAGCCG OTCCCCCTG TTGCXGGCAT CCCGAGCTTC CTCCCTTGCC AGCCAGGACG 60 

CTGOCGACTT GTC3TTGCCC GCTGCTCCGC AGAOGGGGCT GCAAAGCTGC AACTAATCGT 120 

GTTGGCCTCC CTGCCCACCT GTGGAAGCAA CIGCGCTGAT TGATGCGCCA CAGACTTTTT 180 

TOCOCTCGAC CTCGCCGQCG TACCCTCCCA CAGATCCAGC ATCACCCAGT GAATGTACA* 240 

TAGGGTGGTT TCCCCCCCAG CTTCGGGCTT TGTTTGGGTT TGATTGTGTT TGGCTCTTCG 300 

CTAAGCTGAr TTATGCAGCA GAAGCCCCAC CGGCTGGAGA GAAACAAAAG C1XOTCTCTT 360 

TGTGCCGGAG CAGGCTGCGG AGCCCTTGCA GAGCCCTCTC TCCAGTCGCC GCCGGGCCCT 420 

TGGCCGTCGA AGGAGGTGCT TCTCGCGGAG ACCGCGGGAC CCGCCQTGCC GAGCCGGGAG 480 

GGCC<nKRGGG GCCCTGAGAT GCCGAGOSGT GOCCGGGCCC QCXTACCTGC ACCGCTTGCT 540 

OCGAGCOGOG GGGTCCGCCT GCTAGGCCTG CGGAAAACGT CCTAGCGACA CtCGCOCGCG 600 

GGCCCCGAGG TCGCC CGGGA GGCCGAGCCC GCGTCCGGAA GGCAGCCAGG CGGCGGGCGC 660 

GGGGCGGGCT GTTTTGCATr ATGTGOGGCT CGGCCCTGGC TTTTTTTACC GClGCATTTG 720

TCTGCCTGCA AAACGAOCGG CGAGGTCCCG CCTOGTTCCT CTGGGCAGCC TGGGTGITTT 780 

CACITGTTCT TGGACTGGGC CAAGGTGAAG ACAATAGATG TGCATCTTCA AATGCAGCAT 840 

CCTGTGCCAG GTGCCTTGCG CTGGGTCCAG AATGTGGATG GTGTGTTCAA GAGGATTTCA 900

TTTCAGGTGG ATCAAGAAGX GAACGTTGTG ATATTGTTTC CAATTTAATA AGCAAAGGCT 960 

GCTCAGTTGA TTCAATAGAA TACCCATCTG TGCATGTTAT AATACCCACT GAAAATGAAA 1020 

TTAATACCCA GGTGACACCA GGAGAAGTGT CTATCCAQCT GCGTCCAGGA GCCOAAGCTA 1080 

ATTTTATGCT GAAAGTTCAT CCTCTGAAGA AATATCCTGT GGATCTTTAT TATCTTGTTQ 1140 

ATGTCTCAGC ATCAATGCAC AATAATATAG AAAAATtAAA TTCCGTIGGA AAOGATTTAT 1200 

CTAGAAAAAT GGCATTTTTC TCCCGTOACT TTCGTCTTGG ATTTQGCTCA TACGTTGATA 1260 

AAACAGTTTC ACCATACATT AGCATCCAOC CCGAAAGGAT TCATAATCAA TGCAGTOACT 1320 

ACAATTTAGA CTGCATGCCT CCCCATGGAT ACATCCATGT GCTGTCTTTG ACAGAGAACA 1380 

TCACTGAGTT TGAGAAAGCA GTTCATAGAC AGAAGATCTC TGGAAACATA SATACACCAG 1440 




AAGGAGGTTT 
AAGAGOCTAA 
GCAAATTGGC 
ACGTCAAATC 
ACAACAACAT 
TTCTACCCCT 
ATAATTTGGT 
ACCAGGTACA 
CAGGCATGGA 
TTACAATGAA 
GTTTTAATGA 
ACAGAQOACC 
GTGATGAGAA 
ACAAGQATCA 
ACAAAATTAA 
CATATCACCA 
GCTTCAGTGG 
TCAATTCAAA 
GCACCGATCC 
GCAAGGAAAA 
TTGATCAGTG 
CAGAATGTTT 
TCXTGATTGG 
ATAAAATTAA 
TGCAAAGTGT 
TGGAXATCAG 
TTAAACACTT 
ARCTTCACAGG 
GAAGACTQAC 
AAAATGTGTC 
ACTTATTTAO 
TACCTGTTAT 
CACTACAAGG 
TATATTCTAA 
ATGAATAAAT 
AAAGATTATT 
TTTCCAAGAT 
tttttacagg 
TACTGCCATA 
OAATGTTAA 



TGACGCCATG 
AAGATTGCTG 
AGGCATAGTG 
GACAACCATG 
TAATGTCATC 
CTTGCCAGGC 
AGTGGAAGCC 
AGG CATC TAT 
AGGATGCAGA 
AAAATGTGAT 
AACCGCTAAA 
TAAAGGAAAO 
TAAATGTCAT 
GCCTGTTTGC 
GCTTGGAAAA 
TGGAAATCTG 
CTGGOAAGGT 
GGGCCAAGTG 
CAGGAGCATC 
CTGGAATTGT 
CAAAACCTCA 
CTCCAGCCCA 
GTTGCTTAAA 
QTCCTCATCA 
TTGCACAAGA 
CAAATTAAAT 
AATGGGAAAC 
AGGAGACAAA 
AAGTATCCTC 
TTACTACTGT 
ATCAGCATAG 
CCCTACOCTT 
GTACAGTAAT 
GGTTGCCAAA 
GATTCGTGTT 
QCTTTTTAAA 
GQATACTAAT 
ATAAGTTTAT 
AAAAACTAAT 



CTTCAOGCAG 
CTGGTGATGA 
GTCCCGAATG 
GAACACCCCX 
TTTGCAGTTC 
ACCATTGCTG 
TATCAGAAGC 
TTTAACATTA 
AACGTGACOA 
OTCACAGGAG 
ATTCATATAC 
TGTGTAGATG 
TTTGATGAAG 
AGTGGTCGAG 
GTGTATGGAA 
TGTGCTGGGC 
GATCGATGCC 
TGCAGTGGAA 
GGGOGCTTCT 
ATGCAATGCC 
TGTGCTCTCA 
AGCTACTTGA 
GTCCTGATCA 
GATTACAGAG 
GCAGTCACCT 
GCTCATGAAA 
TGOAATTGTT 
TTGCTCACGG 
ATCATGATGT 
TTGAGACTAG 
AATGTAGATC 
CCCAGAGAGA 
CCCTGCACTG 
CACTTCAACA 
TCACTCTTTC 
GTGTGTAGTT 
TCCAGCATTC 
GTATGTCACA 
AATACAATGT 



CTGTCTGTQA 
CAGATCAGAC 
ACGGAAACTG 
CACTAGGCCA 
AAGGAAAACA 
GTGAAATAGA 
TCATTTCAGA 
CCGCCATCTG 
GCAATQATGA 
GAAAAAACTA 
ACAOAAACTG 
AAACTTTTCT 
AT CAGT ITT C 
GAGTTTGTGT 
AATAC7TGTGA 
ATGGAGAGTG 
AGTGCOCTTC 
GAGGCACGTG 
GTGAACACTG 
TTCACCCTCA 
TGGAACAACA 
GAATATTTTT 
TTAGACAGGT 
TGTCAGCCTC 
ACCGACQTGA 
CTTTCAGGTG 
AATAATTGCT 
TCATGCCAGT 
GACTCACATA 
TGTCGTTGTA 
CTCTGAAGAG 
ACRATGCTTGT 
GACATGTGAG 
GTTQGTGGTT 
AAGAGGTGAA 
TTATQCATGT 
TCTCCTCTTT 
GATGACTQGA 
CACTTTATCA 



AAGTCATATC 
GTCTCATCTC 
rCATCTOAAA 
ACTTTCAGAG 
ATTTCATTGG 
ATCAAAGGCT 
AGTGAAAGTT 
TCCAGATGGG 
AGTTCTTTTC 
TGCAATAATC 
CAGCTGTCAG 
AGATTCCAAG 
TTCTGAGAGT 
TTGTGGGAAA 
AAAGGATGAC 
TGAAGCAGGC 
AGCAGCAGCC 
TGTGTGTGGA 
CCCCACCTGT 
CAATTTGTCT 
GCATTATGTC 
CATCATTTTC 
GATACTACAA 
AAAAAAGGAT 
GAAGCCTGAA 
CAACTTCTAA 
CCTAAAGATT 
TGCTGGTTGT 
GCTGCTGACT 
GCACTTTACT 
CACTGATTAC 
GAGAGAGTTT 
GAAAAAAATA 
GAATAGACAA 
CAGATACAAC 
GTGTTTATGG 
GGCTTTATGT 
TTAATTAAGT 
GAATACTAGT 



GGATGGCGAA 
GCTCTTGATA 
AACAACGTCT 
AAATTAATAG 
TATAAGGATC 
GCAAACCTCA 
CAGQTGGAAA 
TCCAGAAAGC 
AATGTAACAG 
AAACCTATTG 
TGTGAGGACA 
TGTTTCCAGT 
TGCAAGTCAC 
TGTTCATGTC 
TTTTCTTGTC 
AGATGCCAAT 
CAGCACTGTG 
AGGTGTGAGT 



CAGGCTATAC 
GACCAAACTT 
ATAGTTACAT 
TGGAATAGTA 
AAGTTGATTC 
GAAATAAAAA 
AAAAAGATTT 
ATAATTTTAA 
ACACTCGAAC 
TTTTCAGAGA 
GTAATATATA 
ACTTTACAGG 
AGCATTGTGT 
ATCTGGCAAG 
GAACAfiCTAG 
CTTAATCTTA 
TTTGCTTATT 
TTTGTTTTCT 
GCTAAGTTAC 
TTTAAAAGCT 



1500 
1560 
1620
1680
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180
3240 
3300 
3360 
3420 
3480
3540 
3600 
3660 
3720 
3780






A107 Protein sequence:
Gene name: integrin, beta 8
Unigene number: Hs.52620
Probeset Accession #: AA479726
Protein Accession #: NP_002205.1
Signal sequence: 1-39
Transmembrane domains: 682-704
EGF domains: 552-584
INB domain: 54-469
Cellular localization: plasma membrane






1
! 

MCGBHjAFFT 
LGEECQKCVQ 
GEVSIQLMK3 
8RDFRLGFG3 

vHHQKisrarr 



11
I 

AAFVCLQNDR 



TIAGEIE3KA 
NVTSNDEVIiP 
CVI>ETFXiD3K 

CSGRGTCVCG 
CAIMEQQHYV 
DYRVfiASKKD 



AEANFMLKVH 

DTPEGGPDAM 
HNVXVKSTTM 
ANLfcJNIiWBA 
HVTVTMKKCD 



21 
I 

RGPAGFLWAA 
ERCDIVSNLI 
PLKKYPVDLY 
SXHPERIHNQ 
LQAAVCE6HI 



YQKLISEVKV 
VTGGKN*YAI I 



FSCPYHHGNL 
HCECTDPRSI 
DQTSECF86P 
KLILQSVCTR 



GRFCKHCPTC 
SYLRIFP1IF 
AVTYRHEKPB 



A108 DNA sequence
Gene name:
Unigene number:
Probeset Accession #:
Nucleic Acid Accession #:
Coding sequence:



31 
1 

WVPSLVLGLG 
EKGCSVDSIE 
yiiVDVSASMH 
CSDYNU3CMP 
GWRKEAKRLL 
KLIDJ1NIKVI 
QVHMQfVGGiy 
KPIGENBTAK 
CK3HKDQPVC 
ECQCPBGWEG 
YTACKENWNC 
IVTFLIGLLK 
EIKMDISKLN 



ESTs

Hs. 128899 

AA983251 

AA983251 

1-1749 (underlined sequences correspond to start



41 
1 


51 




QQEDUECASg 


1 

HAASCARCLA 


60 


yPSVHVIIPT 


ENEH3TQVTP 


120 


NNIEKOTSVG 


NDL5RKM7AFF 


180 


PHGYIHVIiSIj 


TEN1TEFEKA 


240 


I>VMTDQTSHL 


ALDSKLAGrv 


300 


FAVQGKQFHW 


YKHTJiPLIiPG 


360 


FNITAICPDG 


SRKFQKEGCR 


420 


IHIHRNC3C0 


CBDHRGPKGK 


480 


SGRGVCVCGK 


CSCHKZKLGK 


540 


DRCQCPSAAA 


QHCVNSKGQV 


600 


MQCLHPBHIiS 


GAILDQCKTS 


660 


VLriRQVIXjQ 


WN8HKIKSSS 


720 



and stop codons)



1 11 21 31 41 51 

1*1 I I I I 

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCOCGGA 60 

GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCOCGAGG GCGAGGCAGC 120 

GACCQQQAGA GGGAOAGCCG GCOGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180 

GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG aZQCXSCGCCT GGATCCGCGC CCAGCAGCAG 240 




CCGCGGCCGC CGCCAOCTGG GCAGGCTCCC GGGACTGCGG CTGGGGGCGC GCAGGACCCT 300 

CGCCTGCGTC CTGGACGTTC CCGGGGOAGG GTCCGQTTac CAGTGAAACC TCCAGAGGCT 360 

TCCSGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420 

ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 

GCrCCTGOAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 

CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 

CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGOGCTGT CCGCGCCCAG GGTGGTGCCA 660 

TGTGGGGCGC TDGCCGCTCG TCCGTCTCCT CATCCTGGAA CDCCGCTTCG CTCCTGCAGC 720 

TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780 

TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGAOGGC 840

GGCGACGCCA CCAXCTGCra CSGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960 

□GGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CAT GTCT TAG GGGTACCCAA 1020 

GGAGAOGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 

TCCCCGAAAG GAAGGCAGCT CCTCAGQGCT TTCCCGGQGC TGCTGCCCCG TGCCAGAOGC 1140 

OGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 

ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 

CTGGTGGCAa CCTGTTGCTG CAGATOTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 

GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCOCATGA TCCCCAOTOC CAGCACCTCC 1380 

CGGGGGTCQT CCTCACGCCA GTCCAGCACA aCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 

□GSGOCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGOA AGGGACCATG 1500

AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 

CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 

CACGACTCTG TGCCCATQAC AGCTGTGCCA CCTTTCATGQ ACOGCCTGCA GCCTGGCTAC 1680 

AGGCAGATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 

ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1800

GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATG 1860

GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 

CATCCAGGOT ATCATTGATT TATGATGGAA AACCOGCCTC AGCTGGAGAT GACTGTGATG 1960

TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TOGTTGACGA 2040

AAGAATTTTA TAAACTGATA AATTAAQGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 

TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 

tttttttttt ttttttttaa tcagacaggg tcttgctctg TTGCCCAGGC TGGAGTGCAG 2220 

TGGTGCT3ATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTOCTGCCTC 2280 

AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 

TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 

TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 

TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 

ATTCTAAAAG QAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 

GOCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT QTQCTTTTGG 2640 

GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 

TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCOCCTTC 2820 

TTGATTOXAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880

TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTOAG TTACTTTTAA TTTTGCCTTG 2940 

TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 

GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAOTA TTTCTGCCTA GATCTGATAA 3060

AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAOTA 3120 

TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AAOCAGATAC GATTTACTTT 3180 

GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 

GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 

ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 

C!AAATTCAAG TGAAT X T ATT TGTGTGTTGT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 

ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 

GTATQGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540

TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600

TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



A109 Protein sequence:
Gene name: ESTs
Unigene number: Hs. 12 fi 8 99
Protein Accession #: none found
Signal sequence: 1-11
Transmembrane domains: 402-424
Cellular Localization: not determined



1 11 21 31 41 51 

| | | | | |

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRSRESRPEA AGLIjHDRAAA 60 

GEAEKGNHGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGR6RGR VRLPVKPPEA 120 

SGRQpRGPSD CIPRFPSA9A THKAVPKGTG PPAEDGDGLG APGPRARRRR I>LGVAAEGS6 180 

PRGKHHGTV3 DEARG8PGPR LLGORPAIiSG DAliSAPRWP CGALAARPSP HPGTPLRfiCS 240 

CCWIiRCnRRG RGPSGEYCHG WLUAQGWRI GPQCPKRFDG GDATICCG9C AliRYCCSSAE 300 

ARLDQGGCHN DRQQGAGEPG RADKDGPREIi GRASCLRGTQ GDGEGAPPPV RAWQRCfiPEG 360 

SPKGRQIjl»RA FPGIaLPRARS RGPPSSPRGG PSPLQRPAI4P tYVFFI*IVG3 VFVAFI3XGS 420 

I>VAACOCRCX» aPKQDPQQSR APGGHRIWET IPMIPSASTS RGSSSRQS6T AASSSSSANS 480 

GARAPPTRSQ TNCCLPEGTM NNVYVNHPTK PSVLJjJOQOAT QTVPHQGOYIj HPPYVGYTVQ 540 
HD8VPMTAVP PPMDGLQPGV RQIQSPFPHT NBEQKHYPAV TV 

A110 DNA SEQUENCE:

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]






Unigene number: Hs. 100686

Probeset Accession #: AA487468
Nucleic Acid Accession #: AA487468

Coding sequence: 55-555 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

| | | | | |

CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATC 60 

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120 

GCAATAAAAA AGGAAAAGAG GCCICCTCAG ACACTCTCAA GAGGATGGGG AGATGACATC 180

ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240

ATGGTTATTC ATGA.CCTOGA GGATTGTCAA TACTCTCAAG CACTAAAOAA AGTATTTGCC 300 

CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360

GAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATGTTTGTA 420

GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540

CAGTCAGAGC TATAAGAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600 

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATASA TTTTGTAATA 660 

TTACTATTTA GTTTTTTTAA TGTGTTTGCA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 
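
The coding-sequence coordinates given for A110 (55-555) can be read as 1-based, inclusive positions counted along the printed sequence, with the groups of ten bases joined and the running position counter at the end of each line ignored. The following is a minimal Python sketch of that reading, not part of the original listing; the function names `parse_listing` and `subsequence` are illustrative assumptions. Applied to the first printed line of A110, positions 55-57 give the start codon.

```python
def parse_listing(text: str) -> str:
    """Join the 10-base groups of a printed listing into one string.

    Purely numeric tokens (the running position counters at the end of
    each line) are skipped; every other token is kept as printed.
    """
    groups = []
    for line in text.splitlines():
        for token in line.split():
            if not token.isdigit():
                groups.append(token.upper())
    return "".join(groups)


def subsequence(seq: str, start: int, end: int) -> str:
    """Return positions start..end of seq, 1-based and inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# First printed line of the A110 DNA sequence (positions 1-60).
first_line = (
    "CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG "
    "AGTCTCTAGA AGACATGATC 60"
)
seq = parse_listing(first_line)
start_codon = subsequence(seq, 55, 57)
print(len(seq), start_codon)  # 60 ATG
```

With the full sequence loaded the same call, `subsequence(seq, 55, 555)`, would return the 501-base coding region, which corresponds to 166 codons plus a stop codon.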



A111 Protein sequence:
Gene name: Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]
Unigene number: Hs. 100686
Probeset Accession #: AA487468
Protein Accession #: none found
Signal sequence: 1-23
Transmembrane domains: none found
Cellular Localization: secreted



1 11 21 31 41 51 

| | | | | |

MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEE GLFYAQKSKK 60
PLMVIHBLED CQYSQALKKV FAQNEEIQEM AQNKFIMLNL MHETTDKNLS PDGQYVPRIM 120
FVDPSLTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL

A112 DNA SEQUENCE

Gene name: Homo sapiens type II membrane serine protease mRNA

Unigene number: Hs. 63325

Probeset Accession #: AA411502

Nucleic Acid Accession #: NM_016425

Coding sequence: 1-1314 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

| | | | | |

ATGTTACASG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 60 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120

CTBAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT A03ACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960

GCCAGCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTflA 
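
Because the coding sequence of A112 is stated to begin at position 1, the corresponding protein can be checked by reading the bases in triplets. The sketch below is an illustration under stated assumptions rather than part of the listing: it assumes the standard genetic code, the helper name `translate` is hypothetical, and any codon containing a character other than A, C, G or T is reported as `X` instead of being guessed. The input is the first 30 bases of A112 as printed, where the third codon reads `CAS`.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon.

    Codons that are not plain A/C/G/T triplets become 'X'; a stop codon
    becomes '*'. Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# Positions 1-30 of the A112 DNA sequence, as printed.
cds_start = "ATGTTACASGATCCTGACAGTGATCAACCT"
print(translate(cds_start))  # MLXDPDSDQP
```

Nine of the ten residues agree with the start of the A113 protein sequence; the unresolved codon is left as `X` so that the uncertainty stays visible.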



A113 Protein sequence:
Gene name: Homo sapiens type II membrane serine protease mRNA
Unigene number: Hs. 63325
Probeset Accession #: AA411502
Protein Accession #: NP_057509
Signal sequence: none found
Transmembrane domains: 31-53
LDLa domain: 54-94
Tryp_SPc domain: 204-429
Cellular localization: plasma membrane/ER






1          11         21         31         41         51
|          |          |          |          |          |
MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIVV VLIKVILDKY 60
YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVLDSAT 120
GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDVVEITEN SQELRMRNSS 180
GPCLSGSLVS LHCLACGKSL KTPRVVGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240
LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300
TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 360
AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 420
KVSAYLNWIY NVWKAEL



A114 DNA SEQUENCE:
Gene name: TTK protein kinase
Unigene number: Hs. 169840
Probeset Accession #: M86699
Nucleic Acid Accession #: NM_003318
Coding sequence: 1026-3551 (underlined sequences correspond to start and stop codons)



1 
I 

GGAATTCCTT 
CAATGGCACA 
CTCAaCXTTCT 
CTTTTCTATT 
ACCTCAGGTG 
TaCCTQGCTG 
AGGCTGQAGT 
GATCCTCCCA 
CTAATTTTTG 
CTCCTGGACT 
AGTCACCATG 
GGTOCTCACA 
AGGAAGATGA 
GGAIGATGAT 
TTTGGTTTAA 
ATGTAAGACT 
TGCAGTTTXC 
CCAT AATGA A 
AACTAAGCTT 
TTATGATGAT 
ACAGTGTTCC 
ITGAAGCGCT 
GATTTGCTQA 
CCAGAaCAAA 
CACAAGGTAA 
TACCACTAGA 
TTTCAGAGGA 
TTTOCGGTTC 
CTACTAAAGC 
ACOGGAATTC 
CAGTTAACCT 
GTTTTATOAA 
AACCAAGTK3G 
TCAAGGAACC 
CCCTGAAGAA 
AAGAACCAGA 
GTATTAACCA 
AAGTTAATAC 
AGTCACCfl.CC 
GCAGCAATAC 

TTCCACcrac 

AGCATCAAAT 
ATGAATGCAT 
GXTCAAQCAA 
TGAACTTAGA 
TGAATAAACT 
ACCAGTACAT 
AGAAAAAATC 
TTCACACAAT 
TAGTTGATGG 
CAACAAGTGT 
TCAAAOATAT 
GTGATGTTTG 
AGCAGATAAT 
AATTTCCCGA 
ACCCAAAACA 
CTCATCCAGT 
AACTXGTTGG 
ACTATABTGG 
GAAAAAAATG 
GTTATACTCT 
ATCAGCAAAA 



11 
I 

TTTTTTTTTT 
ATCTCAGCTT 
CAAGTAOCTG 
TAGTAGAGAT 
ATCCACTTGC 
AtTCTTTTTT 
GCAGTGGTGC 
CTTTAGCCTC 
TATTTTTATT 
CAAGGGATCC 
CCTGACCTTA 
ACAAATCTGC 
AGTGATTCAT 
CCTAAATCCA 
ATTAATTATC 
GTACAGACTT 
TGTAGAAATG 
CAAAGTGAGA 
GAATAAAATT 
GGCAAACRAC 
GCTAAGTGAT 
TCGCCCAGAT 
ATTAAAAGCT 
CTGCAAQAAA 
TGTCAAAAAA 
AATGCTGGAA 
GGAAAAGAAG 
ACTTGOGCAT 
CAGGTTTTTA 
ATTGAGACAA 
TCTAAAIAGC 
AAGACAAACC 
AAATGATTCC 
TCTGGTGTCA 
TAAAACGGAA 
GGTTCCAGAO 
GAATCCTGCT 
AGAGCAGAAA 
AATATCAACA 
CTTGGATGAT 
TTGTCAGTTG 
ACTTGCCACT 
TTCGGTTAAA 
GGTATTTCAG 
AGAAGCAGAT 
ACAACAACAC 
CTACATGGTA 
CATTGATCCA 
CCATCAACAT 
AATGCTAAAG 
TGTTAAAGAT 
GTCTTCCTCC 
GTCCTTAGGA 
TAATCAGATT 
TATTCCAGAG 
GAGGATATCC 
TAACCAAAM 
TCTGAATTCT 
TGGTOAAAGT 
_ATTTGCAGTT 
TGAATCCCTG 
AAAATTCAGT 



21 
1 

TTTGAGATOG 
ACTGCAACCT 
GGATTACAGG 
GGQGTTTCAC 
CTTGGCCTCC 
TGTXGTTGQA 
GATCTTGGCT 
CTGASTAQCT 
AGAGACAGGG 
GCCTGCCTCC 
TAATTCTTAA 
ATTAG3CGGT 
GATTTAGAAA 
GATACAGTAA 
TAAATATCTA 
CCTAGAAAAC 
GAATCCGAGG 
GACAlTAAAA 
TCTGCTGATA 
CCAGAQGACT 
aCTCTTTTAA 
AAATAXGGCC 
ATTCAAGAGC 
TTTGCTTTTG 
AGTAAACAAC 
ATTGCCCTGC 
AATXTATCAG 
TTACAGAATA 



ACTAACAAAA 
CGAOATTGTG 
TCTAGATCAQ 
2GTGAATTAA 
GATGAAAAGA 
TCRAGTCTTC 
AGTAACCAGA 
GCATCTTCAA 
CATACCACXT 
TCTAAATGGT 
TACATOAGCT 
TCAACACCTT 
CCACTTCAAA 
QOAAGAATTT 
GTGTTAAATG 
AACCAAACTC 
AGTGATAAGA 
ATGGAQTGTG 
TGGGAACGCA 
GGCATTGXTC 
CTAATTGATT 
TCTCAGG1TG 
AGAGAGAATG 
TGTATTTTGX 
TCTAAATTAC 
AAAGATCTTC 
A1TCCTGAGC 
OCCAAGGGAA 
CCTAACTCCA 
CATAATTCTT 
ATTCGTAATG 
TOGAAATCTA 
GAGATTATCT 



31 
I 

AGTTTCACTC 
CCGCCTCCCG 
CATGTGCCAC 
CATGTTGGTC 
CAAAGTGCTA 
TTTTTGAAAC 
CACTATAACC 
GTGATTACAG 
1TTCACCATG 
ACTTCCCAAA 
GTCATTTTTT 
ACAATAATCC 
GGGGAAGTAG 
AAATGGGGTA 
AAAACATTTT 
AGTTTGGGTT 
ATTTAAGTGG 
ATAAGTTTAA 
CTACA3ATAA 
GGTTGAGTTT 
AtTAAATTGAT 
AAAATGAGAG 
C3M3ATQATGC 
TTCATATATC 
TTCTTCAAAA 
GGAATTTAAA 
CATCTACQGT 
OGAACAACAG 
ACATOCCAOC 
CTAAACAGTC 
ATGTGAAGAC 
AATGCCGAGA 
GAAATTTAAA 
GTTCTGAACT 

tagctaaatt 

AACflGTCJGCA 
ATCACTGGCA 
TTGAGCAACC 

ttoacccaaa 
gttttagaac 
atggccaacc 
atttacaggt 
at tgcat at t 
aaaagaaaca 
ttgatagtta 
tcatccgact 
gaaatattga 

AGAGTTACTG 
ACAQTT3ATCT 
TTGGGATTGC 
aCACAGTTAA 
GGAAATCTAA 
ACTATATGAC 
ATGCCATAAT 
AAGATGTGTT 
TCCTGGCTCA 
CCACTQAAGA 
TTTTGAAAGC 
CATCCTCCAA 
TCAGATAGGA 
CATTTGAAGA 
TTAAAAGAAA 



41 

I 

TTGTTQGCCA 
GGTTCAAGCG 
GACCCCTOGC 
AGGCTGGTCT 
GGATTACAQC 
AGGQTCTCCC 
TCCACCTCCT 
QCOTGCACCA 
TTGGCGAGGC 
GTCCCGAGAT 
CTGGTCCATT 
TTAACTTCAT 
TAAGOCCACT 
TGGGAAGGTA 
TGGATACATT 
CCATCTTTTC 
CAGAGAATTG 
AAATGAAGAC 
CTGGGGAACT 
QTTGCTCAAA 
TGGTCGTTAC 
TTTTGCTAGA 
ACGTGACTAC 



AGCTGTAGAA 
CCTCdAAAAA 
ATTAACTGCC 
TTGTGATTCC 
ACAAGATGCA 
ATGCCCATTT 
AGATGATTCA 
TTTGGTTGTG 
GTCTGTTCAA 
TATTATTACT 
AGAAGAAACT 
AGCTAAGAGA 
GATTCCGGAG 
TGTCTTTTCA 
ATCTATTTGT 
TCCAGTTGTA 
TGCCIGTTTC 
TTTAGCATCr 
AAAGCAGATA 
QATATATGCT 
CCGGAACGAA 
TTATQATTAT 
TCTTAATAGT 
GAAAAATATG 
TAAACCAGCT 
AAACCAAATG 
TTATATGCCA 
GTCAAAGATA 
TTACGGGAAA 
TGATCCTAAT 
AAAGTGTlGT 
TDCATATGTT 
AATGAAATAT 
TOCTAAAACT 
GACTTTTGAA 
GGTATAAAAT 
CAACATCACT 

actgtaaaaa 



51 

I 

ggctggaotg 
attctcctgc 
taactaattt 
tgaactcctg 

CGTGAAACTG 
TTGGTCGCCC 
GGTTTCAAGT 
OCACACCGGG 
TGTTCTCAAA 
TACAGGTGTG 
TCTTCCTTAG 
GATTCACAAA 
GCACACTCCT 
GAATACAAAA 
GTTOATGTGA 
ATTTCCCCAG 
ACAATTGATT 
CTjCACTGATG 
GTTAACCAAA 
GTAGAGAAAA 
AGTCAAGCAA 
ATTCAAGTBA 
TTTCAAATGG 
TTTGAACTOT 
CGfTGGAGCAG 
AAGCAGCTGC 
CAAGAATCAT 
AGAGGACAGA 
GAAATAGGTT 
GGAAGAGTGC 
GTTCJTACCTT 
CCTGGATCTA 
AATAGTCATT 
GATTCAATAA 
AAAGAGTATC 
AAGTCAGAGT 
TTAGCCCGAA 
GTTTCAAAAC 
AAGACACCAA 
AAGAATGACT 
CAGCAGCAAG 
TCTTCAGCAA 
GGAAGIGGAG 
ATAAAATATG 
ATAC3CTTATT 
GAAATC3VCQG 
TGGCTTAAAA 
TTAGAGGCA3 
AACTTTCTGA 
CAACCAGATA 
CCAGAAGCAA 
AGCGCCAAAA 
ACACCATTTC 
CATGAAATTG 
TTAAAAAGGG 
CAAATTCAAA 
GTTCTCGGCC 
TTATATGAAC 
AAAAAAAGGG 
ATATTGGACT 
CTOAAGTGTT 
TAGCAACCAC 



60 
120 
ISO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1S00 
1S60 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
25B0 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3 GOO 
3660 
3720 






TTATGGCACT GTATATATTG TAGACTTGTT TTCTCTGTTT TATGCTCTTG TGTAATCTAC 3780
TTGACATCAT TTTACTCTTG GAATAGTGGG TGGATAGCAA GTATATTCTA AAAAACTTTG 3840 
TAAATAAAGT TTTGTGGCTA AAATGA 






A115 Protein sequence:
Gene name: TTK protein kinase
Unigene number: Hs. 169840
Probeset Accession #: M86699
Protein Accession #: NP_003309
Signal sequence: none found
Transmembrane domains: none found
Protein Kinase Domain: 510-775
Cellular localization: cytoplasmic and nuclear



l 
I 

MNXVRDIKNK 
VBhBDRLUStK 
ANCKKFAFVH 
EKKKKNIjSAS 
NSLRQTNKTK 



PEVPESNQKQ 
PPISTSKWPD 
QIUVTPLQNL 
ItEBADHQTLD 
KSIDPWBRKS 
9WKDSQVGT 
XINQXSKLHA 
PVNQKAKGTT 
X 

OVARIAN 



11 
I 

FKNEDIiTDEL 
LIGRYSQAIE 
1 9FAQ FEL SQ 
TVLTAQESFS 
QSCPFGKVFV 
LKSVOWSHPEC 
HQAK&KSECI 
FKSICKTP3S 
QVLASGSANE 
SYRNEIAYLiN 
■yWKNMLEAVH 
VNYMPPEAIK 
IHJPNHEIEF 
EEMKYVtiGQLi 



21 
I 

BLNKIBADTT 
ALPPDKYGQN 
GNVKKSKQLli 
GSLGHLQNHN 
ULLNfiPDCDV 



NTLDDYMSCP 
CISVKGRIYS 
XLQQHBDKII 
TXHQHGIVES 



VGLNSPNSIIi 



31 
1 

I3NSGTVNQJM 
ESFARIOVRF 
QKAVERGAVP 
NSCDKRGQTT 
KTDDSWPCF 
EDIITDSITL 
WQIFKLARKV 
RTPWKNDFP 
ZIiKQIGSGGS 
RliYDYFTTDQ 
DDKPANFLXV 
SKSKXSPKSD 
VLKCCLKRDP 
KAAKTDYBHY 



41 
1 

MMANNFBDWL 
AELKAIQEPD 
liEMLEIALRN 
KARFIfYGENM 
MKRQT5RSEC 



NTEQKHTTFE 
PACOLSTFYG 
SKVFOVLNBK 
YIYMVMECGN 
DGMLKLIDFG 
VWSLGCILYY 
KQRISIPKL»Ii 



51 
I 

SLLLKLBKtfS 
DARDYFQMAR 
UJl£KKQl*bS 
PPODABIG7R 
HDIiWPGSKP 
KLEETKEYQE 
QPVFSVSKQS 
QPACFQQQQH 
JCQIYAIKYVJI 
IDLNSWLKKK 
IAKQHQPDrr 
MTYGKTPFQQ 
AHPYVQIOTH 



60 
120 
160 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
640 



A116 DNA SEQUENCE
Gene name: G protein-coupled receptor 39
Unigene number: Hs. 85339
Probeset Accession #: AA349693
Nucleic Acid Accession #: NM_001508
Coding sequence: 1-1362 (underlined sequences correspond to start and stop codons)



1 
I 

ATGGC TTCA C 
CGOGAGTXTG 
TTOQTGATGO 
AAAGGATACT 



ACGTCCAGCT 
GCTA£33CTGC 
TTCAGGTACA 
GTCACCTCCG 
GTGAACGTGC 
CAGCCCGAGA 
CAGTCCACCA 
ATGTGCTGGA 
A0G0GGCCTC 
ACCATCATCr 
ATTCG3AOGA 
GCGTACATCA 
COGCTCCTGT 
TGCCGOCTGT 
ACCACOGACA 
TCTGCAAGGA 

tctaagtccc 

AATTCTGCTG 



11 

1 

CCAGCCTCCC 
AGGTGGCCAC 

TGCAGAAGGA 
TCATCGGCAT 
ACACCCTGTC 
TGCACGTGCT 
AGGCTGTGTC 



CCAGCCACCG 
C CT CCAATAT 



ACATGATGCA 
CQCAGCTGAG 
TCCTGAGGCT 
TCAXGGCTGC 
TCCTCCTCCC 
ACACGGTGTC 
OGCTGCAGCA 
GCGCCCGCTT 

gaactgagaa 
agtoatsgag 

CAGAGAATGG 



21 
I 

GGGCAGTGAC 
CTGGATCAAA 
GAACAGCGCC 
GGIGACAGAC 
GCCCATGGAG 
CTGCAAGCTG 
GACACTCAGC 
GGGACCXTGC 

actgcccttg 

GGGTCTCACT 
GTOCATCTGT 
CTTCGTGGTC 
GGTGCTCATG 
GAAGTCCGAG 
GATTGTTGTG 
GGCCAAACCC 
CXTCTOGGAG 
CTCGCAGCAG 



TGTGCAGCGC 
GATTTTCTTA 
TCTCGAGTCA 
TTTTCAGGAG 



31 
1 

TGCrCCCAAA 
ATCACCCTTA 
ACCATTCGGG 
CA.CATGGTGA 
TTCTACAGCA 
CACA£7TTTCC 
TTTGAGCGCT 
CAGGTGAAGC 
CTGTTTGCCA 
TGCAACCGCT 
ACCAACCTCT 
TACCTCGTGG 
AAAAGCCAGA 
AGCGAAGAGA 
ACATTGGCCG 
AAGCACGACT 
ACGTTTTTCT 
TTTCGGCGGG 
GAGAAGCGOC 
OOGTTGCTCT 
AQCACri'TTC 
CTAGAGCCCA 
CATGAAGTTT 



41 
I 

TCATTGATCA 
TTCTGGTGTA 



TCATCTGGAA 
TCTTCGAGGC 
ACATCGCCAT 
TGCTGATXGG 
TGGGTACTGA 
CCAGCACCCG 
CCAGCCGCTG 
TCCTGCTCTC 
AGGGCTCGCT 



51 
I 

CRGTCATGO'C 
OCTGATCATC 
GCTGCAGAAG 
CTCGGACATC 
TCCCCTGACC 
CTGCAGCTAC 
CTGTCACCCC 
CTTCGTCTGG 
GTAOOOCCTG 
CCACCACGAG 
GACGGTGTTC 
CGTAGCCTTC 
GGCCGGGGGC 



TATGCTGGAT 
GOACGAGGTC 
ACCTCAGCTC 
TGTTCGTGCA 
TGCGCGTACA 
TCGCGTCCCG 
AGAGCGAGGC 
ACTCAGGCGC 
GA 



GCCCAACCAG 
CTACTTOCGG 
GGTCATCAAC 



TGCGCACTCC 
GCGCCAGTCC 
CGAGCCCCAG 
GAAACCAGCC 



60 
120 
180 
240 
300 
360 
420 
4D0 
S40 
600 
660 
720 
7B0 
840 
900 
960 
1020 
10B0 
1140 
1200 
1260 
1320 



A117 Protein sequence:
Gene name: G protein-coupled receptor 39
Unigene number: Hs. 85339
Probeset Accession #: AA349893
Protein Accession #: NM_001508, NP_001409
Signal sequence: none found
Pfam domains: 7tm_1 [72-172, 224-344]
Transmembrane domains: 32-54, 68-90, 111-133, 151-173, 221-243, 280-301, 320-342
Cellular Localization: plasma membrane



1 11 21 31 41 51

| | | | | |

KASPSliPGSD CSQ11DHSHV PSFEVATWIK ITLILVYLIJ FVMGLUGNSA TUtVTQVLQK 






RGYIiQKEVTD HMVSLACSDI LVFLXGMPME FYSIIWNPLT TSSYTLSCKL HTFUEACSY 120 

ATLLHVIjTLS FERYIAICHP FRYKAVSGPC QVKZ1I/I6FVW VTSAliVAI»PI> LFAMGTBYPL 180 

VNVPSHRGL.T CCTRS5TRHHE QPBTBNM3IC TNLSSRWTVF QSSIFGAFW VLWLL6VAF 240 

KCHNHMOVLM KSQRGSLAGG TRPPQIJIKSB SEESRTARRQ TIIFLRLIW TLAVCHMPWQ 300 

IRRIMAAAKP KHDWTRSYFR AYMILLPPSE TPPYLSSVESl PLL.YTVSSQQ FRRVFVQVLC 3 60 

CitLSLQHANH EKRIiRVHAEB TTDSARFVQR PLLFASRRQS SARRTEKIFL STFQSEAEPQ 420 
9KSQSLSLBS LEPNSGAKPA NSAABNGFOE HSV 



A118 DNA sequence
Gene name: bone morphogenetic protein receptor IB (ALK-6)
Unigene number: Hs. 87223
Probeset Accession #: AA250737
Nucleic Acid Accession #: NM_001203
Coding sequence: 274-1782 (underlined sequences correspond to start and stop codons)



1 
I 

CGCGGGGCGC 
GAGGACGCGQ 
GTGAAAGGAA 
CATAACCATT 
TQCCATAAGT 
AATGTGGGCA 
TTGOGTTGTA 
GACGGATATT 
GGTTGCCTAG 
AGAAGATCAA 
CTGCCTCCAT 
ATATCTGTGA 
TATAAAAGAC 
ATTCCTCCTG 
TCAGGCCXCC 
ATTGGAAAAG 
GTGAAAGTGT 
ACAC3TGTTGA 
GGGTCCTGGA 
TATCTGAAGT 
AGTGGCTTAT 
CATCGAGATC 
GACCTGGGOC 
ACXCGA0TTG 
AATCACTTCC 
GTTGCTAGGA 
CTAGTGCCCA 
GGCCTCTCAT 
ATGACAGAAT 
ACACTTGCCA 
CATCTCTGCA 
TAAGCATCCA 
CTTTCAGGGA 
TCTGTXTGTA 



11 
I 

GGAGTCGGCG 
GAGCCGGGAG 
AGGAflGATCA 
TGGCTCTGAG 
GAGAAGCAAA 
CCAAGAAAGA 
AATGCCACCA 
GTTTCACGAr 
GACTAGAAGG 
TTGAATQCTG 
TGAAAAACAG 
CTGTCTGTAG 
AAOAAACCfta 
GAGAATCCCT 
ctctgctggt 
GTCGCTAtfGG 
TCTTCACCAC 
TGAGGCATGA 
CCCAGTTGTA 
CCACCACCCT 
GTCATTTACA 
TGAAAAGTAA 
TGGCIGTTAA 
GCACCAAAOG 
AGTCTTACAT 
GATGTGTATC 
OTGACOOCTC 
TCCCAAACCG 
GCTGGGCTCA 
AAATGTCAGA 
GAAAGCCAAC 
CAGTACAAGG 
GCGACCTGGG 
GGCGGAGAAA 



21 
I 

GGGCCTCGCG 
CGCACGCGCG 
TTTCATGCCT 
CTATGACAAG 
CTTCC1TQAT 
GGATGGTGAG 
CCATTGTCCA 
GATAGAAGAG 
CTCAGATTTT 
CACAGAAAGG 
AGATTTTGTT 
TTTGCTCTTG 
ACCTCGATAC 
GAGAQACTTA 
CCAAAGGACT 
GGAAGTTTBG 
AGAGGAAGCC 
AAACATTTTG 
CCTAATGACA 
AGACGCTAAA 
CACAGAAATC 
AAACATTCTG 
ATTTATTAGT 
CTAXATGCCT 
CATGGCTGAC 
AGGAGGTATA 
TTATGAGGAC 
GTGGAGCAGT 
CAATCCTGCA 
GTCCCAGGAC 
AGGTACTCTT 
CTTGAACATC 
CAAAGACAGA 
CCGTTGGGTA 



31 
I 

GGACGCGGGC 



TGTTGATAAA 
AGAGGAAACA 
AACATGCTTT 
AGTACAGCCC 
GAAGAGTCAG 
GATGACTCTG 
CRGTGTOGGG 
AACGAATGTA 
GATGGACCTA 
GTCCTTATCA 
AGCATTGGOT 
ATTGAGCAGT 
ATAGCTAAGC 
ATGGGAAAGT 
AGCTGGTTCA 
GGTTTCATTG 
GACXATCATG 
TCAATGCTGA 
TTTAGTACTC 
GTGAAGAAAA 
GA1ACAAATG 
CCAGAAGTGT 
ATGTATAGTT 
GTGGAAGAAT 
ATGAGGGAGA 
GATGAGTGTC 
TCAAGQCTGA 
ATTAAACTCT 
CTGTTTQTG3 
GTCCTGCTTC 
GAAGCTCCCA 
ACTTGTTCAA 



41 
I 

AGXGOGGAGA 
CAGCCTACTC 
GGTTCAGACT 
AAAAGTTAAA 
TGCGAAGTGC 
CCACCCCCCG 
TCAACAATAT 
GGTTGCCTGT 
ACACTCCCAT 
ATAAAGACCT 
TACACCACAG 
TA1TATTTTG 
TAGAACAGGA 
CTCAGAGCTC 
AGATTCAGAT 
GGCGTQGCOA 
GAGAGACAGA 
CTGCAGATAT 
AAAATGGTTC 
AGTTAGCCTA 
AAGGCAAACC 
ATGGAACXTG 
AAGTTGACftT 
TGGACGAGAG 
TTGGCCTCAT 
ACCAGCTTCC 
TTGTGTGCAT 
TAAGGCAGAT 
CAGCCCTGGG 
GAT AQQAGAG 
GCAGAGCAAA 
CCAGUGGGTT 
GAAGGAGAGA 
GATATGATGC 



51 

i 

CGQCGGCGCT 
TTTC TTAGAT 
TCTGCTGATT 
CTTACAAGCC 
AGGAAAATTA 
TCCAAAGGTC 
TTGCAGCACA 
GGTCACTTCT 
TCCTCATCAA 
ACAOOCTACA 
GGCTTTACTT 
TTACTTDOGG 
TGAAACTTAC 
AGGAAGTGGA 
GGTGAAACAG 
AAAGOTAGCT 
AATATATCAG 
CAAAGGGACA 
CCTTTATGAT 
CTCTTCTGTC 
AGCAATTGCC 
CTGTATTGCT 
ACCACCTAAC 
CTTGAACAGA 
OCTTTOGGAG 
TTATCATGAC 
CAAGAAGTTA 
GGGAAAACTC 
GGTTAAGAAA 
GAAAAGTAAG 
AGACATCAAA 
CAGACCTCAC 
TTGATCCGTG 
AT 



60 
120 

iao 

240 
300 
360 
420 
460 
540 
600 
660 
720 

7ao 

840 
90 O 
960 
1020 
10 BO 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1600 
1740 
1800 
i860 
1920 
I960 



A119 Protein sequence
Gene name: bone morphogenetic protein receptor IB (ALK-6)
Unigene number: Hs. 72472 / Hs. 87223
Probeset Accession #: AA250737 / UB932S
Protein Accession #: NP_001194
Signal sequence: 1-13
Transmembrane domains: 128-144
PFAM domains: activin_receptor Uo-m}, protein kinase [204-491]
Cellular Localization: plasma membrane



1 11 21 31 41 51

| | | | | |

MLIxRSAGKLN VGTKEDBD3ES TAPTPRPKVD RCKCHBHCFB DSTOSTECSTD GYCFTMIEED 60 

DSGLPWTSG CLGLEGSDFQ CKDTPIPHQR R9IECCTERH ECNKDIiHPTL PPliKNHDFVD 120 

GPZHHRALLI SVTVCSXiLIjV IiXILFCYFRY KRQETRPRY3 IGLEODETYI PPGESLRDLI 180 

EQ9QSSGBGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKHRQEKVAV KVFFTTEBAS 240 

WFRETSIYQT VLMRHENILG FlAADOCGrG SWTQLYBITD YHBNGSLYDY LKfiTTLDAKS 300 

MLKLAYSSVS GLCELHTEIF STQGKPAIAH KDLKSKNILV KKHGTCCIAE I^GLKVKPISD 360 

TNEVDIPPNT RTOTKRYMPP EVLDESLNRN HFQSYIMKDH YSPGLILWBV ARRCVSGGIV 420 

EBYQIiFYHDL VPSDPSYHDM REIVCIKKLR PSFPNRWSSD ECLRQKGKIiM TECHAHNPAS 480 
RlaTAtfRVKKT liAKMSESQDI XL 



A120 DNA SEQUENCE

Gene name: LIV-1 protein, estrogen regulated
Unigene number: Hs.79136

Probeset Accession #: U41060
Nucleic Acid Accession #: NM_012319.2

Coding sequence: 138-2405 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 




| | | | | |

CTCOTGCCGA ATTCGGCACG AGACCGCGTG TTCGOGOCTG GTAGAGATTT CTCGAAGACA 60

CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG QGCOGTGGGA CAACGAGGCC 120 

GCGGAGACGA AQQCGC AATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180 

TCTCTGTCAC AAATCCCCTX CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAQA 240 

AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 

ATCATCTACA ACflGCTTTTC TACCGCTATG GAQAAAATAA TTCTTTGTCA GTTGAAGGGT 360 

TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 

ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480

AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 

CTGCrrrCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA QATAGTTCAG 600 

GTAAAGATCC TAGAAACAQC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 

GAAGGRATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 

TCTCTQAAGQ AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTOC 780

CCAAAGATGT AAGCAGCTCC ACTCCACCCA OTGTCACATC AAAGAGGCGG GTGAGCCGGC 840

TGGCTGQTAQ GAAAACAAAT GAATCTGTGA GTGAGCCCOG AAAAGGCTTT ATGTATTCCA 900

GAAACACAAA TGAAAATCCT CAGGAGTQTT TCAATGCATC AAAGCTACTG ACATCTCATG 960

GCATGGGCAT CCAGGTICCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020

TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080

CTCCAAAQAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1140 

TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 

AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 

TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320 

CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAG ATAGAAGAAA 1380 

GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGQAGGCCTG TATTTCATGT 1440 

TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 

AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 

CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 

GAGCAOAGTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGC3AGTC TTGGAAGAAG 1680 

AAGAGGTCAT GATAGCTCAT DCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 

GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1800

TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860

CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAQCTQAA AGATGCCGGC GTCGCCACTT 1920

TGGCCTGGAT GGTGATAATG GGTGATGGOC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 

GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 

ATGAGTTQCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACOGTTA 2100 

AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAQ 2160 

GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG BATATTTGCA CTTACTGCTG 2220 

GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 

GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 

GTTTTGGAAT TA3X3TTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 

TCTAGTTAAG GTTTAAATGG TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460

AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GT1AGTGGGT TTTGTGATTT 2520

TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580

TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 

TAAACRAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAA1GCT 2700 

TTTTCAAGAA CTAACAGAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 

TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATAOCAA GAAAGCTTAT ACTGAATTTA 2820 

AGCAAAGAAA TAAAGGAGAA AAGAOAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880 

AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940 

CAG&ATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 

GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060

AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120 

TXCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 

TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 

CAAAATTATC AGAGTAGTAA AACTTTGATA XATATGAGGA TA1TAAAACT ACACTAAGTA 3300 

TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 

GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 

GAXGTTTCTT ttttacacaa taaattcctt atatcagctt g 



A121 PROTEIN SEQUENCE
Gene name: LIV-1 protein, estrogen regulated
Unigene number: Hs.79136
Protein Accession #: NP_036451
Signal sequence: 1-21
Pfam domain: Zip [591-743]
Transmembrane domains: 330-346, 352-366, 427-444, 663-679, 688-703, 730-745
Cellular Localization: plasma membrane



1 
I 

MARKbSV"IIiI 



9VSASHVTST 
NESVSEPRKG 
ESCLIHTSBK 
LVALAVGTLS 
THXGIjTALGG 
EBXVDTDDRT 



11 
I 

LTPAIrSVTNP 
SVEGFRKItLQ 
HENHAASGKN 
VYNTVSEGTH 
FMYSHMTMHN 
KABIPPKTY8 
GDAFliHIiIiPH 
I.YPMPLVEHV 
EGYLRADSQE 



21 
1 

U5EDKAAAFP 
UIG1DK3KRI 
KRKALCPDHD 
PLfiTlEIPUP 
PQECPNASKL 
LQIAWVGGFI 



31 
I 



41 
I 



GKLFPKDVSS 
LTSHQMGIQV 
AISIISFliSX* 
HEEPAMEMKfi. 



LTLIKQFKDK 
PSHPD6QQPA 



VLEEEEVMIA 



BALfiAMLAYI. 



GLAIGAAFTB GI»SSGI»STSV 
GMATGIFIGH YAE33VSMWIF 



ALTAGLFMYV 



ESGIKVDLAI 


STROfYHIiGGjli 


60 
120 


SOGKGAHRPE 


HASGRHNVKD 


180 


STPP9VTSKS 


RVSRIiAGRKT 


240 


PLMATEFNYIi 


CPAIIKQXDA 


300 


U3VILVPLHN 


HVFFKFLLSF 


360 


GPLFSHL9SQ 


NZEESAYFDS 


420 


DDDVEIKKQL 


SKYESQLSTO 


480 


HAHPQBVYNE 


YVPRGCKNKC 


SiO 


RYSHBELKDA 


GVATLAHMVI 


600 


LGDF AVLiL KA 


GMTVKQAVi»y 


660 


ALVDKVPEML 


HHDASDHGCS 


720 






RWGYFFIjQNA GMLiiGFGIMli LISIFBHKIV FRINF 



A122 DNA SEQUENCE
Gene name: EGF-like-domain, multiple 6
Unigene number: Hs.12844
Probeset Accession #: N67551
Nucleic Acid Accession #: NM_015507
Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons)



l 
1 

CCGCAGAGGA 
CX3CCCCTGCC 
D3AGTGGAGC 
GGGTCCGGCC 
ATGCCTCTGC 
GGGAACGCGG 
TGTCACTAtG 
TGTGAAGCTA 
AGATGCTTTC 
AAACCCCGGC 
CTGW3TGGCC 
ATAAACIGTC 
TGAGGACTCC 
GGTAAAGTCA 
AAATGTCACA 
AATGAATGTA 
GGGTCCTTCA 
ATCCCTGAAA 
AAGAAGTTGC 
CCAOAACCCA 
ATAGTTTCCA 
GAGGGC3CTTG 
AGCCTGCGAQ 
CTGQTCCAAA 
GACTGCAGCT 
TGGAATCCTG 
GGTCACAAGA 
AACTTCTGTT 
TTTGTGAAAA 
TGGAAGACAB 
GAAGCAGAAC 
TCAOGCTTAT 
TTGACTTTGT 
TTAGAATTAC 
TCTTGTATAA 

tttctgaatc 

CAGTATATCT 
TAGAAAAAAA 
TATGACATCA 
TGTATATTTA 



11 
I 

GCCTCGGCCA 
GCGGTGCCTG 
GGAGGACCCG 
GGCGCCCTCC 
CCTGGAGCCT 
CCAGTGCAAG 
QAACTAAACT 
CATGOGAACC 
CAGGATACAC 
CATGCCAACA 
ACATGCTCAT 
AGTACAGCTQ 
GCCTGGCCCC 
TCTGTCCClA 
TTQGTTTCGA 
CTATGGATAG 
AGTGTAAATG 
ATTCTGTGAA 
TTGCTCACAA 
CCAGGACTCC 
GAGGCGGOAA 
AGGATGAGAA 
GAGATGTGTT 
GGAAAGCGCT 
TCAATCATOQ 
CTGATCGAGA 
AAGACATTGQ 
TGCTCTTTGA 
ACAGTAACAA 
GGAAAATTCA 
GTGGCAAGGG 
GTCCAGATAG 
A7GTCAGTTC 
TAGCTGAAAA 
GATATGCCAA 
TTTCCACATT 
GATTTGTATA 
AGCACAGAGA 
AAGATAGACT 
AATTCTTTGT 



21 
1 

GGCTAGCCAG 
GCCTCCCCTC 
AGCGGCTGAG 
CGAGGGGGGC 
TGCQCTCOOG 
GCATCACGGQ 
GGCCTGCTGC 
TGGATOTAAG 
CGGGAAAACC 
CAGATGTQTQ 
GCCAGATGCT 
TGAAGACACA 
AAATQOAAGA 
CAATCGAAGA 
ACTGCAATAT 
CCATACGTGC 
CAAGCAGGGA 
GGAAGTCCTC 
AAACAGCATG 
TACGCCTAAG 
CTCTCATGQA 
AAGAGAAGAG 
TTTCCCTAAa 
AACTTCCAAA 
GATCTGTGAC 
TAATQCTAXT 
COGATTGAAA 
TTACCGGCTG 
TGCCCTGQCA 
GTTGTATCAA 
CAAAACCGGC 
CCTTTTATCX 
CCTGffiTTTTT 
ATTGTAATGT 
TATTTGCTTT 
ATATTATAAA 
AGTAAGTTGA 
AATGTTTAAC 
TTTGCCTAAG 
AATAATAATA 



31 
1 

GQCGCCCCCA 
CCAGACTGCA 
GAGAGAGGAG 
TCAGGAGGAG 
CTGCTGCTCT 
TTGTTAGCAT 
TAOGGCTGGA 
TTTGGTGAQT 
TGCAOTCAAG 
AATACACACG 
ACGTQTGTGA 
GAAGAAGGGC 
GACTGTCTAG 
TQTGTGAACA 
ATCAGTGGAC 
AGCCZACCATG 
TATAAAGGCA 
AOAGCACCTG 
AAAAAGAAGO 
GTGAACTTGC 
GGTAAAAAAG 
AAAGCCCTGA 
QTGAATGAAG 
CXGGAACATA 
TGGAAACAGG 
GGCTTCTATA 
CTTCTCCTAC 
GCCGGAGACA 
TGGGAGAAQA 
GGAACTGATG 
GAAATCGCAG 
GTGGATGACT, 
TTGATAT TGC 
ACCAACAGAA 
AAATATCATA 
AXATGGAAAT 
TGAGCTTCTC 
TGTTTQACTC 
TGGCTTAGCT 
TCCAAATCAT 



41 
1 

GCCOCTCCCC 
GGGACAGCAC 
GCGGCG3CTT 
GAAGGAGGAC 
CCTGGGTQGC 
OGGCACGTCA 
GAAGAAACAG 
GCGTGGSACC 
ATGTGAATGA 
GAAGCTACAA 
ACTCTAGGAC 
CACAGTGCCT 
ATATTGATGA 
CATTTGGAAG 
GATATGACTG 
CCAAITGCTT 
ATGGACTTCG 
GTACCATCAA 
CAAAAATTAA 
AGCCCTTCAA 
GGAATGAAGA 
AGAATGACM7 
CADGTGAATT 
AAGAT1TAAA 
ATAGAGAAGA 
TGGCAGTTCC 
CTGAOCTGCA 
AAGTCGGGAA 
CCACGAGTGA 
CTACCAAAAG 
TGGATGGOGT 
GAATGTTACT 
ATCATAGGAC 
ATATTATTGT 
TCACTGTATC 
GTCAGTTTAT 
TCTACAACAT 
TTATGATACT 
GGGTCTTTCA 
CAAAAAAAAA 



51 




I 

AGGCOQCGAG 


60 


CCGGTAACTG 


120 


AGCTGCTAGQ 


180 


CCGTGCGAGA 


240 


AGGTGGTTTC 


300 


GCCTGGGGTC 


360 


CAAGGGAGTC 


420 


AAACAAATGC 


480 


GTGTGGAAIG 


540 


GTGCTTTTGC 


600 


ATGTGCCATG 


660 


GTGTCCATCC 


720 


ATGTGCCTCT 


7 BO 


CTACTACTQC 


840 


TATAGATATA 


900 


CAATACCCAA 


960 


GTQTTCTGCT 


1020 


AGACAGAATC 


1080 


AAATGTTACC 


1140 


CTATGAAGAG 


1200 


GAAAATGAAA 


1260 


AGAGGAGCGA 


3320 


CGGOCTGATT 


1380 


TATCTCGGTT 


1440 


TGATTTTGAC 


1500 


GGCCTTGGCA 


1560 


ACCOCAAAGC 


1620 


ACTTCGAGTG 


1660 


GQATGAAAAG 


1740 


CATCATTTTT 


1S00 


CTTQCTTGTT 


1860 


ATCTTEATAT 


1920 


CTCTGGCATT 


1980 


AAGATGCCTT 


2040 


TTCTCAGTCA 


2100 


CTGCCCTCCT 


2160 


TTCTAGAAAA 


2220 


TCITGGAAAC 


22 B0 


TAGCCAAACT 


2340 


AAAAAAAA 





A123 Protein sequence
Gene name: EGF-like-domain, multiple 6
Unigene number: Hs. 12844
Probeset Accession #: N67551
Protein Accession #: NP_056322.2
Signal sequence: 1-21
Transmembrane domains: none found
MAM domain: 402-546
EGF domain: 80-259
Cellular localization: secreted



1 
) 

CEATCEPGCK 
LSGEMLMPDA 
GK VICP YKRR 
Q3FKCKCKQG 
PEPTRTPTPK 
SI*RGDVPPPK 
WKPADRDNAI 
FVKEISMHALA 
SGIiCFDSLliS 



21 
I 



13 
I 

LLL6WVAGGF 
FGBCVGPNKC RCFPGYTGKT 
TCVKffiRTCAM INCQYSCBDT 
CVHTPGSYYC KCHISPELQY 
YKGNGLRCSA IPEHBVKEVIj 
VMLQPFNYEE IVS&GGNSHG 
VNEAOEPGIjI LVQRKAIiT9K 
GFYMAVPAliA GHKKDIGRLK 
WBKTTSEDEK WKTGKIQliYQ 
VDD 



31 
1 

UASARQPGV 
C3QDVNECGH 
E3GPQCLCPE 
ISGRYDCIDI 
HAPGTIKDRI 
GKKSNEBKHK 
LBEKDtNIfiV 
LLLPDLQPQS 
GTDATKSIIP 



41 51 

i I 
CHYGTXLACC YGWRSN9K0V 
KPRPCQHRCV WPHGSYKCFC 
GGLRLAP^IGR UCLDIDBCA3 



KJCLLRHKKTSM KKKAKIXNVT 
EGLEDEKRE5 KAlJECMDIEER 
DCaFBTRGICD WKQDREDDFD 
NFCLLFDYHIi AGDKVGJCbRV 
EAHRGKGKTG EIAVDQVUjV 



60 
120 
180 
240 
300 
360
420
480
540 



A124 DNA SEQUENCE
Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)
Unigene number: Hs. 170195
Probeset Accession #: BB616633
Nucleic Acid Accession #: NM_001719



889 



WO 03/042661 



PCT/US02/36810 



Coding sequence: 123-1418 (underlined sequences correspond to start and stop codons)



GGGCGCAGCG 
CTGCCACCTG 
CGATBCACBT 
CCCTGTTCCT 
GCTTCATCCA 
CCATTtCTGGG 
CCATGTT CAT 
GCCAGGGCTT 
GCCTGCAAGA 
TGGAACATGA 
TTTCCAAGAT 
ACATCCQGGA 
AGCACTTGGG 
AGGAGGGCTG 
GGCACAACCT 
AGTTGGCGGG 
TCTTCAAGGC 
GCCAGAACCG 
AGAACAGCAG 
GAGACCTGGG 
AGGGGGAGTG 
AGACCCTGGT 
AGCTCAATGC 
ACAGAAACAT 
TTGGGGCCAA 



AAACATGAGC 
TCCTACAAGC 
GCCGGGCCAG 
TTATGAGCOC 
GGGCACATTG 
CAATAAAACG 



11 
I 

GGGCGGTGCG 
GCGCTCACTG 
GCTGCGCTCC 
CCGGCGCCTC 
CTTGCCCCAC 
GCTGGACCTG 
CTCCTACCCG 
TAG CCATTT C 
CAAGGAATTC 
CDCAGAAGGG 
ACGCTTCGAC 
CAGGGAATCG 
GCTGGTGTTT 
GGGCCTGCAG 
CCTGATTGGG 
CACGGAGGTC 
CTCCAAGACG 
CAGCGACCAG 
CTGGCAGGAC 
TGCCTXCCCT 
CCACTTCATC 
CATCTCCGTC 
GOTGGTCCGG 
GTTTTTCTGG 
TQAOACCTTC 
AGCATATGGC 
TGTQCAGGCA 
GTCATTGGCT 
CTACCAGCCA 



21 
I 

GCAGCAAGTG 
GGCCCGGAQC 
CGAGCTGCGG 
GCCCTGGCCG 
CGCAGCCAGG 
CGCCCGCGCC 
TACAACGCCA 
TACAAGGCCG 
CTCACCGACG 
TTCCACCCAC 
GAAaCTQTCA 
AATGAGACGT 
GATCTCTTTCC 
GACATCACAG 
CTCTCQGTGG 
CGGCACGGGC 
CACTTCCGCA 
CCCAAGAACC 
AGGCAGGCCT 
TGGATCATCQ 
CTGAACTCCT 
AACCCGGAAA 
CTCTACTTCG 
QCCTGTGGCT 
ATCCTCCATT 
CCCTCCCTAT 
TTTTGATCAG 
AAACCTAGCA 
GGGAAGTCTC 
GGCCACCCAG 
GAAAQGAAAA 



31 

I 

ACCGACGGCC 
CCGGAGCCCG 
CGCCGCACAG 
ACTTCAGCCT 
AGCQGCGGGA 
CGCACCTCCA 
TGGCGGTGGA 
TCTTCAGTAC 
COGACATGGT 
GCTACCACCA 
CGGCAGCCGA 
TCCGGATCAG 
TGCTCGACAG 
CCACCAGCAA 
AGACGCTGGA 
CCCAGAACAA 
GCATCCGGTC 
AGGAAGCCCT 
GTAAGAAGCA 
CGCCTGAAGG 
ACATGAAOGC 
CGGTGCCCAA 
ATGACAGCTC 
GCCA CTAG CT 

CCCCAACTTT 
TTTTTCAGTG 
GGAAAAAAAA 
AGCCATGCAC 
CCGTGGGAGG 
TTGACCCGGA 



41 

i 

GGGACGGCCG 
GGTAGCGOGT 
CTTCGTGGCG 
GGACAAOGAG 
GATGCAGCGC 
GGGCAAGCAC 
GGAGGGCGGC 
CCAGGGCCCC 
CATGAGCTTC 
TCGAGAGTTC 
ATTCCQGATC 
OQTTTATCAG 
CCGTACCCTC 
CCACTGGGTG 
TGGGCAGAGC 
QCAGCCCTTC 
CACGGGGAGC 
GCGGATGGCC 
CGAG CTGT AT 
CTACGCCGCC 
CACCAACCAC 
GCCCTGCTGT 
CAACGTCATC 
CCTCCGAGAA 
GOCAGGAAOC 
AAAGGTGTGA 
□CAGCATCCA 
ACAACGCATA 
GGAC1CGTTT 
AAGGGGGCGT 
AGTTCCTGTA 



51 
I 

AGAGCCGGCG 
CTCTGGGCAC 
GTGCACTCGA 
GAGATCCTCT 
AACTCGGCAC 
GGGCCCGQOG 
CCTCTGGCCA 
GTCAACCTCG 
CGGTTTGATC 
TACAAGGACT 
GTGCTCCAGG 
TGGGCCTCGG 
GTCAATCCGC 
ATCAACCCCA 
ATGGTGGCTT 
AAACAGCGCA 
AACGTGGCAG 
GTCAGCTTCC 
T ACTA CTGT G 
GCCATCGTGC 
GCGCCCACGC 
CTGAAGAAAT 
TTCAGACCCT 
AGCAGACCAA 
GAGTATTAGG 
ATGAACAAGA 
AAGAAAAATG 
CCAGAGGTAA 
GGCAAGGGGT 
ATAAATGTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



AATGAATG 



A125 Protein sequence

Gene name: bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)
Unigene number: Hs. 170195
Probeset Accession #: BB616633
Protein Accession #: NP_001710.1
Signal sequence: 1-30
Pfam domains: TGFb_propeptide [37-281]
Transmembrane domains: none found
Cellular Localization: secreted



1 
I 

MHVRSfcRAAA 
ILGLPHRPKP 
ijQDgHFJvrnA 
LRERFDNETF 
KNLGLQLSVE 
QNRSKTPKHQ 
GECAFPLNSY 



11 
I 

PHSFVAWIAP 
ELQGKHN8AP 
PMVM3PWLV 
RISVYQVLQE 
TLDGQ3INPK 
KALRMANVAE 
MNATNHAIVQ 
H 



21 

I 

£iFI»DRSAIiAD 
KFKLDLYNAM 



31 
I 

FSLDHEVHS6 



HldG HES D U I t 
DAGLXGRHGP 
NSSSDQRQAC 
TTtVHFINPET 



YHHRSFRFDL 
LDaRTLWASE 
QNKQPFMVAF 
KKHELYVfiPR 
VPKPCCAPTQ 



41 
I 

FIHHRLRSQE 
QGFSYFY3JAV 
6KIPEGEAVT 
EGWLVFDITA 
FKATEVHFRS 
ULGWOJDWI1A 
IHAigVIiYFD 



51 
I 

RREMQREIfcS 60 

FBTQGPPLAS 120 

AABERIYKUY 180 

T6UHWWWPR 240 

IRSTGSKQRS 300 

PEGYAAYYCE 360 

D3SNVILKXY 420 



A126 DNA SEQUENCE

Gene name: integrin, beta 8
Unigene number: Hs. 52620
Probeset Accession #: AA479726
Nucleic Acid Accession #: NM_002214
Coding sequence: 680-2990 (underlined sequences correspond to start and stop codons)



l 
I 

CCCAGAaCCG 
CTGCCGACTT 
GTTT3GCCTCC 
TCCCCTCGAC 
TAGGGTGGTT 
CTAAGCTGAT 
TGTCCCGGAG 
TGGCCGTCGA 
GGCCGTAGGG 
CCGAGCCGCG 
QGOCCCGAGG 

TCTGCCTGCA 



11 
I 

CCTCCCCCTG 
GTCTTTGCCC 
CTGCCCACCT 
CTCGCCGGCa 

TTATGCAGCA 
CAGGCTGCGG 
AGGAGGTGCT 
GCCCTGAGAT 
GGGTCCGOCT 
TCGCCCGGGA 
GTTTTGCATT 
AAACGACCGG 



21 
1 

TTGCTGGCAT 
GCTGCTCCGC 
GTGGAAGCAA 
TACCCTCCCA 
CTTCGGGCTT 
GAAGOCCCAC 
AGCCCTTGCA 
TCTOaCOGAG 
GCCGAGDGOT 
GCTAGGCCTG 
GGCCOAGCCC 
ATGTGCGGCT 
CGAGGTCCCG 



31 
I 

CCCGAGCTTC 
AGACGGGGCT 
CTGCGCTQAT 
CAGATCCAGC 
TGTTTGGGTT 
CGGCTGGAGA 
GAGCCCTCTC 
ACCGCGGGAC 
GCCCGGGCCC 
CGGAAAACGT 
GCGTCCGGAA 
CGGCCCTGGC 
CCTCGTTCCT 



41 
i 

CrCCCTTGCC 
GCAAAGCTGC 
TGATGGGCCA 
ATCACCCAGT 
TGATTGTGTT 
GAAACAAAAG 
TCCAGTCGCC 
CCGCCGTGCC 
GCITACCTGC 
CCTAGCGACA 
GGCAGCCAGG 
TTTTTTTACC 



51 
I 

AGCCAGGACG 
AACTAATGGT 
CAGACTTTTT 

gaatgtacat 
tggctcttcg 

CTCTTTTCTT 
GCCGGGCCCT 
GAGCCGGGAG 
ACCGCTTGCT 
CTCaCCCGCG 
CGGCGGGCGC 
GCTGCATTTG 
TGGGTQTTTX 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 






CACTTGTTCT TGGACTGGGC CAAGGTGAAG ACAATAGATG TGCATCTTCA AATGCAGCAT 840 

CCTtJTGCCAG GTGCCTTGCG CTGGGTCCAG AATGTGGATG GTGTGTTCAA GAGGATTTCA 900 

TTTCAGGTGQ ATCAAGAAGT GAACGTTGTG ATATTGTTTC CAATTTAATA AGCAAAGGCT 960 

GCTCAGTTGA TTCAATAGAA TACCCATCTG TGCATGTTAT AATACCCACT GAAAATGAAA 1020 

TTAATACCCA GGTGACACCA GGAGAAGTGT CTATCCAGCT GOGTCCAGGA GCCGAAGCTA 1080

ATTTTATGCT GAAAGTTCAT CCTCTGAAGA AATATCCTQT GGATCTTTAT TATCTTQTTG 1140 

ATGTCTCAGC ATCAATGCAC AATAATATAG AAAAATTAAA TTCCGTTGGA AAGGATTTAT 1200

CTAGAAAAAT GGCATTTTrC TCCCGTGACT TTOGTCTTGG ATTTGGGTCA TACGTTGATA 1260

AAACAGTTTC ACCATACATT AGCATCCACC CCOAAAGGAT TCATAATCAA TGCAGTGACT 1320 

ACAATTTAGA CTGCATGCCT CCCCATGGAT ACATCCATGT GCTGTCTTTG ACAGAGAACA 1380

TCACTOAGTT TGAGAAAGCA GTTCATAGAC AGAAGATCTC TGGAAACATA GATACACCAG 1440

AAGGAGGTTT TGACGCCATG CTTCAGGCAG CTGTCTGTGA AAGTCATATC GGATGGCGAA 1500

AAGAGGCTAA AAGATTGCTG CTGGTGATGA CAOATCAGAC GTCTCATCTC GCTCTTGATA 1560 

GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG TCATCTGAAA AACAACGTCT 1620 

ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680

ACAACAACAT TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 1740 

TTCTACC3CCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 1800 

ATAATTTGGT AGTGOAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 1860 

ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 1920 

CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 

TTACAATGAA AAAATGTGAT GTCACAGQAQ GAAAAAACTA TGCAATAATC AAACCTATTG 2040 

GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TOTGAGGACA 2100 

ACAGAGGACC TAAAGQAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160 

GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCRGTTTTC TTCTGAGAGT TGCAAGTGAC 2220 

ACAAGGATCA GCCTQTTTGC AGTGGTCGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 

ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTXCTXGTC 2340 

CATATCAGCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGOCAAT 2400 

GCTTCAGTGG CTGGGAAGGT QATCOATQCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 

TCAATTCAAA GOGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2520 

GCAOCGATOC CAGGAGCATC GGOCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2580 

GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT CAGQCTATAC 2640 

TTQATCAGTG CAAAACCTCA TGTGCTCTCA TGQAACAACA GGATTATGTC GACCAAACTT 2700 

CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 2760 

TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 2820 

ATAAAATTAA GTGCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880

TGCAAAGTGT TTGCACAAOA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2940 

TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 3000 

TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 

AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 3120 

GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180 

AAAATGTGTC TTACTAGTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 

ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAOAQ CACTGATTAC ACTTTACAGG 3300

TACCTGTTAT CCGTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 

CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420 

TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480 

ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTXAArCTTA 3540

AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 

TTTgCAAOAT GGATACTAAT TOCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 

TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720 

TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780 
GAATGTTAA 



A127 Protein sequence

Gene name: Integrin, beta 8
Unigene number: Hs. 52620
Probeset Accession #: AA479726
Protein Accession #: NP_002205.1
Signal sequence: 1-39
Transmembrane domains: 682-704
EGF domain: 552-584
INB domain: 54-469
Cellular Localization:



1 11 21 31 41 51 

1 I I i I I 

MCGSALAPFT AAFVCLQNDH RGPA3FLWAA WVFSLVLGLG OjGEDNHCAGS NAASCAHCLA 60 

LGPECGKCVQ EDFISGGSRS ERCDIV8NI»I SKGCSVDSIE YPSVHVZTPT ENEOJTOVTP 120 

GEVSIQLRPG AEANFMDKVH PIjKKYPVDI»Y YLVDVSASMH HUIBKUJSVG MDLSRKNAFF 180 

SRDFBIOTBS YVDKTVSPYI SIHPERTHNQ CSDYNLDCMP PHGYIHVT.3I, TENTTBFEKA 240 

VHRQKIEGNI DTPEGGFDAM LQAAVCESHI GWRKEAXRLt, LVHTDQTSHli ALDSKLAGIV 300 

VPNDGNC3ILK HBVYVKSTTM EHPSLGQLSB KLIDNNINVI FAVQGKQFHW YKDIrlfPMiPG 360 

T I AGE IS SKA AKIKNLWEA YQKL.ISEVKV QVHffQVQGIY FNITAICPDO SRKPGHBGCR 420 

NVTSHDTEVIjF NVTVTCMKKCD VTGGKNYA1I KPIGPUETAK IHIERNCfiCQ CEDNRGPHGK 480 

CVDETFLDSK CPQCDENKCH PDBDQFSSES CKSHKDQPVC SGHGVCVCGK CSCHKIKLGK 540 

VYGKYCEKDD F8CPYHHGNI1 CAGHGECSAG ROQCFSGWEG DRCQCPSAAA QHCVNSSGQV 600 

CSGRGTCVCG RJCECTDPRSt GRFCEHCPTC YTACKENWHC MQCLiHPEKI*S OAILDQCKTB 660 

CALMEOOHW DQTSECFS&P SYLRIFPIIF IVTFliIGIxLK VLIIRQVILQ MWSHKIKSS3 720 
UYRVSASKKD SLIJ^QSVCTR AVTYHfiEKPE EIKMDIfiKIiN AHETFRCUF 

A128 DNA SEQUENCE

Gene name: G protein-coupled receptor 64






Unigene number: Hs. 184942
Probeset Accession #: AA435577
Nucleic Acid Accession #: NM_005756
Coding sequence: 73-3117 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51



AGCCAGCCCG AGGACGCGAG CGGCAGGTGT GCACAGAGGT TCTCCACTTT GTTTTCTOAA 60
CTCGCGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGGCC ATGTTGGCAG AACTGAAGAA 120 
GTTTTACTGA CGTTCAAGAT ATTCCTTGTC ATCATTTGTC TTCATGTCGT TCTGGTAACA 180 
TCCCTGGAAG AAGATACTGA TAATTCCAGT TTGTCACC&C CACCTGCTAA ATTATCTGTT 240 
GTCAGTTTTG CCCCCTCCTC CAATGAGGTT GAAACAACAA GOCTCAATGA TGTTACTTTA 300
AGCTTACTCC CTTCAAACOA AACAGAAAAA ACTAAAATCA CTATAGTAAA AACCTTCAAT 360
GCTTCAGGCG TCAAAOCCCA GAGAAATATC TGCAATTTGT CATCTATTTG CAATGACTCA 420 
GCATTTTTTA GAGGTGAGAT CATQTTTCAA TATGATAAAG AAAGCACTGT TCCCCAGAAT 480
CAACATATAA CGAATGGCAC CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAACGCTCA 540 
GAGCTCAACA AAACCCTGCA AACCCTAAGT GAOACTTACT TTATAATGTG TGCTACAGCA 600 
GAGGCCCAAA GCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC AATGAATG CA 660 
TGTGCTGCAA TAGCCGCTTT GQAAAGAGTA AAOATTCGAC CAATGGAACA CTGCTGCTGT 720 
TCTGTCAGGA TACCCTGCCC TTCCTCCCCA GAAGAGTTGG GAAAGCTTCA GTGTGACCTG 780 
CAGGATCCCA TTGTCTGTCT TGCTGACCAT CCACGTGGCC CACCATTTTC TTCCAGCCAA 840 
TCCATCCCAG TGGTGCCTCG GGCCACTGTG CTTTOCCAGG TCCCCAAAGC TACCTCTTTT 900 
GCTGAGCCTC CAGATTATTO ACCTGTGAOC CACAATGTTC CCTCTCCAAT AGGGGAGATT 960 
CAACOCCTTT CACCCCAGCC TTCAGCTOCC ATAGCTTCCA GCCCTGOCAT TGACATGCCC 1020 
CCACAGTCTG AAACGATCEC TTCCCCTATG CCCCAAACCC ATGTCTCCGG CACCCCACCT 1080 
CCTGTGAAAG CCTCATTTTC CTCTOCCACC GTGTCTGCCC CTGCGAATGT CAACACTACC 1140 
AGCQCACXTC CTGTCCAGAC AGACATCGTC AACACCAGCA GTATTTCTGA TCTTGAGAAC 1200 
CAAGTGTTGC AGATGGAGAA GGCTCTGTCC TTGGGCAGCC TGGAGCCXAA CCTOGCAGGA 1260 
GAAATGATCA ACCAAGTCAG CAGACTCCTT CATTCCCOGC CTOACATGCT GGCCCCTCTG 1320 
GCTCAAAGAT TGCTGAAAGT AGTGGATGAC ATT GK3 CCT AC AGCTGAACTT TTCAAACACG 1380 
ACTATAAGTC TAACCTCCCC TTCTTTGGCT CTGGCTGTGA TCA0AGTGAA TGCCAGTAST 1440 
TTCAACACAA CTACCTTTGT GGCCCAAOAC CCTQCAAATC TTCAGGTTTC TCTGGAAACC 1500 
CAAGCTCCTG AGAACAGTAT TGGCACAATT ACTCTTCCTT CATCGCTGAT GAATAATTTA 1560 
CCAGCTCATG ACATGGAGCT AGCTTCCAGG GTTCAGTTCA ATTTTTTTGA AACACCTGCT 1620 
TTGTTTCAGG ATCCTTOCCT GGAGAACCTC TCTCTGATCA GCTACGTCAT ATCATCGAGT 1680 
GTTGCAAACC TGACCGTCAG GAACTTGACA AGAAAOGTGA CAGTCACATT AAAGCACATC 1740 
AACCCGAGCC AGGATGAGTT AACAGTGAGA TGTGTATTTT GGGACTTGGG CAGAAATGGT 1800 
GGCAGAGGAG GCTGGTCAGA CAATGGCK3C TCTGTCAAAG ACAGGAGATT GAATGAAACC 1860
ATCTGTACCT GTAGCCATCT AACAAGCITC GG CG TTCTGC TGGACCTATC TAGGACATCT 1920 
GTGCTGCCTG CTCAAATGAT GGCTCTGACG TTCATTACAT ATATTGGTTG TGGGCTTTCA 1980 
TCAATTTTTC TGTCAGTGAC TCTTGTAACC TACATAGCTT TTGAAAAGAT CCGGAGGGAT 2040 
TACCCTTCCA AAATCCTCAT CCAGCTGTGT GCTGCTCTGC TTCTGCTGAA CCTGGTCTTC 2100 
CTCCTGGACT COTGGATTGC TOTGTATAAG ATGCAAGGCC TCTGCATCTC AGTGGCTGTA 2160 
TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG GOCTAGAAGC ATTCCATATG 2220 
TACCTOGCCC TTGTCAAAGT ATTTAATACT TACATCCGAA AATACATCCT TAAATTCTGC 2280 
ATTGTCGGTT GGGGGGTAOC AGCTGTGOTT GTGACCATCA TCCTGACTAT ATCCOCAGAT 2340 
AACTATGGGC TTGGATCCTA TGGGAAATTC CCCAATGGTT CACCGGATGA CTTCTGCTGG 2400 
ATCAACAACA ATGCAGTATT CTACATTACG GTGGTGGGAT ATTTCTGTGT GATATTTTTG 2460
CTGAAGGTCA GCATGTTCAT TGTGGTCCTG GTTCAGCTCT GTCGAAXTAA AAAGAAGAAG 2520 
CAACTGGGAG CCCAGCGAAA AACCAGTATT CAAQACCTCA GGAGTATCGC TGGCCTTACA 2580 
TTTTTACTGG GAATAACTTG GGGCTTTGCC TTCTTTGCCT GGGGACCAGT TAACGTGACC 2640 
TTCATGTArC TGTTTGCCRT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2700 
TGTGTGGCCA AAGAAAATGT CAGGAAGCAA TGGAGGCGGT ATCTTTGTTG TGGAAASTTA 2760 
CGGCTGGCTG AAAATTCTGA CTGGAGTAAA ACTGCTACTA ATGGTTTAAA GAAGCAGACT 2820
GTAAACCAAG GAGTGTOCAG CTCTTCAAAT TCCTTACAGT CAAGCAGTAA CTCCACTAAG 2880 
TCCACCACAC TGCTAGTOAA TAATGATTGC TCAGTACACG CAAGCGGGAA TGGAAATGCT 2940 
TCTACAGAGA GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG GCTTCACGAT 3000 
TTCACTGGAA AACAGCACAT GTTTAACGAG AAGGAAGATT CCTGCAATGG GAAAGGCOGT 3060 
ATGGCTCTCA GAAGGACTTC AAAGGGGGGA AGCTTACACT TTATTGAGCA AATGTGATTC 3120
CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACAGTGTGAA ATGTCCAATT TTACCTTTTA 3180 
CACAATGTGA GATGTATGAA AATCAACTCA TTTTATTCTC GGCaACATCT GGAGAAGCAT 3240 
AAGCrTAATTA AGGGCGATGA 1TATTATTAC AAGAAGAAAC CAAGACATTA CACCATGGTT 3300 
TTTAGACATT TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG GTTTTAAACA 3360 
ATACACTAAG AATGACTCCT ATAAAGAAAA CAAAAAAAGG TAGTGAACTT TCAGCTACCT 3420 
TTTAAAGAGG CTAAGTTATC TTTGATAACA TCATATAAAG CAACTGTTGA CTTCAGCCTG 3480 
TTQGTGAGTT TAGTTGTGCA TGCCTTTGTT GTATATAAGC TAAATTCTAG TGACCCATGr 3540 
GTCAAAAATC TTACTTCTAC A'lTlVJTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTC CT 3600 
TTGTAGAATC ATGGTTGTTT TGTCTCAGGT GATAATTCAG AAAATCCTTG CTCGTTCOGG 3660 
AAATCCTAAA GCTOCTTTTG GAGATGATAT AGGATGTGAA ATACAGAAAC CTCAGTGAAA 3720 
TCAAGAAATA ATGATCCCAG CCAGACTGAG AAAATGTAAG CAGACAGTQC CACAGTTAGC 3780 
TCATACAGTG CCTTTGAGCA AGTTAGGAAA AGATGCCCCC ACTGGGCAGA CACAGCCCTA 3840 
TGGGTCATGG TTTGACAAAG AGAGTGAGAG ACCATATTXT AGCCCCACTC ACCCTCTTGG 3900 
GTGCACGACC TGTACAGCCA AACACAGCAT CCAATATGAA TACOCATCCC CTGACCGCAT 3960 
CCCCAGTAGT CAGATTATAG AATCTGCACC AAGATGTTTA GCTTTATACC TTGGCCACAG 4020 
AGAGGGATGA ACTGTCATCC AGACCATGTG TCAGGAAAAT TGT6AAC3GTA GATGAGGTAC 4080 
ATACACTGCC GCTTCTCAAA TGCCCAGAGC CTTTAGGAAC AGGAGAGTAG ACTAGGATTC 4140 
CTTCTCTTAA AAAGGTACAT ATATATGGAA AAAAATCATA TTGCCGTTCT TTAAAAGGCA 4200 
ACTGCATGGT ACATTGTTGA TTGTTATGAC TGGTACACTC TGGCCCAGCC AGAGCTATAA 4260 
TTGTTTTTTA AATGTGTCTT GAAGAATGCA CAGTGAGAAG GGGAGTAGCT ATTGGGAACA 4320 
GGGAACTGTC CTACACTGCT ATTGTTGCTA CATGTATCGA GC3CTTOATTG CTCCEftSTTA 4380 
TATACAGGGT CTATCTTGCT TCCTAGCTAC ATCTGCTTGA GCAGTGOCTC AAGTACATCC 4440 
TTATTAGGAA CATTTCAAAC CCCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 4500 
ATTTCAAGTG AATGTTGGAT CTCAGACXAA CCATAGTAAT AATACACATT TCTGTGAGTG 4560 






CTGACTTGTC TTTGCAATAT TTCTTTTCTG ATTTATTTAA ' 
AAAATCAAAA ATGTTAAAAT CAATGAAATA AATTTGCAGT TAAGA 



TTTATATGTT 4620



A129 Protein sequence
Gene name: G protein-coupled receptor 64
Unigene number:
Protein Accession #: NP_005747.1
Signal sequence: 1-38
GPS domain: 564-615
Transmembrane domains: 624-646, 660-682, 688-710, 733-755, 783-805, 828-850, 858-6I
Cellular Localization: plasma membrane



1 
I 

MVFSVRQCGH 
PSStfEVETTS 
GEIMFQYDKE 
TLHCTFTIKIi 
VCLADHPRGP 
PQPSAFIA5S 
VQTDtVNTSS 
LKWDDIGLQ 
NSIOTITIiPS 
TVRNLTHNVT 
SHIiTSFHVIjI* 
ILIQLCAALL 
VKVFNTrTRK 
AVFYITWGY 
ITWGFAFFAW 
NSDWSKTATN 
NGVSFSVQNQ 



11 
I 

VGRTEEVliLT 
IiNDVTLSIiLF 
STVPQNQHIT 
KNTMNACAAJ 
PFSSSQS1PV 
PAIDMPPQSE 
ISDLBNQVLQ 
IHFSNTTISL 
SLMWWIiPABD 
VTIiKHrNPSQ 
Dl^SRTSVUA 
LLHLVFLU3S 
YILKFCIVGW 
FCVIFIiIiNVS 
QPVNVTFMYL 
GLXKQTVNQG 
DVGLHDFTGK 



21 
I 

PKIFIiVJICT* 
SNETEKTKIT 
NGTIiTGVIjSI* 
AA&ERVKIRP 
VPRATVLSQV 
TISSPMPQTH 
MHKALSIiQSlj 
TSP9UUAVI 
MELASRVQFN 



31 
I 

HWLVTSLEE 
TVKTFHASGV 
SELKRSELNK 



PKATSFAEPP 
VSGTPPFVKA 
EPNXJVGEMIN 
RVNASSFMTT 
PFETPAIiFQD 



QMMALTFITY 
WIALYKHQQL 
GVPAWVTII 
MPIWLVQLC 
FAIFNTLQGF 



IGCOL.SSIFI. 
CrgVAVFLKY 
LTI SPENYGI* 
RIKKKKQLGA 
FIFTFYCVAK 
SSNSTN8TTL 



41 
I 

DTDNSSLSPP 
KPQRNICNIiS 
TLQTI»BETYF 
PCPSSPEELG 

SFSSPTVSAP 
QVSRLLHSPP 
TFVAQDPANL. 
PSXiEKLBLIS 
HSDNGCfiVKD 
gvXbVTYZAF 
FLLVSFTWMG 
GSYGKFPNGS 
qRKTSIQDLR 
ENVRKQWRRY 
I>VNNDCSVHA 
RTSKRGSLHF 



51 




1 

PAXLSWSFA 


60 


SICMDSAFFR 


120 


IMCATAEAQS 


180 


KLQCD^QDPI 


240 


SPIGEIQFLS 


300 


AHVNTT8APP 


360 


DMLAPliAQRL 


420 


QVSLETQAPE 


480 


YVISSSVAKL 


540 


RRIWETICTC 


600 


EK1RRDYPSK 


660 


LEAFHMiUUi 


720 


PDDFCWINNH 


780 


SlAGl»TFEiIiG 


B40 


LCCGKLFXAE 


90 0 


SGNGNASTER 


960 


IEQM 





A130 DNA SEQUENCE

Gene name: G protein-coupled receptor 49 (GPR49) (HG38) (LGR5)
Unigene number: Hs. 285529
Probeset Accession #: AA460530
Nucleic Acid Accession #: NM_003667
Coding sequence: 201-2924 (underlined sequences correspond to start and stop codons)



I 

GTGG0QGCAA 
TCAGGAACGC 
OGCCCGGCGC 
GGCCCCCTAC 
TGCTGCTGCA 
GCCCCACJACA 
ACCTGGGGCT 
GTATGAACAA 
AGTTACGTCT 
ACAGTCTTAA 
TGCAGAATTT 
CCCCAAGCTG 
TAACAGAAAT 
CCCTGAACAA 
TTCTACATCT 
ACAGCCTAGA 
GGACACTCTC 
AGAAAGCATT 
AATTTGTTGG 
GTGCCICACA 
CTTTAACTGG 
TCCAAGTGCT 

AAAAGCttca 
tccaccagtt 
ACCCCAATGC 
TGTCGTCTTT 
ATGCCTTACA 
CTTATGCTTA 
AATGGAATAA 
TTCAGGCTCA 
AAGCCCTTCA 
TGCTTGATGG 
GTAATGCTTT 
TGTTAATTGG 
CTGGTSTGOA 



11 
I 

CCGGCACCTC 
GGCGTCTGGC 



TGCTTACTCT 
GGAAAGCTCC 
OCATGGCCGC 
CTTTGCCTTT 
CCCTTTGCTT 



TTCGGGCACC 
GCTGGCGACC 
CTGXCATTGC 
CTCGGAGCTG 
CATCAGTCAG 
TGCGGGAAAC 
AGTTCTTATG 
GGGAAGCCTT 
TTTCAGTGGC 
COCOGTCGAG 
AATACACCAC 
CCATAACAAT 
GACTTTAOAT 
CAACCTTAAA 
TGTAGGCAAC 

gaqatctgct 
aataactgaa 

AGCACAGATC 
AGATCTSTCT 
OAAAATTGAC 
GCTTAGCC'i'C 
ATTTTCCACT 
TCCTATAACT 
tteGCTTGATA 
CCAGTGCTGT 
AGGTGACAAC 
AGATGAACGT 
TTCAGTGCAG 
CTGGCTGATC 
GGTGACTTCA 
GGTCATCGCA 
TGCGTTCACT 
CCATGTCATT 
GGCAGCCCTG 
ATTTTCTAGC 
AGrrCCCCTG 



21 
1 

AGTCCCCGCC 
GCTGCAGAGG 
GTAGCAGTCC 
ATGGACACCT 
GGSGGCAGCT 
GAGCOCGACG 
CCTTCCAACC 
CTQCTOOOSA 
GCTCTGACAT 
CTGCAGAATA 
CAATCCCTOC 
CTGCATTCCC 
GCITTTAGAA 
ATACCAGACT 
AGAArCCACT 
TTAAATTACA 
GAACTAGGAT 
CCTTCTCTTA 
TTTCAACATT 
TTTCCTGATT 
TCATCTCTTC 
TACAACCTAT 
CTAAGACATA 
CGATCGCTGA 
TTGCCATCCC 
GGGTTACATG 
TCATCTGAAA 
GCATTTGGAG 
AGCAGTATGG 
GACCTTGAAQ 
TGTTCACCTT 
AGAATTGGAG 
ACAGTTTTCA 
GCAGTGAACA 
TTTGGCAGCT 
GGTTTTTTGT 
GAGCGTGGGT 
CTGAAAGTAA 
CTGOQTGGCA 



31 
I 

GOGCTTCTCC 
CCOGCTGAGT 
GGTGCTGCTC 
COCGGCTOGG 
CTCCCAGGTC 
GCAGGATGTT 
TCAGCGTCTT 
ATCCCCTGCC 
ACATTCCCAA 
ATCAGCTAA9 
GTCTGGATGC 
TGAGGCACCT 
GTTTATGGGC 
ATGCCTTTGG 
CCCTGGGAAA 
ATAACCITGA 
TTCATAGCAA 
TTACAATACA 
TACCTGAACT 
TAACTGGAAC 
CTCAAACC5GT 
TAGAAGATTT 
ATGAAATCEA 
ATTTGGCTTG 
TAATAAAGCTT 
GTTTAACTCA 
ACTTTCCAGA 



41 
I 

TCGCCGCCCA 
TGCAGAAGCC 
TCCGC0CGCG 
TGTGCTCCTG 
TGGTGTGTTG 
GCTCAGGGTG 
CACCTCCTAC 



51 
I 

CGCCGTGQGB 
CACGGAGOGG 



GGGAGCATTC 
ACACGTACCC 
TAACCACATC 
GTGGCTGGAT 



ACGACCTTCA 
ATTTCCTGCT 
CCCCAGGCCC 
TGTGGACCAT 
GATCCCCTCT 



TTGCADSACA 
CCATTTTTGC 
TCTCTGTGAA 
TCATTTTGCT 
GCAAGTATOG 
GCTACATGGT 
CCTCATGATG ACCATTGCCT ACACCAAGCT 



AAACCTCTCC 
GAAATGCTTT 
TGAATTCCCC 
CAATATCAGG 
TTTCTATGAC 
AAGAACACTG 
TGCAAACCTG 
CTGCAATCAG 
ACCCAGTTTT 
OTAAATTAAA 
GAACAAAATT 
GGACCTATCG 
CTTAAAATTA 
ACTCAAGGTT 
TGOCTAXAAG 
TAAGAAAGAT 
TGACTTTGAG 
CTTCAAAOOC 
AGCAGTTCTG 
GTACATTTCC 
AGTCICCAGT 
TGGTGCCTGG 
TrCAGAATCA 
ATATTCXGCA 
CTGTGCCCTG 



CGCTCTCAXX: 
CTACTGCAAT 



TCCTTGCCTG 
CTGAGGOGCr 
GACTGCTCGG 
CTAGACCTCA 
TTCCTGGAGG 
ACTGGCCTTT 
ACAGAAGCTC 
AGCTATGTGC 
GACAATGCGT 
ATGACCTTGG 
AGCTTGGTAG 
GATGGGCTCC 
ACTQC74ATTA 
TCGATACCTG 
AATCCCATCC 
ACTCTGAATG 
GAGAI5TCTQA 
TTACCTAATC 
TCAGTCTGOC 
GTTGACACTT 
GCTATTATXC 
TOCAACCTCC 
ACAGGAAATC 
ATAGAAATGC 
ATTTCTAATC 
GCTGGAATGT 
GAAGACCTGA 
TGTGAACAOC 
GCACTTACIT 
COCATTAAAC 
GCCGTGCTGG 
TGGGAGAATG 
TCTGTTTTCC 
AAATTTGAAA 
CTGGCCrTGA 
CTCTGCCTGC 
TTGCTCAATr 



60 
120 
180
240
300
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080
1140
1200
1260 
1320 
1380 
1440 
1500
1560
1620
1680
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280
2340 
2400 
2460 






GAGACCTGGA 
CCAACTGCMT 
CATTTATCAG 
GTCTCAATCC 
TGAGAAAGCA 
CTGA.TGATGT 
CCAGCATCAC 
AGAGCTGCCA 
AATGTTTTCA 
ATAAGAAGAG 



GAATATTTGG 
CCTAAACTGC 
TCCTGAAGTA 
CCTTCTCTAC 
AAOCTACGTC 
CGAAAAACAH 
TTATGACCTG 
TCTTTCCTCT 
AAGQTTGAGA 
CTGAGGTGAA 



GACTGCTCTA 
CCTGTGGCTT 
ATTAAGTTTA 
ATCTTGTTCA 
TGGACAAGAT 
TCCTGTGACT 
CCTCCCAGTT 
GTGGCATTTG 
ACCTGAAAAT 
ACTCGGTTTA 



TGGTAAAACA 



TCCTTCTGGT 
ATCCTCACTT 
CAAAACACCC 
CAACTCAAQC 
CCGTOCCATC 
TCCCATGTCT 
GTOAGATTGA 
AA 



CATTGCCCTG 
CTCCTCTTTA 
GGTAGTCCCA 
TAAGGASOAT 
AAGCTTGATG 
CTTOGTAACC 
ACCAG CTTAT 
CTAATTAATA 
GTATATCAGA 



TTGCTCTTCA 
ATAAACCTTA 
CTTCCTGCAT 
CTGGXGAGCC 
TCAATTAACT 
TTTACCAGCT 
CCAGTGACTG 
TGTGAAGGAA 
GCAGTAATTA 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 






A131 Protein sequence
Gene name: G protein-coupled receptor 49 (GPR49) (HG38) (LGR5)
Unigene number: Hs. 285529
Protein Accession #: NP_003658.1
Signal sequence: 1-22
Transmembrane domains: 557-579, 596-618, 641-663, 687-709, 724-746, 770-792, 803-825
Cellular localization: plasma membrane



1 
I 

MDTSRLGVTili 



liQNNQLHHVP 
AFRSTjSALQA 
LNYMttLDEFP 
FGHIxPELRTD 



lpsi,ikij>i.s 

AFGVCENAYSC 
CSPSPGPFKP 
AVNMLTGVSS 



STMGYMVAIjI 
FVAPLSP39L 
WTRBKHPSLK 
VAFVPCL 



11 
I 

SLPVLLQIiAT 
UJLSHHNISQ 
TEALQULiRSlj 
MTbAENKlHH 
TADtTLSNIiK 
TLNGASQITE 
SVCQKIjQKID 

rSNQWNKGDN 

AVLAGVDAPT 
XFETKAPPSS 
LLNSLCFLMM 



SINSDDVEKQ 



21 

I 

GGSSPRSGVL 
LLPNPiiPSLR 
QSLILLEtANHI 

ELGPHfiNKIR 
FPDIrTGTANL 
LRHNEIYBIK 
GLHGItTHUCL 
SSMDDLHKKD 
RIGVWTIAVL 
FGSFARKGAW 
UCVIIIiLCMi 
TIAYTKLYCN 
IKPIItLWVP 
6CDSTQALVT 



31 
I 

LRGCPTHCHC 
FliEEIiRLAGN 



BIPEKAFVGtf 
ESI/TLTGAQI 
VDTFQQIiIsSIj 
TGNHALQSLI 
AGMFQAQDER 
AI*TCMAI»VTS 
WKNGVGCHVI 
LAIYTMAAVFI, 
LDKGDLENIW 
IiPACIiNPXiIjY 
FTSSSITYDli 



41 

I 

EPEGRMLiLlRV 
ALTYIPKGAF 
LHSUtHLWIiD 
RIHSLGKKCF 
PSLITIHFYD 
SSLPGTVCNQ 



fiSENFPBbKV 
DLEDFLLDFE 
TVFRSPLYIS 
GFLSIFASES 
LGG3KYGASP 
DCBMVKHIAL 
XJ^FNPHFKED 
PPSSVPSPAY 



51 
I 

DCSDLQIfSEL 
TGLYSLKVLM 
DMALT5IFVQ 

MP1QFVGRSA 
LPHLQVLDLS 
AIIHPNAFST 
IEMPYAYQCC 
EDLKALHSVQ 
PIKLfclGVIA 
fiVFLLTliAAL 
LCI*PIiPFGKP 
LLTTNCILNC 
LVSLRKQTYV 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



A132 DNA SEQUENCE

Gene name: parathyroid hormone receptor 2
Unigene number: Hs. 159499
Probeset Accession #: U25128
Nucleic Acid Accession #: NM_005048
Coding sequence: 143-1795 (underlined sequences correspond to start and stop codons)



1 
I 

GGCCGGTGGC 
TGGGCCAGCC 
TCTTCCTACZA 
GCTAATGCTC 
TATAGAGGAG 
AGCTCAACTC 
GGCCAGAGGA 
CAACCATAAA 
CAGCTTAAAT 
CAGCATAGGA 
CATCTCTTTT 
TTGCACTAGO 
CATCTTTGTC 
AATAATGCAG 
TATCGGGTGC 
GATCCTGGTG 
CAAATACCTG 
ATGGGCTGTG 
CATCAAGTGG 
TCTGAATACG 
CACAAGGAAG 
AGTGGAT^EAC 
CCGCATGCAC 

CTGCTAcrac 

CTCGGTGGAC 
CACCGTGACG 
TAxCTCTGGC 
TG0CTATGTC 
CAAGGAAGAT 
G3AATCEA&C 
CATTTGTGGC 
ATACTCCTAT 
TTTTAGOCTC 



11 
1 



AAGTTGGCAA 
GC0GTTCOGG 
GGCAGCTGCC 
CAGATTOTCC 
CAGGAGGGAG 
ACAGTGGGGA 
GGAGTTGCTT 
AAAACATGGG 
AAGCAAJGAAT 
GGT TCC TTGG 
AACTATATCC 
AAAGACAGAG 
GKZQACCCAC 
AAGATTGCTG 
GAAGGTCTCT 
TGGGGCTTCA 
GCACGAQCAA 
ATTTATCAAG 
GTTAQAjGTTC 
CAATACAGGA 

atcgtgttcg 
tgtgagctct 

AATGGAGAGG 
TGGAAAAGGA 
CACAGCACCA 
AAAGCTGCCA 
TQQA3TAACT 
AGTGGGAGGC 
CCAGACACTG 
TGACTTTCAT 
GCTTGAGTTC 
CATGAATTQG 



21 
I 

CCACCCCAGC 
CriGGAAGCT 
GC ATSG OOGG 
TCCTGGGCAG 
TTQTGCTGAA 
AAGGTAATTG 
AAATATCQGC 
TCCGACACTG 
CCAATTATTC 
TCTTTGAACG 
CTGTGGCTAT 
ACATGCACTT 
TAGTCCATGC 
AAAATTCCAT 
TTGTGATGTT 
ACCTGCATAA 
TCTTGATAGG 

CACCGATCTT 
TAGCTACCAA 
AACTGGOCAA 
TATGCCTGCC 
TCTTCAACTC 
TTCAGGCAGA 
CACCQCCATQ 
GCAGCCAGTC 
AGATCGCCAG 
CAGAGCAGGA 
AQOQAQATGA 
AAGGATGCCA 
GGGCTGGTCC 
AAAGGCTGAA 
CTCCTGTAAA 



31 
I 

TGCGCGTOGT 
TCTCCCGGGC 
GCTGGGGGCG 
AGGCCAGCTG 



TTTOCCTGAA 
TQTTCCATGC 
TAACCCCAAT 
AGACTGCCTT 
CCTCTATGTA 
TCTCATCATT 
ATTTGTGTCT 
TCACATAGGA 
TGAGGCAACT 
TATTTACTTC 

TCTCATcrrr 

TGCGAGGTGC 
AGCAGCTATT 
AATCTGGGAG 
ATCGACAjCTG 
TCACTOCTTC 
CTTTCAGGGT 
GGTGAAGAAG 
TGGCAGGCGC 
ACAGGTGGCG 
CAGACAGGCT 
CTGCCTGCCA 
TATTCTAATG 
AGGAGAAACT 
AATGGCTGGT 
AATTCAGTTA 
TACTAAOGAC 



41 
I 

TACTGGCCAC 
TCTGGAGGAG 
TCGCTCCACG 
GATTCTGATG 
CAATGTGAAC 
TGGGATGGAC 
CCTCCTTATA 
GGAACATGGG 

ATGTATACCG 
GGTTACTTCA 
TTCATGCTGA 
GTAAAGGAGC 
TCTGTGGACA 
CrGGCTACAA 
GTGGCTTTCT 
CCAGCAGCAT 
TGGGAACTTA 
GGGCTGAATT 
ACCAATGCAG 
GTCCTGGTCC 
ACTGGGCTCG 
TTCTTTGTGT 
ATG1GGAGTC 
AGATGCGGCTT 
GCCAGCACAC 
QACAQCCACA 
CACTCTTTCC 

gagaagcxttt 

GAGGATGTTC 
TGTGTGAGAG 
AGGTGTTACT 
ATGAAAATGC 




TCTGGGGTTG 
GCAOCATTAC 
TCAACATCAC 
TCATTXGTTG 
TTTATGACTT 
ATTTTATGCA 
AG CXZAOATAT 
TTGGCTACTC 
GACGATTGCA 
GAGCTACAAG 
TGGAGTCCCT 
AATCACAATA 
ATTATTATXG 
TTTCGGACAC 
TTGTTGCAGC 
GTGCTGGAGA 
TTATTCTGTT 
TTGGGCATGA 
TAGTCTTTGG 
GGTGGGAGAT 
CTATCATCTA 
GGTGGAATCX 
CAGTGCTCAC 
GCATGGTGCT 
TCACTTTAOC 
ACGAGGAGAC 
CCAGQCCTAT 
TCTGAATGGA 
GGCTTGGCIG 
TAATAATAGT 
AAGTGTCAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 






GGAGTAG7TT ATTACCTTCT ATTGGCATCA AGTTTTOCTC TAAATTAATG TATGGTATTT 2040 

□ CTCTGTG AT TGTTCATTTT TTTCTGCTAC TTTTGQGTAG AAAAAAGATT CAATTGCTTQ 2100 

GCTGTAGCTT TCTCTCATAT ATATCACCCT AAATATAATG AAGATCTTTT AGTGTGTATC 2160

ATTTTCCTTT TAGAAACTAG TATTCTCTTA TTTCTTACTT TAATGTACTT CTATCACTGC 2220 

ATTTATTTTG CCTGTGCATA GGAGCAATTA GGATCTAAAA AAATATATGG GAAGATAAAA 2280

GATCTAAGAA CAAGTACTTG CTGGAAAATT AGTTGGCTGG ACATTGATAA AATAATGCAT 2340 

TTATAACAAT TACATGTGTT TTTGGGAACA AGGAAAATTT CTCAAAAAAG AATATTTCAC 2400 

ACATGCCTTC TTTTGAATGG CCTCTTTGTG ACCAGCCAGA CCTCAGGTCT TCACTCTTTC 2460 

TTCTTTGTAA ACCATGTCAT GTGGAAAGAT TTCCTCAGTT AGTGAGCTTG TGTCTGCAAA 2520

TTGATTTTGT TTGTAATGTA TTTTGATAGC AAATCATGCT GCATCTATAT CTTTTTCTTG 2580

TTTGAGCTGT TACTACATTG TACATGGCAT GTGGGATCAA TTAAAAATTT GTTTTAAAAA 2640 
T 



A133 PROTEIN SEQUENCE
Gene name: parathyroid hormone receptor 2
Unigene number: Hs. 159499
Protein Accession #: NP_005039.1
Signal sequence: 1-25
Pfam domain: 7tm_2 [141-420]
Transmembrane domains: 177-197, 228-250, 253-275, 280-302, 320-342, 363-385, 398-419
Cellular Localization: plasma membrane






i 
J 

MACJLGASLKV 
GNCFPSWDGb 
NYSDCLRFLQ 
HHLFVSFMDa 
VMPIYFLMH 
LADARCWELS 
IJUCSTLVLVI, 
QAfeVKKHWSR 
lASRQPDSHX 
OOQGBTHDVIj 



11 
I 

WGWIiMXiGSCI* 
ICWPRGTVGK. 
PDXSIGKQEF 
ATSIFVKDKV 
YYWILVEGLY 
AGDIKWIYQA 
VFGVHYIVFV 
WNLSVDWKRT 
TLPGYVWSNS 



21 

I 



ISAVPCPPY1 
FBRLYVMYTV 
VHAHIGVKEIi 
IiHNIalFVAFF 
PILAAIGLNF 



31 
1 

TITtEEQIVL 
YETFJJHKGVAF 
GYSISFGSLA 
ESIiIMQBDPQ 

SDTinrtwaFT 
ILFLNTVRVX* 



PPCGSKRCGS VLTTVTHSTS 
EQUCLKHSFH EfiTKED£GRQ 



41 

1 

VLKAKVQCELi 
RHCWPNGTWD 
VAILIIGYFH 
NSIRATSVUK 
IiIGWGFPAAF 
ATK1WETNAV 
FNSPOGPFVS 
SQSQVAASTR 
GTJDILMSKP& 



SI 

I 

XIITAQLQXX3E 
FMHSIiNKTWA 
RLHCTRHYTH 
SQYIGCKIAV 
VAAWAVARAT 
GHDTRKOYRK 
I1YCYCNGEV 
MVLISGKAAK 
RPMESJJPDTE 



60 
120 

180

240 
300 
360 
420 
480 
540 



A134 DNA sequence
Gene name: ESTs
Unigene number: Hs. 128899
Probeset Accession #: AA963251
Nucleic Acid Accession #: AA983251
Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons)



1 
I 

ATQC TQTCTQ 
GGAAAGAAAC 
GACCGGGAGA 
GGGQAGGCGG 
CCGCGGCCGC 
CGCCTGCGTC 
TOCGGAC8AC 
ACTCATAAGG 
GCTCCTGGAC 
CCGCGCGQAA 
CTTCTCGGAG 
TGTGGGGCGC 
TGCTGCTGGC 
TGGCTGGACG 
GGCGACGCCA 
GCGCGCCTGG 
CGGGOGGACA 
GGAGACGGCG 
TCCCCGAAAG 
CGCGGATTCC 
AT CTACGTG C 
CTGGTGGCAG 
GCCCCAGGGG 
CGGGGGTCGT 
GGGGCCCGGG 
AACAACGTGT 
CAGATTGTGC 
CACGACTCTG 
AGGCAGATTC 
ACTGTATAAC 
GATTCTOSAG 
GCTTCATTTG 
CATCCAGGGT 
TTGCTGATGG 
AAGAATTTTA 
TTGTTGTTGA 



11 
1 



GAT 

TTCCGTGGGA 
GGOAGAGCCG 
AGAAGGGGAA 
CGCCAGCTGa 
CTGGACGTTC 
AGDCCCGGGG 
CAGTCCCTAA 
CTAGGGCCCG 
AGCGCOGCGG 
AOCGTCCTGC 
TCGCCGCTCG 

CGCAGGGCGT 
CCATCTGCTG 



AAGACGGGCC 
AGGGTGCGCC 
GAAGGCAGCT 
CATCTTCTCC 
CGTTCCTCAT 



GTAACCGCTT 
CCTCACGCCA 
CGCCCCCAAC 
ATGTCAACAT 
CACATCAAGG 
TGCCCATGAC 
AGTCCCCCTT 
CGAGAGTCAC 
GTGGAAGTCC 
CCCCCAGACT 
ATCATTGATT 
GTGTATAACA 
TAAACTGATA 
CTGCACAGGA 
TTTTTTTTAA 



21 
I 

GAGTCCCAGT 
GGCTTCCATC 
GCOGGAGGCT 
CCGGGGCGAG 
GCAOOCTCCC 
CGGGGGGAGG 
GCCTTCTGAC 
GGGGACGGGG 
GCGTCGTCGC 
GACAGTCAGT 



TGGTGCGATC 
AGCCTCCCAC 



GTGGCTGGGA 



CTGGCGCATC 
CGGCAGCTGC 
CTGCGACAAT 
CCGACGGCTC 
CCCACCCGTG 
CCTCAGGGCT 



TGTTGGCTCC 
CAGATGTCTC 
GATGGAGACC 
GTCCAGCACA 
AAGGTCACAG 
GCCCACGAAT 
GCAGTATCTG 
AGCTGTGCCA 
CCCTCACACC 
TGGTGGGTTC 
GCACATGTCG 
GTATGAAAAC 
TAT GATGGAA 
AATGCTTGAG 
AATTAAGGAT 
TCAAAATOCC 
TCAGACAGGG 
GCAACTTCAG 
TTACAGOTGC 



31 
I 

ACCCAQCACA 
GGTGCGCACA 
GCCQGGCTCC 
CCGCCCGCCT 
GGGACTGCGG 

TGCATCCCGA 
CCAGCGGCTG 
CTCCTGGGCG 
GACGAGGCCC 
GACGCGGTGT 
CATOCTGGAA 
CGAGGGCCCA 
GGCTTCCAGT 
GOGTTGCGCT 
GACCGCCAQC 
GGCAGGGCTT 
AGGGCCTGGC 

CCCTCTCOCC 
GTGTTTGTCG 
CGGCCTAAGC 
ATCCCCATGA 
GCTGCCAGTT 
ACCAACTGTT 
TTCTCTGTGG 
CATCCCCCAT 

cctttCatgg 

AACAGTGAAC 
CTTTACTGAA 
GTGGTATTTA 
ATCTCCGAAT 
AAOOGGCCTC 
TCCGAAGTGC 
TTTTATTATG 
TGTTATCTCC 
TCTTGCTCTG 
CCTCCTGOAT 
CTGCCOCCAT 



41 
) 



CCTGCOGAGG 
TOTGGGACCG 
GGATCCGCGC 
CTGGGGGOGC 
CAGTGAAACC 
GATTTCCATC 
AGGACGGGGA 
TCGCGGCAGA 
GGGGGTCGCC 
CCGCGOCCAG 
CGCCGCXTOG 
GCGGCGAGTA 
GTCCCGAGCG 
ACTGCTGCTC 



51 
) 

CACTCCCGGA 
GCGAGGCAGC 
CGCtGCaGCC 
CCAGCAGCAG 
GCAGGACCCT 
TCCAGAGGCT 



TGGCTTAGGA 
GGGGAGTGGO 
GGGGCCACGA 
GGTGGTGCCA 
CTCCTGCAQC 
CTGCCAOGGC 
CTTOGACGGC 
CAGCGCCGAG 



CATGTCTTAG 
AGCGGTGCTC 
TGCTGCCCCG 
TGCAGCGGCC 
OCTTTATCAT 
AGGATCCCCA 
TCCCCAGTGC 
CCAGCTCCAG 
GCTTGCCGGA 



ACGTGGGGTA 
ACGGCCTGCA 
AGAAGATGTA 
GGGAGACGAA 
TGGCACGATT 



AGCTGGAGAT 
CCTTGAGATA 
TTGTTATTAT 
CTTTTACTGG 
TTGCCCAGGC 



GGGTACCCAA 
CCCTGAAGGC 
TGCCAGACGC 
CGCCTTGCGC 
CTTGGGGTCC 
GCAGAGC5CGA 
CAGCACCTCC 
CGGCAACTCC 
AGGGACCATG 
GCAGGCCACC 
CACGGTGCAG 
GCCTGGCTAC 
CCCAGCGGTG 
GGCAGGGGTG 
CCTTTGGATG 
GGATATGTTT 
GACTGTGATG 
TGGTTGACGA 
TATTTCTTTT 
GACTTTTTTT 
TGGAGTOCAG 
CTCCTGCCXC 



GGCTAATTTT 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920
1980 
2040 
2100 
2160 
2220 
2280 
2340 






TGTAGAGATG 
TCnK3CCTQTC 
TGAGCCTTTT 
ATTCTAAAAG 
GGCACACCTT 
GGGCCTATTT 
TTAAAAAGAG 
TGGGATGATT 
TTGATTGTAT 
TATATTGAAA 
TGGTTATGGT 
G&GTGCAAGT 
AAAAATTTTC 
TAGCTCCTCA 
GCAGATCATA 
GATT GT T AAG 
ATTTCCTCTC 
CAAATTCAAG 
ATGTGCAAGT 
GTATGGTATG 
TGTTTCATGA 
TTTTTTAATA 
TGAAAAAAAA 



GGGTTTCACC 
TCAGCCTCCC 
TTTTTTTCTA 
QAAACCTGTT 
AATTTCATTG 
TGTGCTTTTT 
ACTGAAATAA 
TTAATCTTGA 
CTTAATTTTC 
TCATAAACTA 



AATTACTATA 
TTGTCTTAGT 
GCCATAACCT 
AGGCTTTTTA 
AGAAAAGCTT 
TCTTTCAGGC 
TGAATTTATT 
ACATTTCCAA 
GTCTTCTACA 
CTGAATAATG 
ACCAAAGGCA 
AAAAAAAAAA 



ATGTTGGCTG 
AAAGTGCTGG 
ATGCATCCAA 
TGAACXCTGT 
TAAAAAGATA 
TACCTTATGT 
ATTGTATAGT 
AGTAGTAGGT 
TGGCTTTAAG 
TCACCCGCTG 
CTTCTGTTTG 
CTTGTAAATG 
TATAAAAATT 
GAGACTTGGG 
TACTCTTGTT 
TTCAACGAAQ 
CTCAACAGGC 
TGTGTGTTCT 
CTGCTAGCAC 
CATTTATGTC 
TAAAACCRGT 
GGGGAAAATC 
AAAAAAAAAA 



GGCTGGTCTC 
GATTACAGGC 
GGTTAAQGGG 
GAGATCAGTC 
TATATATTTT 
AGAGATCTTA 
TACIT AACTA 
GGTATAGTCA 
GTGACATCTG 
CTTCTCTGAG 
GTTTTCAGAG 
AAGATCAGTA 
CAAAGAAATG 
ATOAAATTTA 
ATCAAAATGG 
GATTGCCTTT 
ACTGTATTCA 
TTACTTATAT 
AACCAGTATT 
TAT AG AT AT C! 
GTTGGCAATT 
ATTTTACTTA 



ACTCTCCTGA 
GTGAGOCACC 
AAGACGCAAA 
ATCAGTCTCA 
GTCTATTTTT 
TTACAAAGTG 
ATGAAGACAT 
TAAAACCATT 
AGAGGTAATG 
TTACTTTTAA 
CCCCATGTCT 
TTTCTGCCTA 
TGTTACAAAG 
AACCAGATAC 
CTTATTTTTC 
CTTCTCCCAC 
TTGCCAATGT 
AAAAAAAGAT 
TTGTAATTAA 
TATCGATCAT 
GGTATCATCA 
TTAATAAATA 



CCTCAAGCAA 
GCCCCCAGCC 
TAACAGGACT 
GTATTCCACA 
GTGCTTTTGG 
ATTTTCTACA 
TTCAGAACTC 
CATCCCCTTC 
CATTCTTTTT 
TTTTGCCTTG 
ATATAGTCCT 
GATCTQATAA 
ATACTTAGTA 
GATTTACTTT 
AGGCACTAAG 
ACTGTTCTTG 
TCCAAATTAT 
AACTTTAAGG 
ACAAATCGCT 
CTTTCTATTC 
ATGATACTCA 
TTTTATGATG 



2400
2460
2520
2580
2640
2700
2760
2820
2880
2940
3000
3060
3120
3180
3240
3300
3360
3420
3480
3540
3600
3660

A135 Protein sequence
Gene name: ESTs
Unigene number: Hs.120899
Protein Accession #: none found
Signal sequence: 1-11
Transmembrane domains: 402-424
Cellular Localization: not determined






1 11 21 31 41 

1111 
M&SGFXMSPS TQHHAQYTPG GKKLPWEASI 
GEASKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP 
3GRQPEGP9D CZPRFPSASA THKAVPKGTG PPAEDGDGUG 
PRGKKRGTVS DEARGSPGPR MiGDRPALgG DAliSAPRWP 
CCWIrRCWRRG RGPSGEYCHG WLDAQGVWRl GFQCPERFDG 
ABLDQGGCDN DRQQG AGEPG RADKDGPRRI> GRA5CLRGTQ 
SPKGRQI^RA FPGLTjPRASR RGFPSSPRGG PEPLQRPALP 
liVAAjCCCRCL RPKQDPQQfiR APGGNRLMET IPMIPSASTS 
GARAPPTHSQ TNCCLPEGTM NNVYVKMPTN FfiVLNCQQAT 
HDSVPMTAVP PFMDGLQPGY RQIQBPFPHT WSEQKKYPAV 



51 

I 1 
DRERESHPEA AGLLWDRAAA 
ELRPGRSRGR VRLPVKFPEA 
APQPRARRRR LLGVAAEGSG 
CGALAARPSP HPGTPldasCS 
GDATICCGSC ALRYCCSSAE 
GDGEGAPPPV RAWQRCSPBG 
IYVPPJVIVGS VFVAPII^GS 



QIVPHOGCHT^ HPPYVGYTVQ 
TV 



60 
120 
180 
240 
300 
360 
420 
480
540 



A136 DNA SEQUENCE

Gene name: selectin E (endothelial adhesion molecule 1)
Unigene number: Hs.89546
Probeset Accession #: M24736
Nucleic Acid Accession #: NM_000450
Coding sequence: 1-1833 (underlined sequences correspond to start and stop codons)



1 11 21 31 

1111 
ATGA TTGCTT CACAGTTTCT CTCAGCTCTC ACTTTGGTGC 
GCCTGGTCTT ACAACACCTC CACGGAAGCT ATGACTTATG 
CAGCAAAGGT ACACACACCT GGTTGCAATT CAAAACAAAG 
TCCATATTGA GCTATTCACC AAGTTATTAC TGGATTGGAA 
TGGGTCTGGG TAGGAACCCA GAAACCTCTG ACAGAAGAAG 
QAAOCCAACA AT AGGCAAAA AGATGAGGAC TGCGTGGAGA 

gatgtgggca tgtggaatga tgagaggtqc AGCAAGAAGA 
gctgcctgta ccaatacatc ctgcagtggc cacggtgaat 

TACACTTGCA AGTGTGACCC TGGCTTCAGT GGACTCAAGT 
acabccctgg AATCOCCTGA GCATGGAAGC CTGGTTTGCA 
AGCTACAATT CTTCCTGCTC TATCAGCTGT 
ACCATGCAGT GTATGTCCTC TGGAGAATGG 
GAGTGTGATG CTGTGACAAA TCCAGCCAAT 
AGCTTCCCAT GGAACACAAC CTGTACATTT 
GCCCAGAGCC TTCAGTGTAC CTCATCTGGG 
GCTGTGACAT GCAGGGCGGT CCGCCAGOCT 
CCTGCTGGAG AGTTCACCTT CAAATCATCG 
TTGCAGGGAC CAGCCCAGGT TGAATGCACC 
CTTTGTGAAG CTTTCCAGTG CACAGCCTTG 
CTTCCTAGTG CTTCTGGCAG tttccgttat 
GGTTTTGTGT TGAAGGGATC CAAAAGGCTC 
GAGAAGCCCA CATGTGAAGC TGTGAGATGC 

gtgaggtgtg ctcattcccc tattggagaa 

TGTOAQQAGG GATTTGAATT ATATGGATCA 
TGGACAGAAG AGGTTCCTTC CTGCCAAQTG 
AA0A.TCAACA TGAGCTGCAG TGGGGAGCCC 
CCTGAAGGAT GGAOGCTCAA TGGCTCTGCA 
TCTGGCCTGC TACCTACCTG TGAAGCTCCC 
CTTTCTGCTG CTGGACTCTC CCTCCTGACA TTAGCACCAT 



AGTGCTCCTA 
GGGTTCGTGG 
GACTGTGAAG 
AATTGGGACA 
CAGAATGGCT 
TGCAACTTCA 
ACTCAAGGGC 
TCCAACOCG0 
GGGTCCAGCT 
CAATGTGGCC 
GATGCTGTCC 
TTCACCTACA 
ACTCAACTTG 
GTAAAATQTT 
GTGTTTGGCA 
GCTCGGACAT 



41 
I 

TTCTCATTAA 
ATGAGGCCAG 
AAGAGATTGA 
TCAGAAAAGT 
CCAAGAACTG 
TCTACATCAA 
AGCTTGCCCT 
GTGTAGAGAC 
GTGAGCAAAT 
GTCACCCACT 
ACCTGCCAAG 
TTCCABCCTG 
AATGTTTCCA 
AAGGATTTGA 
ACGAGAAGCC 



51 
\ 



TGCTTATTGT 
GTAGCTAAAC 
CAACAATGTG 
GGCTCCAGGT 
GAGAGAAAAA 
ATGCTACACA 
CAT CAATAAT 
TGTGAACTGT 
GGGAAACTTC 



CCTGTGAGGA 
AiGTGGACACA 
AGCGAGGCTA 
GTGAGTTCTC 
CCACAGGGGA 
AOCAGCCCCC 

agtcctcttg 
agtgcacatc 
caagcctggc 

CTGTGTGCAA 
GTGGAQCCAC 
ACATTCCCTT 
TTCTCCl'CTG 



CAATGTGGTT 
AAACCCTGGA 
ACTAATGGGA 
AACGTGTAAA 
CAGCCATXCC 
AGGCTTCATQ 
GCAAATCCCA 
CATGAATTGT 
CTGTGAGCAG 
GTGGGACAAC 
GAAGGGTTTG 
TGCCTTCAGC 
TCAGGGACAA 
AGTTOCGGGA 
GTTCGCCTGT 



GGTAGCTGGA 
GCTTCGGAAA 



60 
120 
180 
240 
300
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260
1320 
1380 
1440 
1500 
1560
1620 
1680 
1740 






TGCTTACGGA AAQCAAAGAA ATTTQTTCGT GCCAGCACCT GCCAAAGCCT TGAATCAGAC 1800 
GOAAGCTACC AAAAGCCTTC TTACATCCTT TAA 
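
The "Coding sequence" coordinates in these records (for example 1-1833 for A136) appear to be 1-based and inclusive, which is consistent with the 610-residue protein listed for A137 plus one stop codon. A minimal sketch of how such a range could be cut out of a sequence and checked for start and stop codons follows. The function names and the toy sequence are hypothetical and are not part of the listing; the stop-codon set is the standard genetic code.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def extract_cds(sequence: str, start: int, end: int) -> str:
    """Return the coding region for 1-based, inclusive coordinates.

    Whitespace between the 10-base blocks of the listing is ignored.
    """
    cleaned = "".join(sequence.split()).upper()
    if not 1 <= start <= end <= len(cleaned):
        raise ValueError("coordinates fall outside the sequence")
    return cleaned[start - 1:end]


def looks_like_orf(cds: str) -> bool:
    """Check that the region is a whole number of codons, ATG to stop."""
    return len(cds) % 3 == 0 and cds[:3] == "ATG" and cds[-3:] in STOP_CODONS


# Toy example (not a sequence from the listing): coding region at 3-11.
toy = "CCATGGCTTA AGG"
cds = extract_cds(toy, 3, 11)
print(cds, looks_like_orf(cds))
```

Because the listed sequences carry OCR damage, a check like this would mainly serve to flag records whose coordinates no longer line up with an ATG and a stop codon.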

A137 Protein sequence:

Gene name: Selectin E (endothelial adhesion molecule 1)
Unigene number: Hs.89546
Probeset Accession #: M24736
Protein Accession #: NP_000441
Signal sequence: 1-22
Transmembrane domains: 555-573
C-lectin domain: 23-135
Cellular Localization: plasma membrane

1 11 21 31 41 51 

I I I 1 1 1 

MIASQFIiSAIi TLVLLIKESG AWSYMT9TEA MTYDBASAYC QORYTHLVAl QJfKEEIEYLN 60 
SILSYBPSYY WIGIKJCVNWV NVWVGTQXPL TKEAKNWAPG EPNNRQKDED CVEIY1KREK 120 
DVCMWNDKRC SKKKLALCYT AACTNTSCSG HGECVETINN YTCKCDPSPS GLKCEQIVNC 180 
TALBSPEHGS LVCSHPLGNF SYNSSOfilSC DRGYLPSSME TMQOISSGEW SAPIPACNW 240 
ECDXVTHPAN GFVECFQNPG SFPWHTTCTF DCEEGFELMG AQSLQCTSSG 2JWDNHKPTCK 300 
AVTCRAVRQP QNGSVRCBHS PAGKFTPKSS CNFTCEEGFM LQGPAQVECT TQGQWTQQIP 360 
VCEAFQCTAL SNPERCJYMNC LPSASG9PRY BSSCEFSCEQ GFVLKGSKRL QCGPIGEWDN 420 
BKPTCEAVRC DAVHQPPKGL VRCAHSPIGE FTYKSSCAFS CEEGFELYGS TQLECTSQOQ 480
WTEEVPSCQV VXCSSLAVF3 XINHSCSGEP VFOTVCKFAC PEGWTLNGSA ARTCGATGKW 540
SGLIiPTCBAP TESNIPLVAG LSAAGLSLLT LAPFLLKIiRK d»RKRKKFVP ASSCQSLESD 600 
GSYQKPfiYIL 
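
The protein records annotate features as residue ranges (for A137: signal sequence 1-22, C-lectin domain 23-135, transmembrane domain 555-573). The sketch below shows one way such ranges could be turned into lengths and sub-sequences, assuming the ranges are 1-based and inclusive. The helper names and the toy protein are hypothetical; only the three A137 ranges come from the record above.

```python
from typing import Dict, Tuple

Range = Tuple[int, int]


def feature_lengths(features: Dict[str, Range]) -> Dict[str, int]:
    """Length of each feature for 1-based, inclusive ranges."""
    return {name: end - start + 1 for name, (start, end) in features.items()}


def slice_features(protein: str, features: Dict[str, Range]) -> Dict[str, str]:
    """Cut each annotated range out of a protein sequence."""
    cleaned = "".join(protein.split())
    pieces = {}
    for name, (start, end) in features.items():
        if not 1 <= start <= end <= len(cleaned):
            raise ValueError(f"{name}: range falls outside the sequence")
        pieces[name] = cleaned[start - 1:end]
    return pieces


# Ranges as given in the A137 record.
a137_features = {
    "signal sequence": (1, 22),
    "C-lectin domain": (23, 135),
    "transmembrane domain": (555, 573),
}
print(feature_lengths(a137_features))

# Toy protein (not from the listing) to show the slicing itself.
print(slice_features("MKTLLV AAGHW", {"toy signal": (1, 3), "toy domain": (4, 8)}))
```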

A138 DNA SEQUENCE:

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]
Unigene number: Hs.100686
Probeset Accession #: AA487468
Nucleic Acid Accession #: AA487468
Coding sequence: 55-555 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 

I 1 I I I I 

CGGCACCAAG AGCACTGGCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60 

CTTACACTCAX3 CTTTGGGTCT CTGOCTCTTA CTCGTCACAO TTTCTTOCAA OCTTGQCATT 120 

GCAATAAAAA AOGAAAAGAG GCCTCCTCAQ ACACTCTCAA GAGGATGGGG AGATGACATC 180 

ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240 

ATGGTTATTC ATCACCTOGA OGATTQTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300 

CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360 

GAAACCACTG ATAAGAATTT ATCACCTGAT GOGCAATATG TGCCTAGAAT CATGTTTGTA 420 

GACCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTQTACACA 480 

TATGAGCCTC GGGATTTACC CCTATXGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540 

CAOTCA3AGC TATAAGAGAT GATAGAAAAA AGCCTTCACT TCAAAGAAOT CAAATTTCAT 600 

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

T TACTATTTA OTTTTTTTAA TGTGTTTGCA ATAGT CTTAT TAAAA1AAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 

A139 Protein sequence:

Gene name: ESTs, Weakly similar to JE0350 Anterior gradient-2 [H. sapiens]
Unigene number: Hs.100686
Probeset Accession #: AA487468
Protein Accession #: none found
Signal sequence: 1-23
Transmembrane domains: none found
Cellular Localization: secreted

1 11 21 31 41 51

\ I I i 1 I 

MMLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEB GLFYAQKSKK 60 
PLMV1KHLED CQY5QALKKV FAQNEEIQBM AQNKFlMLNIi MHETTDKHLS PDOQYVPRIM 120 
FVDP83UTVRA DIAGRYSNRL YTYEPRDLPL LIENMKKALR LIQSEL 
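
Each sequence row in this listing holds up to six 10-residue blocks, with the ruler line (1 11 21 31 41 51) marking where each block starts and a running position counter (60, 120, ...) closing every full row. A rough sketch of how rows in that layout could be joined back into one sequence, while checking the counters against the residues actually read, is given below. The function name is hypothetical, and the rule that a trailing all-digit token is the counter is an assumption about the layout.

```python
from typing import List, Tuple


def join_rows(rows: List[str]) -> Tuple[str, List[Tuple[int, int]]]:
    """Join block-formatted rows and report counter mismatches.

    Returns the concatenated sequence and a list of
    (printed counter, residues read so far) pairs that disagree.
    """
    blocks = []
    mismatches = []
    total = 0
    for row in rows:
        tokens = row.split()
        counter = None
        # Assumption: a trailing all-digit token is the running counter.
        if tokens and tokens[-1].isdigit():
            counter = int(tokens.pop())
        block = "".join(tokens)
        blocks.append(block)
        total += len(block)
        if counter is not None and counter != total:
            mismatches.append((counter, total))
    return "".join(blocks), mismatches


# Toy rows (not from the listing): one full-width pair of blocks, one short row.
sequence, problems = join_rows(["AAAAAAAAAA CCCCCCCCCC 20", "GGGGG"])
print(len(sequence), problems)
```

A mismatch reported by such a check would point to a row where characters were lost or inserted during extraction, or where the counter itself was misread.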

A140 DNA SEQUENCE

Gene name: TMPRSS3a
Unigene number: Hs.298241
Probeset Accession #: AI538613
Nucleic Acid Accession #: AB038157
Coding sequence: 202-1566 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 

I I I I I I 

ACCGGGCACC GGACGGCTCG GOTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

QGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGQC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATOTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 






TCATTCCGAT CG CTT T T TGG CCTTQATQAT TTGAAAATAA GTCCTGTTGC ACCAOATQCA 300

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTOC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT AT T AGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCQCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAQCTGCT TCGTGGAAGA CCATGTGCTC OGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGOCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGlCATAG AAGGGGCTAC 840

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGOCCTG GCAGGOCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCOCCT GT GGATCAT C 960 

ACTGCTGCAC ACTOTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

GTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCOCACTTGG TGGAGAAGAT TGTCTAOCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCOGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAQ TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGC AGGTGACGCC 1260 

TOCCCTGTCC TGAACCACGC GGCCGTCCCT TTGATTTCCA ACAAGATCTQ CAACCACAGG 1320 

GACOTQTAOG QTGGCATCAT CTCCCCCTCC ATGCTCTGCG CGGGCTACCT GACGGGTGGC 1380 

GTGGACAGCT GCCAGGGGGA CAGCGGGGGG CCCCTGGTGT QTCAAGAGAG GAGGCTGTGG 1440 

AAGTTAGTGG GAGCGACCAG CTTTGGCATC GGCTGCGCAG AGGTGAACAA GCCTGGGGTG 1500 

TACACOCGTG TCACCTCCTT OCTGGACTGG ATCCACQAGC AGATGGAGAG AGACCTAAAA 1560 

ACCTGAAGAG GAAGGGGACA AGTAGCCACC TGAGTTCCTG AGGTGATGAA GACAGCCCQA 1620 

TCCTCCCCTG GACTOCCGTG TAGGAACCTG CACACGAGCA GACACCCTTG GAGCTCTGAG 1680 

TTCCGGCACG AGTAGCAGGC CCGAAAGAGG CACCCTTCCA TCTGATTCCA GCACAACCTT 1740 

CAAGCTGCTT TTTGTTTTTT GTTTTTTTGA GGTGGAGTCT CGCTCTGTTG CCCAGGCTGG 1800 

AGTGCAGTGG CGAAATCGCT GCTCACTGCA GCCTCCGCTT CCCTOGTTCa AGCGATTCTC 1860 

TTQCCTCAGC TTCCCCAGTA GCTGGGACCA CAGGTGCCCG CCACCACACC CAACTAATTT 1920

TTGTATTTTT AGTAGAGACA GGGTTTCAGC ATGTTGGCCA GGCT0CTCTC AAACCCCTGA 1980 

CCTCAAATGA TGTGCCTGCT TCAGCCTCCC ACAGTGCTGG GATTACAGGC ATGGGCCACC 2040

ACGCCTAGCC TCACGCTCCT TTCTGATCTT CACTAAGAAC AAAAGAAGCA GCAACTTGCA 2100 

AQGGCGGCCT TXCCCACTQQ TCCATCTGGT TTTCTCTOCA GGGGTCTTGC AAAATTCCTG 2160 

ACGAGATAAG CAGTTATGTG ACCTCACGTG CAAAGCCACC AACAGCCACT CAGAAAAGAC 2220 

GCACCAGCCC AGAAGTGCAG AACTGCAGTC ACTGCACGTT TTCATCTCTA GGGACCAGAA 2280 

CCAAACCCAC CCTTTCTACT TCCAAGACTT ATTTTCACAT GTGGGGAGGT TAATCTAGGA 2340 

ATG ACT CGTT TAAOGCCTAT TTTCATG AT T TCTTTGTAGC ATTTGGTGCT TGACGTATTA 2400 

TTGTCCTTTG ATTCCAAATA ATATGTTTCC TTCCCTCAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAAAAA 

A141 Protein sequence:

Gene name: TMPRSS3a
Unigene number: Hs.296241
Probeset Accession #: AI538613
Protein Accession #: BAB20077
Signal sequence: none found
Transmembrane domains: 43-65, 239-261
Tryp_SPc domain: 216-444
Cellular Localization: plasma membrane



1 11 21 31 41 51

I \ \ I I 1 

MGENDPPAVS APFSFRSLFG LDDLKISPVA PDADAVAAQI LStJJPUCFFP JIVIGIIALI 60 

LALAIGLGIH FDCSC3KYRCR SSP^CIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 

TAABWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNKRVSS LKOQFREEFV STT ITU .T.PDDK 180 

VTALBESVYV RBGCASGHW TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQ3YH 240 

LCGGSVITPL. WIITAAHCVY DLYLPK5WT1 QVGLVSLLDN PAtSHLVHtt VYHSIOKPKR 300 

IiGKDlALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGA ODAgPVLHHR 360 

AVPLISNKIC NHRDVYGGII SPSMLCAGYL TGGVDSCQGD SGGPLVCQER RLKKLVGATS 420 
FGIGCAEVNK. PGVYTRVTSP LDWXHEQMER DLKT 

A142 DNA SEQUENCE

Gene name: ESTs, Weakly similar to LITHOSTATHINE 1 BETA PRECURSOR [H. sapiens]
Unigene number: Hs.105484
Probeset Accession #: AA314779
Nucleic Acid Accession #: none found
Coding sequence: 103-579 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

1 I I 1 1 I 

CCAAACAGAT TTGCAGATCA AGGAGAACCC AGGAjSTTTCA AAGAAGCGCT AGTAAGGTCT 60 

CTGAGATCCT TGCACTAGCT ACATCCTCAG GGTAGGAGGA AGATGGCTTC CAGAAGCATG 120 

CGGCTGCTCC TATTGCTGAG CTGCCTGGCC AAAACAGGAQ TCCTGGGTGA TATCATCA1G 180 

AGACCCAGCT GTGCTCCTGG ATGGTTTTAC CACAAGTOCA ATTGCTATG3 TTACTTCAGG 240 

AAGCTGAGGA ACTGGTCTGA TGCCGAGCTC GAGTGTCAGT CTTACGGAAA CGGAGCCCAC 300 

CTGGCATCTA TCCTGAGTTT AAAGGAAGCC AGCACCATAG CAGAGTACAT AAGTOGCTAT 360 

CAGAGAAGCC AGCCGATATG OATTGOCCTG CACGACCCAC AGAAGAGGGA GCAGTGGCAG 420 

TGGATTGATG GGGCCATGTA TCTGTACAGA TCCTGGTCTG GCAAGTCCAT GGGTGGGAAC 480 

AAGCACTGTG CTGAGATGAQ CTCCAATAAC AACTTTTTAA CTTGGAGCAG CAACGAATGC 540 

AACAAGCGCC AACACTTCCT GTGCAAGTAC CGAOC ATAGA QCAAGAATCA AGATTCTGCT 600 

AACTCCTGCA CCAGCCCCGT CCTCTTOCTT TCTGCTAGCC TGGCTAAATC TGCTCATTAT 660 

TTCAGAGGGG AAACCTAGCA AACTAAGAGT GATAAGGGCC CTACTACACT GGCTTTTTTA 720 




GGCTTAGAGA CASAAACTTT AQCATTGGGC CCAGTAGTGG CTTCTAGCTC TAAATGTTTG 780 

CCCCGCCATC CCTTTCCACA GTATCCTTCT TCGCTCCTCC CCTGTCTCTG GCTGTCTCGA 840

GCAGTCTAGA AGAGTGCATC TCCAGCCTAT GAAACAGCTG GGTCTTTGGC CATAAGAAGT 900 

AAAGATTTGA AGACAGAAGG AAGAAACTCA GGAGTAAGCT TCTAGACCCC TTCAGCTTCT 960

ACACCCTTCT GCXZCTCTCTC CATTGCCTGC ACOCCACCCC AGCCACTCAA CTCCTGCTTG 1020 

TTTTTCCTTT GGCCATAGGA AGGTTTACCA GTAGAATCCT TGCTAGGTTG ATGTGGGCCA 1080 
TACATTCCTT TAATAAACCA TTGTGTACAT AAGAQAAAAA AAAAAAAAAA AAAAAAAAAA 



A143 Protein sequence:

Gene name: ESTs, Weakly similar to LITHOSTATHINE 1 BETA PRECURSOR [H. sapiens]
Unigene number: Hs.105484
Probeset Accession #: AA314779
Protein Accession #: none found
Signal sequence: 1-22
Transmembrane domains: none found
C-type lectin domain: 47-156
Cellular Localization: secreted



1 11 21 31 41 51 

I I I I I i 

MASRSKRLLL LIzSCLAKTGV LGDIIHRPSC ABGWFYHK5CT CYGYFRKLRN WSDAELECQS 60
YONGAKLASI LSBKEAfiTIA EYISGYQRSQ PIWIGLHDPQ KRQQWQWJDG AMYLYRSWSG 120
KSMGGNKHCA EMSSNNNFL.T WSSNBCNKRQ HFLCKYRP



A144 DNA SEQUENCE

Gene name: Cadherin 3, P-cadherin (placental)
Unigene number: Hs.2877
Probeset Accession #: X63629
Nucleic Acid Accession #: X63629
Coding sequence: 54-2543 (start and stop codons are underlined)



1 11 21 31 41 51 

I I I I I I 

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCATOGGGC 60 

TOCCTDGTBG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 

CCTCOGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GOGGGAGGOG 180 

CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 

CAGCICTGTT TAGCACTGAT WVTGATOACT TCACTGTGCG GAATGGOGAG ACAGTOCAGG 300 

AAAGAAOOTC ACTGAAGGAA AGGAATOCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 

GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTOAAAAT GGCAAGGGTC 420 

CCTTCCCCCA GAGACTGAAT CM3CTCAAGT CTAATAAftfiA TAGAGACACC AAGATTTTCT 480

ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAQAAGG 540 

AGACAGGCTG GTTGTTGTK3 AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 600 

TCXTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 

XCATCQTQAC COACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 720 

GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTQACAGCC ACAGATGAGG 780 

ATGATGCCAT CTACACCTAC AAXGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840

AGGACCCACA GGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AOCGTCATCT 900 

CCAGTGGCCT GGAC0GGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 

TGGATGGGGA CGGCTCCACC ACCACGQCAG TGGCAGTAOT GGAGATCCTT GATGCCAATG 1020 

ACAATGCTCG CATGrTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAQTGG 1080 

GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1140 

GTGCCAOCTA CCTTATCATG GGOGGTGACG ACGGGGAGCA TTTTACCATC ACCACGCACC 1200 

CTGAGAGCAA CGAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 

AGCACACCCr GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 

CCACAGCCAC CATA0TGGTC CACGTOGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 

CCrCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 

CTGCAGAAQA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 

CAGGGTCGCT AGCCATGGAC CCAGACAGTG GGCAGOTCAC AGCTGTGGGC ACCCTCGACC 1560 

GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 

ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 

ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCQCC 1740 

ACGTQCTGAA CATCACGQAC AAGGACCTOT CTCCCCACAC CTCCCCXTTC CAGGCCCAGC 1800 

TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860

TCTTGTCCCT GAAGAAGTTC CTQAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 

ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1980 

ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CIGGGGGGXG 2040 

TCCXGGCTCT GCTGrTCCTC CTGCTGGTGC TG CTT TTGT T GGTGAGAAAG AAGCGGAAGA 2100 

TCAAGGAGCC CCTCCTACTC CGAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 

AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 

AGGCCAGGCC GGAGGTGGTT CTCOGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280

TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 

AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 

AGGGCAGCGG CTCCGACQCC GCGTCCCTGA GCTCCCTCAG CTCCTCCGCC TCCGACCAAG 2460 

ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAOACATGT 2520 

ACGGTGGCGQ GGAGGACGAC TAGGCGQCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 

CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTOG GAGCTIGTCA 2640 

GGAAGTGGCC GTAGCAACrr GGCGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 

TCCTTAGCCT TTCAGGATGG AGBRATOTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 

CACCTGOGCC AGGGTTGCCT CAQAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 

TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 

CTGGAATGGA ACCTTCTTAS GCCTCCTGGT GCAACTTAAT TTTXTTTTTT AATGCTATCT 2940 






